{"id":13415644,"url":"https://github.com/omni-us/pagexml","last_synced_at":"2025-03-20T08:33:03.317Z","repository":{"id":57450287,"uuid":"145522858","full_name":"omni-us/pagexml","owner":"omni-us","description":"Library in C++ and a python wrapper for dealing with Page XML files","archived":false,"fork":false,"pushed_at":"2024-01-19T13:44:10.000Z","size":6955,"stargazers_count":13,"open_issues_count":0,"forks_count":2,"subscribers_count":6,"default_branch":"master","last_synced_at":"2024-04-27T05:21:45.586Z","etag":null,"topics":["annotation-processing","docker-image","document-representation","pagexml","python"],"latest_commit_sha":null,"homepage":"","language":"C++","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/omni-us.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE.md","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2018-08-21T07:16:55.000Z","updated_at":"2024-07-30T23:48:23.177Z","dependencies_parsed_at":"2024-01-07T21:13:58.469Z","dependency_job_id":"bbec0c3d-188c-4cd9-a55f-9db049ba740d","html_url":"https://github.com/omni-us/pagexml","commit_stats":{"total_commits":278,"total_committers":2,"mean_commits":139.0,"dds":0.003597122302158251,"last_synced_commit":"3edf6074240355cf8a01552c1bdc7c4dd19f420d"},"previous_names":[],"tags_count":22,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/omni-us%2Fpagexml","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/omni-us%2Fpagexml/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/omni-us%2Fpagexml/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/omni-us%2Fpagexml/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/omni-us","download_url":"https://codeload.github.com/omni-us/pagexml/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":244578221,"owners_count":20475420,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["annotation-processing","docker-image","document-representation","pagexml","python"],"created_at":"2024-07-30T21:00:51.028Z","updated_at":"2025-03-20T08:33:01.912Z","avatar_url":"https://github.com/omni-us.png","language":"C++","funding_links":[],"categories":["1. \u003ca name='Software'\u003e\u003c/a\u003eSoftware","Software"],"sub_categories":["1.3. \u003ca name='OCRfileformats'\u003e\u003c/a\u003eOCR file formats","OCR file formats"],"readme":"# Introduction\n\nLibrary in C++ and a python wrapper for dealing with Page XML files\n\n[![CircleCI](https://circleci.com/gh/omni-us/pagexml.svg?style=svg)](https://circleci.com/gh/omni-us/pagexml)\n\n# Requirements\n\nCheck [py-pagexml/README.rst](py-pagexml/README.rst) and/or [docker/Dockerfile_build](docker/Dockerfile_build), [docker/Dockerfile_runtime](docker/Dockerfile_runtime).\n\n# Contents \n\n- [lib](lib): Directory containing the C++ PageXML and TextFeatExtractor libraries.\n- [py-pagexml](py-pagexml): Swig-based python wrapper for the PageXML library.\n- [py-textfeat](py-textfeat): Swig-based python wrapper for the TextFeatExtractor library.\n\n# Documentation\n\n- [https://omni-us.github.io/pagexml/py-pagexml](https://omni-us.github.io/pagexml/py-pagexml): Online documentation for py-pagexml.\n- [https://omni-us.github.io/pagexml/py-textfeat](https://omni-us.github.io/pagexml/py-textfeat): Online documentation for py-textfeat.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fomni-us%2Fpagexml","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fomni-us%2Fpagexml","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fomni-us%2Fpagexml/lists"}