Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/omni-us/pagexml

Library in C++ and a python wrapper for dealing with Page XML files
https://github.com/omni-us/pagexml

annotation-processing docker-image document-representation pagexml python

Last synced: 2 months ago
JSON representation

Library in C++ and a python wrapper for dealing with Page XML files

Awesome Lists containing this project

README

        

# Introduction

Library in C++ and a python wrapper for dealing with Page XML files

[![CircleCI](https://circleci.com/gh/omni-us/pagexml.svg?style=svg)](https://circleci.com/gh/omni-us/pagexml)

# Requirements

Check [py-pagexml/README.rst](py-pagexml/README.rst) and/or [docker/Dockerfile_build](docker/Dockerfile_build), [docker/Dockerfile_runtime](docker/Dockerfile_runtime).

# Contents

- [lib](lib): Directory containing the C++ PageXML and TextFeatExtractor libraries.
- [py-pagexml](py-pagexml): Swig-based python wrapper for the PageXML library.
- [py-textfeat](py-textfeat): Swig-based python wrapper for the TextFeatExtractor library.

# Documentation

- [https://omni-us.github.io/pagexml/py-pagexml](https://omni-us.github.io/pagexml/py-pagexml): Online documentation for py-pagexml.
- [https://omni-us.github.io/pagexml/py-textfeat](https://omni-us.github.io/pagexml/py-textfeat): Online documentation for py-textfeat.