https://github.com/bertsky/ocrd_origami
OCR-D wrapper for poke1024/origami OLR+OCR
- Host: GitHub
- URL: https://github.com/bertsky/ocrd_origami
- Owner: bertsky
- Created: 2021-11-25T23:58:38.000Z (over 4 years ago)
- Default Branch: master
- Last Pushed: 2022-02-10T15:20:52.000Z (over 4 years ago)
- Last Synced: 2025-01-30T03:27:33.180Z (over 1 year ago)
- Topics: ocr-d
- Language: Python
- Homepage:
- Size: 6.84 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
# ocrd_origami
OCR-D wrapper for poke1024/origami OLR+OCR
* [Introduction](#introduction)
* [Installation](#installation)
* [Usage](#usage)
   * [OCR-D processor interface ocrd-origami-segment](#ocr-d-processor-interface-ocrd-origami-segment)
* [Testing](#testing)
## Introduction
This repository offers [OCR-D](https://ocr-d.de) compliant [workspace processors](https://ocr-d.de/en/spec/cli) for
[Origami](https://github.com/poke1024/origami), the document image processing suite for historical newspapers.
... WORK IN PROGRESS ...
## Installation
First install system dependencies:

    sudo make deps-ubuntu

(Besides Python>=3.7 you'll need at least `libffi-dev`, `libcgal-dev` and `git`, plus a recent `tesseract`.)
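
As a rough aid, the prerequisites named above can be checked before running the remaining steps. The following is only a sketch: the function name `missing_prerequisites` is hypothetical and not part of this repository, it only looks for executables on `PATH`, and it cannot tell whether `tesseract` is recent enough or whether the `-dev` library packages are installed.

```python
import shutil
import sys

# Executables named in the note above. Library packages such as libffi-dev
# and libcgal-dev cannot be detected via PATH and are not checked here.
REQUIRED_TOOLS = ("git", "tesseract")
MIN_PYTHON = (3, 7)


def missing_prerequisites(tools=REQUIRED_TOOLS, min_python=MIN_PYTHON,
                          which=shutil.which, version=None):
    """Return a list of problems found; an empty list means none were detected.

    `which` and `version` are injectable so the check can be tried out
    without touching the real system.
    """
    version = sys.version_info[:2] if version is None else tuple(version)
    problems = []
    if version < tuple(min_python):
        problems.append("Python %d.%d or newer is required" % tuple(min_python))
    for tool in tools:
        if which(tool) is None:
            problems.append("%s not found on PATH" % tool)
    return problems


if __name__ == "__main__":
    for problem in missing_prerequisites():
        print(problem)
```

An empty result only means that nothing was found missing by this limited check; `sudo make deps-ubuntu` remains the supported way to install the system dependencies.
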
Next, fetch the Origami submodule, if you have not already:

    make origami

This is equivalent to:

    git submodule update --init origami

Create and activate a [virtual environment](https://packaging.python.org/tutorials/installing-packages/#creating-virtual-environments) as usual.
To install Python dependencies:

    make deps

This is equivalent to:

    pip install -r requirements.txt
    pip install -r origami/requirements/pip.txt
    pip install -r origami/requirements/conda.txt

To install this module, do:

    make install

This is equivalent to:

    pip install .

## Usage
### [OCR-D processor](https://ocr-d.de/en/spec/cli) interface `ocrd-origami-segment`
The processor is meant to be used with [PAGE-XML](https://github.com/PRImA-Research-Lab/PAGE-XML) documents in an [OCR-D](https://ocr-d.de/en/about) annotation workflow.
```
... SHOW OCRD CLI HERE...
```
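
Until the actual CLI help is documented here, the general shape of a call can be sketched from the OCR-D CLI conventions, which define `-m` for the METS file, `-I` for the input file group, `-O` for the output file group and `-p` for parameters. This is an assumption-laden illustration, not the documented interface of this processor: the helper `build_command` is hypothetical, as are the file group names `OCR-D-IMG` and `OCR-D-SEG`, and since the project is a work in progress the processor may not accept this call yet.

```python
import json
import shlex


def build_command(executable, input_grp, output_grp, mets="mets.xml", parameters=None):
    """Assemble an argument list following the generic OCR-D CLI conventions.

    Hypothetical helper for illustration; it does not run anything.
    """
    argv = [executable, "-m", mets, "-I", input_grp, "-O", output_grp]
    if parameters:
        # -p takes a JSON string (or a path to a JSON file) in OCR-D processors.
        argv += ["-p", json.dumps(parameters, sort_keys=True)]
    return argv


# File group names below are assumptions; use the ones present in your METS file.
command = build_command("ocrd-origami-segment", "OCR-D-IMG", "OCR-D-SEG")

if __name__ == "__main__":
    # Print a shell-quoted command line that could be pasted into a terminal.
    print(" ".join(shlex.quote(arg) for arg in command))
```

The resulting list can be passed to `subprocess.run` from within an OCR-D workspace directory once the processor is installed.
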
## Testing
(not yet)