Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/cneud/alto-ocr-text
extract text from ALTO file
Last synced: about 1 month ago
- Host: GitHub
- URL: https://github.com/cneud/alto-ocr-text
- Owner: cneud
- Archived: true
- Created: 2015-08-31T14:52:05.000Z (over 9 years ago)
- Default Branch: master
- Last Pushed: 2023-09-26T20:48:36.000Z (about 1 year ago)
- Last Synced: 2024-08-04T13:06:51.108Z (5 months ago)
- Language: Python
- Homepage:
- Size: 4.88 KB
- Stars: 9
- Watchers: 1
- Forks: 6
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
## This is no longer supported. Please use https://github.com/cneud/alto-tools instead.
# alto-ocr-text
Extracts the text from an [ALTO](http://www.loc.gov/standards/alto/) file and writes it to `stdout`. Use it like this:
python alto_ocr_text.py
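
For readers unfamiliar with the format, the following is a minimal sketch of how text can be pulled out of an ALTO document. It is not the code of `alto_ocr_text.py`. It assumes the usual ALTO layout, in which each `TextLine` element contains `String` children whose `CONTENT` attribute holds one word. The function names `local_name` and `extract_text`, and the script name in the final comment, are hypothetical. Namespaces are ignored so that the same sketch works across ALTO schema versions, and hyphenation (`HYP`) elements are not handled.

```python
import sys
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip the '{namespace}' prefix that ElementTree puts on tag names."""
    return tag.rsplit("}", 1)[-1]


def extract_text(source):
    """Return the text of an ALTO document, one output line per TextLine.

    `source` may be a file path or an open file-like object.
    """
    root = ET.parse(source).getroot()
    lines = []
    for element in root.iter():
        if local_name(element.tag) != "TextLine":
            continue
        # Each String child carries one recognised word in CONTENT.
        words = [
            child.attrib.get("CONTENT", "")
            for child in element
            if local_name(child.tag) == "String"
        ]
        lines.append(" ".join(words))
    return "\n".join(lines)


if __name__ == "__main__" and len(sys.argv) > 1:
    # Hypothetical invocation: python alto_text_sketch.py page.xml
    print(extract_text(sys.argv[1]))
```

Joining words with a single space discards the original spacing and layout coordinates, which is usually acceptable when only the plain text is needed.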