https://github.com/ryanfb/latinocr-lat
'lat' repository, forked from https://github.com/ryanfb/ancientgreekocr-grc. The final training process for lat.traineddata
https://github.com/ryanfb/latinocr-lat
Last synced: 4 months ago
JSON representation
'lat' repository, forked from https://github.com/ryanfb/ancientgreekocr-grc. The final training process for lat.traineddata
- Host: GitHub
- URL: https://github.com/ryanfb/latinocr-lat
- Owner: ryanfb
- License: apache-2.0
- Created: 2014-12-15T15:55:51.000Z (over 11 years ago)
- Default Branch: master
- Last Pushed: 2016-01-13T15:52:19.000Z (over 10 years ago)
- Last Synced: 2025-06-18T08:07:37.845Z (12 months ago)
- Language: Makefile
- Homepage: https://ryanfb.github.io/latinocr/
- Size: 5.13 MB
- Stars: 13
- Watchers: 4
- Forks: 3
- Open Issues: 4
-
Metadata Files:
- Readme: README
- License: LICENSE
Awesome Lists containing this project
README
Latin OCR Training for Tesseract
================================
Produces: lat.traineddata
You need wget, unzip and the Tesseract training tools to make this
training.
The following files have been automatically generated using the
tools in the lattraining git repository located at
https://github.com/ryanfb/latinocr-lattraining
- training_text.txt
- lat.word.txt
- lat.freq.txt
- lat.unicharambigs
You can see the exact process for generating them in the lattraining
Makefile.
The Latin.unicharset file has been copied from Tesseract's
tesseract-ocr.langdata git repository.