https://github.com/ryanfb/latinocr-lat

'lat' repository, forked from https://github.com/ryanfb/ancientgreekocr-grc. The final training process for lat.traineddata

Host: GitHub
URL: https://github.com/ryanfb/latinocr-lat
Owner: ryanfb
License: apache-2.0
Created: 2014-12-15T15:55:51.000Z (over 11 years ago)
Default Branch: master
Last Pushed: 2016-01-13T15:52:19.000Z (over 10 years ago)
Last Synced: 2025-06-18T08:07:37.845Z (12 months ago)
Language: Makefile
Homepage: https://ryanfb.github.io/latinocr/
Size: 5.13 MB
Stars: 13
Watchers: 4
Forks: 3
Open Issues: 4
Metadata Files:
- Readme: README
- License: LICENSE

README

Latin OCR Training for Tesseract
================================

Produces: lat.traineddata

You need wget, unzip, and the Tesseract training tools to build this
training data.
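
A quick way to confirm the prerequisites before building is sketched below. Only wget and unzip are named in this README; the Tesseract tool names in the list (text2image, unicharset_extractor, mftraining, cntraining, combine_tessdata) are an assumption about which training tools the Makefile calls, and missing_tools is a hypothetical helper, so adjust both to match your setup.

```shell
# Print each named command that is not found on PATH, one per line.
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
    done
}

# wget and unzip come from the README. The Tesseract tool names are an
# assumption about what the Makefile invokes; edit the list as needed.
missing=$(missing_tools wget unzip text2image unicharset_extractor \
    mftraining cntraining combine_tessdata)

if [ -n "$missing" ]; then
    # $missing is left unquoted on purpose so the names print on one line.
    echo "Missing tools:" $missing
else
    echo "All prerequisites found."
fi
```

The script only reports what is missing and does not install anything, so it is safe to run before attempting a build.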

The following files have been automatically generated using the
tools in the lattraining git repository located at
  https://github.com/ryanfb/latinocr-lattraining

- training_text.txt
- lat.word.txt
- lat.freq.txt
- lat.unicharambigs

You can see the exact process for generating them in the lattraining
Makefile.

The Latin.unicharset file has been copied from Tesseract's
tesseract-ocr.langdata git repository.
