https://github.com/erl-ang/interactive-ocr
Implements a couple of heuristics that estimate OCR quality without relying on ground-truth data, with a focus on historical documents written in English.
- Host: GitHub
- URL: https://github.com/erl-ang/interactive-ocr
- Owner: erl-ang
- Created: 2022-03-24T11:47:24.000Z
- Default Branch: master
- Last Pushed: 2023-06-02T07:33:23.000Z
- Last Synced: 2025-03-21T08:48:29.890Z
- Topics: ground-truth, nlp, ocr-quality, optical-character-recognition, tesseract-ocr, word-error-rate
- Language: Python
- Size: 3.28 MB
- Stars: 3
- Watchers: 1
- Forks: 1
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# interactive-ocr
Cleaning this up is on my postgrad to-do list.