https://github.com/ferbcn/pytextractor

Extract text with OCR from images and pdf-image files
pyqt5 python tesseract tesseract-ocr


# Installation

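Install the Python dependencies with `pip install -r requirements.txt`.

Install the tesseract-ocr binaries. If the `tesseract` executable is not found afterwards, add its install directory to PATH.

Install the Tesseract language packages you need. If Tesseract cannot find them, you may also need to add the parent directory of `tessdata` to PATH.

On Windows, install Poppler for Windows and add its binaries folder to PATH. Most Linux distributions already include Poppler.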
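
Since several of the steps above depend on PATH being set correctly, a quick check can save some debugging. The following is a minimal sketch, not part of pytextractor itself: the helper name `missing_tools` is hypothetical, and it assumes the executables to look for are `tesseract` and `pdftoppm` (one of the Poppler command-line utilities).

```python
import shutil

# Assumed executable names, not taken from the project:
# "tesseract" for Tesseract OCR and "pdftoppm" for Poppler.
REQUIRED_TOOLS = ("tesseract", "pdftoppm")


def missing_tools(tools=REQUIRED_TOOLS, which=shutil.which):
    """Return the entries of `tools` that cannot be found on PATH."""
    return [tool for tool in tools if which(tool) is None]


if __name__ == "__main__":
    missing = missing_tools()
    if missing:
        print("Not found on PATH: " + ", ".join(missing))
    else:
        print("All external tools were found on PATH.")
```

If a tool is reported as missing, add the folder containing its executable to PATH and open a new terminal before running the check again.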