https://github.com/roblib/pdf_ocr_search
Extract OCR text from existing PDFs, scan that text for occurrences of words from a provided list, and log the title of each PDF that contains a match.
- Host: GitHub
- URL: https://github.com/roblib/pdf_ocr_search
- Owner: roblib
- Created: 2018-05-07T13:01:50.000Z (about 7 years ago)
- Default Branch: master
- Last Pushed: 2018-05-07T19:15:20.000Z (about 7 years ago)
- Last Synced: 2025-02-14T19:43:26.220Z (3 months ago)
- Language: Shell
- Size: 11.9 MB
- Stars: 0
- Watchers: 7
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: readme.md
README
notes:
see ./batch.sh
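
The project description names three steps: extract the OCR text, scan it against a word list, and log the matching PDF. The sketch below illustrates one way those steps could fit together in shell. It is not the contents of `./batch.sh`, which is not reproduced here. The function names `log_if_match` and `scan_pdfs` are hypothetical, the word list is assumed to hold one word per line with no blank lines, the PDF's file name stands in for its title, and `pdftotext` from poppler-utils is assumed to be installed for reading the embedded text layer.

```shell
#!/usr/bin/env bash
# Hypothetical sketch of the extract / scan / log workflow.
# Not taken from the repository's batch.sh.

# Print the given title if the text file contains any listed word.
#   $1: file holding the extracted OCR text
#   $2: word list, one word per line (assumed to have no blank lines)
#   $3: title to log on a match
log_if_match() {
  local text_file=$1 word_list=$2 title=$3
  # -q quiet, -i ignore case, -w whole words only,
  # -F fixed strings (no regex), -f read patterns from a file
  if grep -qiwF -f "$word_list" "$text_file"; then
    printf '%s\n' "$title"
  fi
}

# Extract the text layer of every PDF in a directory and scan it.
#   $1: directory containing the PDFs
#   $2: word list, one word per line
# Assumes pdftotext (poppler-utils) is available on PATH.
scan_pdfs() {
  local pdf_dir=$1 word_list=$2 tmp pdf
  tmp=$(mktemp)
  for pdf in "$pdf_dir"/*.pdf; do
    [ -e "$pdf" ] || continue             # directory has no PDFs
    pdftotext "$pdf" "$tmp" || continue   # skip files that fail to convert
    # The file name (without .pdf) is used as the logged title.
    log_if_match "$tmp" "$word_list" "$(basename "$pdf" .pdf)"
  done
  rm -f "$tmp"
}
```

With these functions sourced, a call such as `scan_pdfs ./pdfs words.txt >> matches.log` would append one line per matching PDF; the directory, word list, and log file names here are placeholders.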