https://github.com/jpleorx/pdf-scan
Experimenting with iTextPdf, PdfBox, Tesseract
https://github.com/jpleorx/pdf-scan
itext itextpdf java ocr pdf pdf-files pdf-generation pdf-reader pdfbox pdfbox2 tess4j tesseract tesseract-ocr
Last synced: 2 months ago
JSON representation
Experimenting with iTextPdf, PdfBox, Tesseract
- Host: GitHub
- URL: https://github.com/jpleorx/pdf-scan
- Owner: JPLeoRX
- Created: 2019-11-15T23:57:39.000Z (over 5 years ago)
- Default Branch: master
- Last Pushed: 2022-06-30T20:21:29.000Z (almost 3 years ago)
- Last Synced: 2025-01-25T11:12:27.640Z (4 months ago)
- Topics: itext, itextpdf, java, ocr, pdf, pdf-files, pdf-generation, pdf-reader, pdfbox, pdfbox2, tess4j, tesseract, tesseract-ocr
- Language: Java
- Homepage:
- Size: 6.84 KB
- Stars: 1
- Watchers: 2
- Forks: 0
- Open Issues: 2
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
Experimenting with iTextPdf
Loading text from pdf files.
Building pdf by joining several images on one page.
Extracting text from PDF with Tesseract.