https://github.com/ahmedkhemiri95/PDFs-TextExtract
Multiple and Large PDF Documents Text Extraction.
- Host: GitHub
- URL: https://github.com/ahmedkhemiri95/PDFs-TextExtract
- Owner: ahmedkhemiri95
- License: apache-2.0
- Created: 2020-05-07T04:35:43.000Z (almost 5 years ago)
- Default Branch: master
- Last Pushed: 2025-02-10T23:48:32.000Z (3 months ago)
- Last Synced: 2025-02-11T00:29:35.687Z (3 months ago)
- Topics: data-science, extract-text, parser, pdf, pdf-document, pdf-processing, pdfminer, pdfs, pdfs-textextract, pypdf2, python, text-analytics
- Language: Python
- Size: 11.3 MB
- Stars: 128
- Watchers: 7
- Forks: 65
- Open Issues: 3
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# PDFs-TextExtract
Multiple and Large PDF Documents Text Extraction.