Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/bilalhameed248/pdf-document-extraction
Python PDF-to-HTML converter that transforms PDF documents into structured HTML tags (Feb 2022 - Jun 2023).
- Host: GitHub
- URL: https://github.com/bilalhameed248/pdf-document-extraction
- Owner: bilalhameed248
- Created: 2022-10-04T18:19:33.000Z (about 2 years ago)
- Default Branch: main
- Last Pushed: 2023-11-05T10:32:25.000Z (about 1 year ago)
- Last Synced: 2024-06-07T20:10:23.372Z (5 months ago)
- Topics: document, extraction, fitz, parser, parsing, pdf, pymupdf, pymupdf-fitz, python, python3
- Language: Python
- Homepage:
- Size: 73.2 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files: