Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/pymupdf/pymupdf
PyMuPDF is a high-performance Python library for data extraction, analysis, conversion, and manipulation of PDF (and other) documents.
Last synced: about 15 hours ago
- Host: GitHub
- URL: https://github.com/pymupdf/pymupdf
- Owner: pymupdf
- License: agpl-3.0
- Created: 2012-10-06T18:54:25.000Z (almost 12 years ago)
- Default Branch: main
- Last Pushed: 2024-08-30T11:04:39.000Z (26 days ago)
- Last Synced: 2024-08-31T09:58:30.577Z (25 days ago)
- Topics: data-science, epub, extract-data, font, mupdf, ocr, pdf, pdf-documents, pymupdf, python, table-extraction, tesseract, text-processing, text-shaping, xps
- Language: Python
- Homepage: https://pymupdf.readthedocs.io
- Size: 296 MB
- Stars: 4,980
- Watchers: 60
- Forks: 480
- Open Issues: 42
Metadata Files:
- Readme: README.md
- Changelog: changes.txt
- License: COPYING
- Support: docs/supported-files-table.rst