Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/hreikin/pdf-toolbox
Extract content from PDF's and convert or create new documents from the content in multiple output formats.
https://github.com/hreikin/pdf-toolbox
adobe document-conversion document-converter document-creation document-creator document-extraction image-extraction pandoc pymupdf pypandoc python python3 scrapy text-extraction
Last synced: about 1 month ago
JSON representation
Extract content from PDF's and convert or create new documents from the content in multiple output formats.
- Host: GitHub
- URL: https://github.com/hreikin/pdf-toolbox
- Owner: hreikin
- License: mit
- Created: 2022-01-28T17:31:32.000Z (almost 3 years ago)
- Default Branch: main
- Last Pushed: 2022-03-17T22:53:21.000Z (over 2 years ago)
- Last Synced: 2024-03-15T23:54:44.178Z (8 months ago)
- Topics: adobe, document-conversion, document-converter, document-creation, document-creator, document-extraction, image-extraction, pandoc, pymupdf, pypandoc, python, python3, scrapy, text-extraction
- Language: Python
- Homepage:
- Size: 7.57 MB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 9