https://github.com/drkbluescience/data-extraction-from-documents
- Host: GitHub
- URL: https://github.com/drkbluescience/data-extraction-from-documents
- Owner: drkbluescience
- Created: 2023-07-21T18:09:25.000Z (almost 3 years ago)
- Default Branch: main
- Last Pushed: 2023-07-21T19:03:43.000Z (almost 3 years ago)
- Last Synced: 2025-12-05T22:28:12.350Z (6 months ago)
- Language: Python
- Size: 90.5 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
Extracts tables from PDF files and speaker notes from PowerPoint files, and writes them to text files in page-number order.
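The workflow described above could look roughly like the following sketch. It is not the repository's actual code: it assumes Camelot (`camelot-py`, whose documentation the installation notes link to) for PDF tables and `python-pptx` for speaker notes, and the function names `extract_pdf_tables`, `extract_slide_notes`, and `write_by_page` are hypothetical.

```python
from collections import defaultdict
from pathlib import Path


def extract_pdf_tables(pdf_path):
    """Return {page_number: [table_text, ...]} for one PDF (assumes camelot-py)."""
    import camelot  # third-party; imported lazily so the writer below works without it

    pages = defaultdict(list)
    for table in camelot.read_pdf(str(pdf_path), pages="all"):
        # table.df is a pandas DataFrame; serialise it as tab-separated text.
        text = table.df.to_csv(sep="\t", index=False, header=False)
        pages[int(table.page)].append(text)
    return dict(pages)


def extract_slide_notes(pptx_path):
    """Return {slide_number: [notes_text]} for one deck (assumes python-pptx)."""
    from pptx import Presentation  # third-party; imported lazily

    notes = {}
    slides = Presentation(str(pptx_path)).slides
    for number, slide in enumerate(slides, start=1):
        if not slide.has_notes_slide:
            continue
        frame = slide.notes_slide.notes_text_frame
        text = frame.text.strip() if frame is not None else ""
        if text:
            notes[number] = [text]
    return notes


def write_by_page(pages, out_path):
    """Write the collected text to one file, ordered by page or slide number."""
    lines = []
    for number in sorted(pages):
        lines.append(f"--- Page {number} ---")
        lines.extend(chunk.rstrip("\n") for chunk in pages[number])
    Path(out_path).write_text("\n".join(lines) + "\n", encoding="utf-8")
```

With both libraries installed, `write_by_page(extract_pdf_tables("report.pdf"), "report.txt")` would produce one text file for a PDF; the file names here are placeholders.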
## Installation
The virtual environment is already included. In addition, you need to download [Ghostscript](https://camelot-py.readthedocs.io/en/master/user/install-deps.html) and install it under `C:\Program Files\`.
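To check whether a Ghostscript executable is reachable, a small sketch like the one below may help. It only searches `PATH`, which an install under `C:\Program Files\` does not necessarily update, so a negative result is not conclusive. The helper name `find_ghostscript` is hypothetical, and the candidate names are the usual Ghostscript console executables on Windows and Unix-like systems.

```python
import shutil

# Common Ghostscript console executable names (Windows 64-bit, 32-bit, Unix).
GHOSTSCRIPT_NAMES = ("gswin64c", "gswin32c", "gs")


def find_ghostscript(names=GHOSTSCRIPT_NAMES):
    """Return the path of the first Ghostscript executable on PATH, or None."""
    for name in names:
        path = shutil.which(name)
        if path:
            return path
    return None


if __name__ == "__main__":
    found = find_ghostscript()
    print(found or "Ghostscript not found on PATH")
```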