https://github.com/drkbluescience/data-extraction-from-documents



Extracts tables from PDF files and speaker notes from PowerPoint files, then writes them to text files ordered by page number.
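
The workflow described above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the function names (`write_by_page`, `pdf_tables`, `pptx_notes`), the output file naming, and the one-file-per-page layout are all assumptions. It also assumes `camelot` is used for PDF tables (suggested by the Ghostscript dependency) and `python-pptx` for PowerPoint notes, which the README does not state.

```python
from collections import defaultdict
from pathlib import Path


def write_by_page(items, out_dir, prefix):
    """Write (page_number, text) pairs to one text file per page, in page order.

    Hypothetical helper: the file naming scheme is an assumption.
    """
    grouped = defaultdict(list)
    for page, text in items:
        grouped[int(page)].append(text)

    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)

    written = []
    for page in sorted(grouped):
        path = out_dir / f"{prefix}_page_{page}.txt"
        path.write_text("\n\n".join(grouped[page]), encoding="utf-8")
        written.append(path)
    return written


def pdf_tables(pdf_path):
    """Yield (page_number, table_as_csv_text) for every table camelot finds."""
    import camelot  # third-party; its default mode relies on Ghostscript

    for table in camelot.read_pdf(str(pdf_path), pages="all"):
        yield table.page, table.df.to_csv(index=False, header=False)


def pptx_notes(pptx_path):
    """Yield (slide_number, notes_text) for slides that have speaker notes."""
    from pptx import Presentation  # third-party package: python-pptx

    slides = Presentation(str(pptx_path)).slides
    for number, slide in enumerate(slides, start=1):
        if not slide.has_notes_slide:
            continue
        frame = slide.notes_slide.notes_text_frame
        if frame is not None and frame.text.strip():
            yield number, frame.text
```

With these helpers, a call such as `write_by_page(pdf_tables("report.pdf"), "output", "tables")` would produce one text file per page that contains a table; `report.pdf` and `output` are placeholder names.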

## Installation

The virtual environment is already provided. In addition, you need to download [Ghostscript](https://camelot-py.readthedocs.io/en/master/user/install-deps.html) and install it under `C:\Program Files\`.