https://github.com/jai2dev/convert-to-pdf
Convert your documents in pdf format and extract information from them. Supports many extension like docs, docx, rtf etc
https://github.com/jai2dev/convert-to-pdf
api-rest converter document-conversion flask pdf python3
Last synced: 11 months ago
JSON representation
Convert your documents in pdf format and extract information from them. Supports many extension like docs, docx, rtf etc
- Host: GitHub
- URL: https://github.com/jai2dev/convert-to-pdf
- Owner: jai2dev
- Created: 2021-06-17T17:19:31.000Z (over 4 years ago)
- Default Branch: main
- Last Pushed: 2023-10-23T18:12:10.000Z (over 2 years ago)
- Last Synced: 2025-02-05T20:07:54.923Z (about 1 year ago)
- Topics: api-rest, converter, document-conversion, flask, pdf, python3
- Language: Python
- Homepage:
- Size: 4.83 MB
- Stars: 1
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# File Converter (to PDF format)
Convert your documents in pdf format and extract information from them. Supports many extension like docs, docx, rtf etc
# How it works?
1. open the main folder and run the command pip install -r requirements.txt to install required packages
2. run the command python3 app.py
3. open the local host server and follow the given instructions there.
Note: the converted files will be in uploads folder of the app directory
## you can play around with the code to extract information from the document
Currently this app supports file conversion and extracting emails and mobile numbers from the converted document and put them in a csv file and show the output.
Change the regex for extracting other information.