https://github.com/saif807380/form-data-extractor
Project for extracting and analysing text from images.
- Host: GitHub
- URL: https://github.com/saif807380/form-data-extractor
- Owner: Saif807380
- Created: 2020-04-30T07:41:41.000Z (over 5 years ago)
- Default Branch: master
- Last Pushed: 2020-05-31T07:37:17.000Z (over 5 years ago)
- Last Synced: 2025-02-09T19:57:47.238Z (8 months ago)
- Language: CSS
- Homepage:
- Size: 24.2 MB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
# Form-Data-Extractor
A Flask web app for extracting text from images and analysing it. Analysis includes skill extraction from resumes, sentiment analysis of feedback forms, and extractive summarisation of long texts.
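To illustrate what "extractive" summarisation means, here is a minimal sketch that scores each sentence by the average frequency of its words and keeps the top-scoring sentences in their original order. This is an assumption-laden illustration, not the code in `text_summariser.py`: the function name `summarise`, the `max_sentences` parameter, and the frequency-based scoring are all hypothetical choices made for the example.

```python
import re
from collections import Counter


def summarise(text, max_sentences=2):
    """Return the highest-scoring sentences of `text`, in original order.

    Hypothetical helper for illustration only; the project's own
    summariser may work differently.
    """
    # Naive sentence split: break after '.', '!' or '?' followed by whitespace.
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
    # Word frequencies over the whole text (lowercased, no stop-word removal).
    freq = Counter(re.findall(r"[a-z']+", text.lower()))

    def score(sentence):
        words = re.findall(r"[a-z']+", sentence.lower())
        # Average frequency, so a sentence is not favoured by length alone.
        return sum(freq[w] for w in words) / len(words) if words else 0.0

    # Rank sentence indices by score, keep the best, then restore document order.
    ranked = sorted(range(len(sentences)), key=lambda i: score(sentences[i]), reverse=True)
    keep = sorted(ranked[:max_sentences])
    return " ".join(sentences[i] for i in keep)
```

Because the summary is built only from sentences already present in the input, nothing is paraphrased or generated; that is the defining property of the extractive approach.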
## Directory Structure
```bash
.
├── ResumeAndFeedbackClassifier
│ ├── mylists.py
│ └── test.py
├── ResumeParser
│ ├── config.yaml
│ ├── field_extraction.py
│ ├── generate_top_skills.py
│ ├── lib.py
│ └── main.py
├── templates
│ ├── classifier.html
│ ├── index2.html
│ ├── login.html
│ ├── pdf_template.html
│ ├── register.html
│ ├── resume.html
│ ├── sentimental.html
│ └── summarizer.html
├── static
├── VoiceForm.py
├── cloudmersive_api.py
├── cloudmersive_extract.py
├── main.py
├── requirements.txt
├── text_summariser.py
├── top_skills.csv
└── top_titles.csv
```
## Setup
* Clone the repository
```bash
$ git clone https://github.com/Saif807380/Form-Extractor
```
* Create a virtual environment and install requirements.txt
```bash
$ virtualenv VIRTUAL_ENV_NAME
$ source VIRTUAL_ENV_NAME/bin/activate
$ pip install -r requirements.txt
```
* Download the `model.h5` and `tokenizer.pkl` files from [here](https://drive.google.com/open?id=1yGvxBxezg145-QzZVb5LSy6KjHVnWRjI) and put them in the root directory of the project.
* Set your API key in `cloudmersive_api.py`
* Set up a local MySQL server and configure the host and password the app connects with
* Run `main.py`
```bash
$ python main.py
```
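Before running the app, it can help to confirm that the downloaded files are where the steps above expect them. The following is a small sketch, assuming the project root is the current directory; the helper name `missing_files` and the `REQUIRED_FILES` constant are hypothetical and not part of the repository.

```python
from pathlib import Path

# Files the setup steps say must sit in the project root.
REQUIRED_FILES = ("model.h5", "tokenizer.pkl")


def missing_files(root=".", required=REQUIRED_FILES):
    """Return the required file names that are not present in `root`."""
    root_path = Path(root)
    return [name for name in required if not (root_path / name).is_file()]
```

Calling `missing_files()` from the project root returns an empty list once both downloads are in place; any names it returns still need to be copied in before `python main.py` is run.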
## Individual Modules
* [App](https://github.com/Saif807380/Form-Extractor-App)
* [Text Summarizer](https://github.com/Saif807380/Text-Summariser)
* [Resume Parser](https://github.com/Saif807380/Resume-Parser)