https://github.com/saif807380/form-data-extractor

Project for extracting and analysing text from images.

Host: GitHub
URL: https://github.com/saif807380/form-data-extractor
Owner: Saif807380
Created: 2020-04-30T07:41:41.000Z (over 5 years ago)
Default Branch: master
Last Pushed: 2020-05-31T07:37:17.000Z (over 5 years ago)
Last Synced: 2025-02-09T19:57:47.238Z (8 months ago)
Language: CSS
Homepage:
Size: 24.2 MB
Stars: 0
Watchers: 2
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

# Form-Data-Extractor

A Flask web app for extracting text from images and analysing it. The analysis includes skills extraction from resumes, sentiment analysis of feedback forms, and extractive text summarisation of large texts.
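
To illustrate what extractive summarisation means here, the following is a minimal sketch that scores sentences by average word frequency and keeps the top ones in their original order. It is not the code in `text_summariser.py`; the function name `summarise`, the `max_sentences` parameter and the stop-word list are assumptions made for this example.

```python
import re
from collections import Counter

# Small illustrative stop-word list (an assumption, not taken from the project).
STOP_WORDS = {
    "a", "an", "and", "are", "as", "at", "be", "by", "for", "from", "in",
    "is", "it", "of", "on", "or", "that", "the", "this", "to", "was", "with",
}


def _tokens(text):
    """Lowercase word tokens of `text` with stop words removed."""
    return [w for w in re.findall(r"[a-z']+", text.lower()) if w not in STOP_WORDS]


def summarise(text, max_sentences=2):
    """Return the highest-scoring sentences of `text`, in their original order."""
    # Naive sentence split: break after ., ! or ? followed by whitespace.
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
    freq = Counter(_tokens(text))

    def score(sentence):
        # Average frequency of the sentence's words across the whole text.
        tokens = _tokens(sentence)
        return sum(freq[w] for w in tokens) / len(tokens) if tokens else 0.0

    ranked = sorted(range(len(sentences)), key=lambda i: score(sentences[i]), reverse=True)
    keep = sorted(ranked[:max_sentences])  # restore document order
    return " ".join(sentences[i] for i in keep)
```

Because the summary is built only from sentences already present in the input, nothing is paraphrased or generated, which is what distinguishes extractive from abstractive summarisation.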

## Directory Structure
```bash
.
├── ResumeAndFeedbackClassifier
│   ├── mylists.py
│   └── test.py
├── ResumeParser
│   ├── config.yaml
│   ├── field_extraction.py
│   ├── generate_top_skills.py
│   ├── lib.py
│   └── main.py
├── templates
│   ├── classifier.html
│   ├── index2.html
│   ├── login.html
│   ├── pdf_template.html
│   ├── register.html
│   ├── resume.html
│   ├── sentimental.html
│   └── summarizer.html
├── static
├── VoiceForm.py
├── cloudmersive_api.py
├── cloudmersive_extract.py
├── main.py
├── requirements.txt
├── text_summariser.py
├── top_skills.csv
└── top_titles.csv
```

## Setup
* Clone the repository
```bash
$ git clone https://github.com/Saif807380/Form-Extractor
```
* Create a virtual environment and install requirements.txt
```bash
# Assumes the virtualenv package is installed
$ virtualenv VIRTUAL_ENV_NAME
$ source VIRTUAL_ENV_NAME/bin/activate

$ pip install -r requirements.txt
```
* Download the `model.h5` and `tokenizer.pkl` files from [here](https://drive.google.com/open?id=1yGvxBxezg145-QzZVb5LSy6KjHVnWRjI) and put the files in the root directory of the project.

* Set your API KEY in `cloudmersive_api.py`

* Set up a MySQL server on localhost and provide its password to the app

* Run `main.py`
```bash
$ python main.py
```
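
Before running the app, it can help to confirm that the downloaded model files are where the setup steps say they should be. The following is a minimal sketch of such a check, not part of the repository; the helper name `missing_files` is hypothetical, and the file names are the ones listed in the setup steps above.

```python
from pathlib import Path

# File names as given in the setup steps; both belong in the project root.
REQUIRED_FILES = ("model.h5", "tokenizer.pkl")


def missing_files(project_root="."):
    """Return the required files that are absent from `project_root`."""
    root = Path(project_root)
    return [name for name in REQUIRED_FILES if not (root / name).is_file()]


if __name__ == "__main__":
    missing = missing_files()
    if missing:
        print("Missing from project root:", ", ".join(missing))
    else:
        print("All model files found.")
```

Run it from the project root; an empty result means both files are in place.
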
## Individual Modules
* [App](https://github.com/Saif807380/Form-Extractor-App)
* [Text Summarizer](https://github.com/Saif807380/Text-Summariser)
* [Resume Parser](https://github.com/Saif807380/Resume-Parser)
