Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/harshuljain13/ocr
Optical character recognition Using Deep Learning
https://github.com/harshuljain13/ocr
cnn ctc-loss deeplearning lstm lstm-neural-networks optical-character-recognition rcnn tensorflow tensorflow-experiments
Last synced: 3 months ago
JSON representation
Optical character recognition Using Deep Learning
- Host: GitHub
- URL: https://github.com/harshuljain13/ocr
- Owner: harshuljain13
- License: mit
- Created: 2018-06-09T03:20:06.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2018-06-17T07:26:47.000Z (over 6 years ago)
- Last Synced: 2024-10-10T08:42:51.450Z (3 months ago)
- Topics: cnn, ctc-loss, deeplearning, lstm, lstm-neural-networks, optical-character-recognition, rcnn, tensorflow, tensorflow-experiments
- Language: Jupyter Notebook
- Homepage:
- Size: 7.5 MB
- Stars: 30
- Watchers: 3
- Forks: 7
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Optical Character Recognition Using DeepLearning
![Er. Harshul Jain, author](https://img.shields.io/badge/Author-Er.%20Harshul%20Jain%20-blue.svg)Text is everywhere! It is present in PDFs, docs as well as images. There are lots of applications where text data is useful for doing analytics. Such applications include receipts recognition, number plate detection, extracting the latex formulas from the images etc. General Computer Vision can be used for such task but it lacks in accuracy. In order to solve the low accuracy and variance problem, we use the state of the art deep neural networks.
This repository includes:
```
1. A TensorFlow implementation of the CNN+LSTM+CTC model for OCR.
2. supporting scripts to apply the RCNN appraoch for OCR.
```### Architecture
![Architecture](images/cnn_lstm_Architecture.jpeg)### Instructions on How to run
Get the repository
```
git clone https://github.com/harshul1610/OCR.git
```Get the NIST19 dataset
```
mkdir data
wget https://s3.amazonaws.com/nist-srd/SD19/by_class.zip
unzip by_class.zip
mv by_class NIST19
```Get the Captcha data
```
cd OCR
python2 generate_captcha.py
```Run the final notebook for training and testing
```
CNN_LSTM_CTC_OCR-captcha.ipynb
```### LICENSE
MIT