https://github.com/isi-vista/VistaOCR

ISI's Optical Character Recognition (OCR) software for machine-print and handwriting data
https://github.com/isi-vista/VistaOCR

Last synced: 3 months ago
JSON representation

ISI's Optical Character Recognition (OCR) software for machine-print and handwriting data

Host: GitHub
URL: https://github.com/isi-vista/VistaOCR
Owner: isi-vista
License: apache-2.0
Created: 2018-04-27T17:04:27.000Z (about 7 years ago)
Default Branch: master
Last Pushed: 2019-06-27T19:09:40.000Z (about 6 years ago)
Last Synced: 2024-11-03T10:32:53.854Z (8 months ago)
Language: Python
Size: 186 KB
Stars: 23
Watchers: 9
Forks: 10
Open Issues: 3
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# VistaOCR
ISI's Optical Character Recognition (OCR) software for machine-print and handwriting data

# Publications
*"How to Efficiently Increase Resolution in Neural OCR Models"*. Stephen Rawls, Huaigu Cao, Joe Mathai, Prem Natarajan. IEEE Workshop on Arabic Script Analysis and Recognition (ASAR) 2018.

[*"Combining Convolutional Neural Networks and LSTMs for Segmentation Free OCR"*](https://doi.org/10.1109/ICDAR.2017.34). Stephen Rawls, Huaigu Cao, Senthil Kumar, Prem Natarajan. International Conference on Document Analysis and Recognition (ICDAR) 2017.

[*"Combining Deep Learning and Language Modeling for Segmentation-free OCR From Raw Pixels"*](https://doi.org/10.1109/ASAR.2017.8067772). Stephen Rawls, Huaigu Cao, Ekraam Sabir, Prem Natarajan. IEEE Workshop on Arabic Script Analysis and Recognition (ASAR) 2017.

# Model

![VistaOCR Model Diagram](https://github.com/isi-vista/VistaOCR/blob/master/vista-ocr-model-diagram.png)

# Pretrained Models

Coming Soon. Pre-trained models for English, French, and Arabic Handwriting

# Performance Numbers

Coming soon. Expected character and word error rates from public datasets.

# How to Train

Coming soon.

# How to Decode using Existing Model

Coming soon.

# Citation
```
@inproceedings{vistaocr,
author = {Stephen Rawls and Huaigu Cao and Senthil Kumar and Prem Natarjan},
title = {Combining Convolutional Neural Networks and LSTMs for Segmentation Free OCR},
booktitle = {Proc. ICDAR},
year = {2017},
url = {https://doi.org/10.1109/ICDAR.2017.34},
doi = {10.1109/ICDAR.2017.34}
}
```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/isi-vista/VistaOCR

Awesome Lists containing this project

README