Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
awesome-ocr
https://github.com/techiewonk/awesome-ocr
- tesseract - R bindings for tesseract OCR.
- EasyOCR - Ready-to-use OCR with 40+ languages supported including Chinese, Japanese, Korean and Thai.
- ocropus - OCR engine based on LSTM, `Apache 2.0`
- ocropus 0.4 - Older v0.4 state of Ocropus, with tesseract 2.04 and iulib, C++
- kraken
- gocr - OCR engine under the GNU Public License led by Joerg Schulenburg.
- Ocrad - The GNU OCR. `GPL`
- ocular - Ocular is a state-of-the-art historical OCR system.
- SwiftOCR - fast and simple OCR library written in Swift
- attention-ocr - OCR engine using visual attention mechanisms
- RWTH-OCR - The RWTH Aachen University Optical Character Recognition System
- simple-ocr-opencv - A simple pythonic OCR engine using OpenCV and NumPy
- Calamari - OCR Engine based on OCRopy and Kraken
- doctr
- Clara OCR - Open source OCR in C `GPL`
- Eye - an experimental Java OCR (image-to-text) application
- kognition - An omnifont OCR software for KDE
- OCRchie - Modular Optical Character Recognition Software
- ocre - o.c.r. easy
- xplab - A GTK 2 tool for pattern matching
- hebOCR - Hebrew character recognition library (previously named hocr, see [Wikipedia article](https://de.wikipedia.org/wiki/HebOCR)) `GPL`
- abby2hocr.xslt XSLT script
- ocr-conversion-scripts
- hocr-tools - Tools for doing various useful things with hOCR files, `Apache 2.0`
- hocr-spec - hOCR 1.2 specification
- ocr-transform - CLI tool to convert between hOCR and ALTO, `MIT`
- hocr-parser - hOCR Specification Python Parser
- hOCRTools - hOCR to ALTO conversion XSLT
- ALTO XML Schema - XML Schema and development of the ALTO XML format
- ALTO XML Documentation - Documentation and use cases for ALTO
- alto-tools - Various tools to work with ALTO files, Python
- AbbyyToAlto - PHP script converting from Abbyy 6 to ALTO XML
- TEI-OCR - TEI customization for OCR generated layout and content information
- TEI SIG on Libraries - Best Practices for TEI in Libraries
- GDZ - METS/TEI-based GDZ document format
- PAGE-XML Schema - XML schema of the PAGE XML format along with documentation and examples
- omni:us Pages Format (OPF) - XML schema very similar to PAGE XML that has some additional features.
- py-pagexml - Python library for handling PAGE XML and OPF files.
- OCRmyPDF - OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
- Pdf2PdfOCR - A tool to OCR a PDF (or supported images) and add a text "layer" (a "pdf sandwich") in the original file making it a searchable PDF. GUI included. Tesseract and cuneiform supported.
- Ocrocis - Project manager interface for Ocropy, see also [external project homepage](http://cistern.cis.lmu.de/ocrocis/)
- tesseract-recognize - Tesseract-based tool that outputs result in Page XML format ([docker image](https://hub.docker.com/r/mauvilsa/tesseract-recognize)).
- MORAN_v2 - MORAN: A Multi-Object Rectified Attention Network for Scene Text Recognition
- thomasjhaung/deep-learning-for-document-dewarping
- unproject_text - Perspective recovery of text using transformed ellipses
- unpaper - a post-processing tool for scanned sheets of paper, especially for book pages that have been scanned from previously created photocopies.
- deskew - Library used to deskew a scanned document
- deskewing - Contains code to deskew images using MLPs, LSTMs and LLS transformations
- skew_correction - De-skewing images with slanted content by finding the deviation using Canny Edge Detection.
- page_dewarp - Page dewarping and thresholding using a "cubic sheet" model
- text_deskewing - Rotate text images if they are not straight for better text detection and recognition.
- galfar/deskew - Deskew is a command line tool for deskewing scanned text documents. It uses Hough transform to detect "text lines" in the image. As an output, you get an image rotated so that the lines are horizontal.
- xellows1305/Document-Image-Dewarping - No code :(
- Docuwarp
- Alyn
- DewarpNet
- moz-hocr-editor - Firefox Addon for editing hOCR files **Discontinued**
- qt-box-editor - QT4 editor of tesseract-ocr box files.
- ocr-gt-tools - Client-Server application for editing OCR ground truth.
- Paperwork - Using scanners and OCR to grep paper documents the easy way.
- Paperless - Scan, index, and archive all of your paper documents.
- gImageReader - gImageReader is a simple Gtk/Qt front-end to tesseract-ocr.
- VietOCR - A Java/.NET GUI frontend for Tesseract OCR engine, including [jTessBoxEditor](http://vietocr.sourceforge.net/training.html) a graphical Tesseract [box data](https://github.com/tesseract-ocr/tesseract/wiki/Make-Box-Files) editor
- PoCoTo - Fast interactive batch corrections of complete OCR error series in OCR'ed historical documents.
- OCRFeeder - GTK graphical user interface that allows the users to correct characters or bounding boxes, ODT export and more.
- PRImA PAGE Viewer - Java based viewer for PAGE XML files (layout + text content). Also supports ALTO XML, FineReader XML, and HOCR.
- LAREX - LAREX is a semi-automatic open-source tool for layout analysis on early printed books.
- archiscribe - Web application for transcribing OCR ground truth from Archive.org. Deployed instance available at https://archiscribe.jbaiter.de/, results are available in [@jbaiter/archiscribe-corpus](https://github.com/jbaiter/archiscribe-corpus).
- nw-page-editor - Simple app for visual editing of Page XML files. Provides desktop and [server docker-based](https://hub.docker.com/r/mauvilsa/nw-page-editor-web) versions.
- DB
- DeepReg
- CornerText - ([paper:2018](https://arxiv.org/abs/1802.08948)) - Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation
- RRPN - Source code of RRPN ---- Arbitrary-Oriented Scene Text Detection via Rotation Proposals
- MASTER-TF - ([paper:2021](https://arxiv.org/abs/1910.02562)) - TensorFlow reimplementation of "MASTER: Multi-Aspect Non-local Network for Scene Text Recognition" (Pattern Recognition 2021).
- MaskTextSpotterV3 - ([paper:2020](https://arxiv.org/abs/2007.09482)) - Mask TextSpotter v3 is an end-to-end trainable scene text spotter that adopts a Segmentation Proposal Network (SPN) instead of an RPN.
- TextFuseNet - ([paper:2020](https://www.ijcai.org/Proceedings/2020/72)) A PyTorch implementation of "TextFuseNet: Scene Text Detection with Richer Fused Features".
- SATRN - ([paper:2020](https://openaccess.thecvf.com/content_CVPRW_2020/papers/w34/Lee_On_Recognizing_Texts_of_Arbitrary_Shapes_With_2D_Self-Attention_CVPRW_2020_paper.pdf)) - Official Tensorflow Implementation of Self-Attention Text Recognition Network (SATRN) (CVPR Workshop WTDDLE 2020).
- cvpr20-scatter-text-recognizer - ([paper:2020](https://openaccess.thecvf.com/content_CVPR_2020/papers/Litman_SCATTER_Selective_Context_Attentional_Scene_Text_Recognizer_CVPR_2020_paper.pdf)) - Unofficial implementation of CVPR 2020 paper "SCATTER: Selective Context Attentional Scene Text Recognizer"
- seed - ([paper:2020](https://arxiv.org/pdf/2005.10977.pdf)) - This is the implementation of the paper "SEED: Semantics Enhanced Encoder-Decoder Framework for Scene Text Recognition"
- vedastr
- AutoSTR - ([paper:2020](https://arxiv.org/pdf/2003.06567.pdf)) Efficient Backbone Search for Scene Text Recognition
- Decoupled-attention-network - ([paper:2019](https://arxiv.org/abs/1912.10205)) Pytorch implementation for "Decoupled attention network for text recognition".
- Bi-STET - ([paper:2020](https://arxiv.org/pdf/1912.03656.pdf)) Implementation of Bidirectional Scene Text Recognition with a Single Decoder
- kiss - ([paper:2019](https://arxiv.org/abs/1911.08400))
- Deformable Text Recognition - ([paper:2019](https://ieeexplore.ieee.org/abstract/document/9064428))
- MaskTextSpotter - ([paper:2019](https://ieeexplore.ieee.org/document/8812908))
- CUTIE - ([paper:2019](https://arxiv.org/abs/1903.12363v4))
- AttentionOCR - ([paper:2019](https://arxiv.org/abs/1912.04561))
- crpn - Corner-based Region Proposal Network.
- Scene-Text-Detection-with-SPECNET - Repository for Scene Text Detection with Supervised Pyramid Context Network with tensorflow.
- Character-Region-Awareness-for-Text-Detection
- Real-time-Scene-Text-Detection-and-Recognition-System - End-to-end pipeline for real-time scene text detection and recognition.
- ocr_attention - Robust Scene Text Recognition with Automatic Rectification.
- masktextspotter.caffee2 - The code of "Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes".
- InceptText-Tensorflow - An implementation of the algorithm in the paper "IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection".
- textspotter - An End-to-End TextSpotter with Explicit Alignment and Attention
- RRD - RRD: Rotation-Sensitive Regression for Oriented Scene Text Detection.
- SSTDNet - Implement 'Single Shot Text Detector with Regional Attention, ICCV 2017 Spotlight'.
- R2CNN - caffe re-implementation of R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection.
- Tensorflow_SceneText_Oriented_Box_Predictor - This project modifies the TensorFlow Object Detection API code to predict oriented bounding boxes. It can be used for scene text detection.
- DeepSceneTextReader - A C++ project deploying a deep scene text reading pipeline with TensorFlow. It reads text from natural scene images using frozen TensorFlow graphs: the detector locates scene text, and the recognizer reads the word in each detected bounding box.
- DeRPN - A novel region proposal network for more general object detection (including scene text detection).
- Bartzi/see - Code for the AAAI 2018 publication "SEE: Towards Semi-Supervised End-to-End Scene Text Recognition"
- Bartzi/stn-ocr - Code for the paper STN-OCR: A single Neural Network for Text Detection and Text Recognition
- beacandler/R2CNN - caffe re-implementation of R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection
- HsiehYiChia/Scene-text-recognition - Scene text detection and recognition based on Extremal Region(ER)
- R2CNN_Faster-RCNN_Tensorflow - Rotational region detection based on Faster-RCNN.
- corner - Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation
- Corner_Segmentation_TextDetection - Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation.
- TextSnake.pytorch - A PyTorch implementation of ECCV2018 Paper: TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes
- AON - Implementation for CVPR 2018 text recognition Paper by Tensorflow: "AON: Towards Arbitrarily-Oriented Text Recognition"
- pixel_link - Implementation of our paper 'PixelLink: Detecting Scene Text via Instance Segmentation' in AAAI2018
- seglink - An implementation of the SegLink algorithm from the paper "Detecting Oriented Text in Natural Images by Linking Segments" (=> pixel_link)
- SSTD - Single Shot Text Detector with Regional Attention
- MORAN_v2 - MORAN: A Multi-Object Rectified Attention Network for Scene Text Recognition
- Curve-Text-Detector - This repository provides train&test code, dataset, det.&rec. annotation, evaluation script, annotation tool, and ranking table.
- HCIILAB/DeRPN - A novel region proposal network for more general object detection (including scene text detection).
- TextField - TextField: Learning A Deep Direction Field for Irregular Scene Text Detection (TIP 2019)
- tensorflow-TextMountain - TextMountain: Accurate Scene Text Detection via Instance Segmentation
- bgshih/aster - Recognizing cropped text in natural images.
- ReceiptParser - A fuzzy receipt parser written in Python.
- vedastr
- NoiseRemove.java in MathOCR - Java implementation of "Adaptive degraded document image binarization" by B. Gatos, I. Pratikakis, S.J. Perantonis
- binarize.c in ZBar - C implementations of two binarization algorithms, based on Sauvola
- typeface-corpus - A repository for typefaces to train Tesseract and OCRopus for natural history collections and digital humanities.
- binarizewolfjolion - Comparison of binarization algorithms. [Blog post](http://zp-j.github.io/2013/10/04/document-binarization/)
- `crop_morphology.py` in oldnyc - Cropping a page to just the text block
- Whiteboard Picture Cleaner - Shell one-liner/script to clean up and beautify photos of whiteboards
- textcleaner - Processes a scanned document of text to clean the text background
- localcontrast - Fast O(1) local contrast optimization
- ARU-Net
- sbb_textline_detector
- watersink/Character-Segmentation
- sharatsawhney/character_segmentation
- githubharald/WordSegmentation
- kcws
- LayoutParser
- eynollah
- chulwoopack/docstrum
- LAREX - LAREX is a semi-automatic open-source tool for layout analysis on early printed books.
- leonlulu/DeepLayout - Deep learning based page layout analysis
- dhSegment
- Pay20Y/Layout_Analysis
- rbaguila/document-layout-analysis
- P2PaLA - Page to PAGE Layout Analysis Tool
- ocroseg - This is a deep learning model for page layout analysis / segmentation.
- DIVA-DIA/DIVA_Layout_Analysis_Evaluator - Layout Analysis Evaluator for the ICDAR 2017 competition on Layout Analysis for Challenging Medieval Manuscripts
- ocrsegment - a deep learning model for page layout analysis / segmentation.
- ARU-Net
- xy-cut-tree
- ocrd_segment
- LayoutML
- LayoutLMv2
- eynollah
- SimpleHTR - Handwritten Text Recognition (HTR) system implemented with TensorFlow.
- handwriting-ocr - OCR software for recognition of handwritten text
- AWSLabs: handwritten text recognition for Apache MXNet
- vloison/Handwritten_Text_Recognition
- TableNet - Unofficial implementation of ICDAR 2019 paper : TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Document Images.
- image-table-ocr
- TreeStructure - Table Extraction Tool
- TableTrainNet - Table recognition inside documents using neural networks.
- table_layout_detection_research
- TableBank
- Camelot
- ocr-table - Extract tables from scanned image PDFs using Optical Character Recognition.
- ExtractTable-py
- lingua - The most accurate natural language detection library for Java and other JVM languages, suitable for long and short text alike
- langdetect
- whatthelang - Lightning Fast Language Prediction
- wiki-lang-detect
- Open OCR - Run Tesseract in Docker containers
- tesseract-web-service - An implementation of RESTful web service for tesseract-OCR using tornado.
- docker-ocropy - A Docker container for running the [ocropy OCR system](https://github.com/tmbdev/ocropy).
- ABBYY Cloud OCR SDK Code samples - Code samples for using the proprietary commercial ABBYY OCR API.
- nidaba - An expandable and scalable OCR pipeline
- gamera - A meta-framework for building document processing applications, e.g. OCR
- ocr-tools - Project to provide CLI and web service interfaces to common OCR engines
- ocrad-docker - Run the [ocrad](http://www.gnu.org/software/ocrad/) OCR engine in a docker container
- kraken-docker - Run the [kraken](https://github.com/mittagessen/kraken) OCR engine in a docker container
- Konfuzio - Free online OCR for up to 2,000 pages per month and an OCR API by @atraining, see https://youtu.be/NZKUrKyFVA8 (code is not open)
- ocr.space - Free Online OCR and OCR API by [@a9t9](https://github.com/A9T9) based on Tesseract (code is not open)
- OCR4all - Provides OCR services through web applications. Included Projects: [LAREX](https://github.com/chreul/LAREX), [OCRopus](https://github.com/tmbdev/ocropy), [calamari](https://github.com/ChWick/calamari) and [nashi](https://github.com/andbue/nashi).
- ISRI OCR Evaluation Tools
- isri-ocr-evaluation-tools - further development by [@eddieantonio](https://github.com/eddieantonio) (2015, 2016)
- ancientgreekocr-evaluation-tools - further development by [@nickjwhite](https://github.com/nickjwhite) (2013, 2014)
- ocrevalUAtion - Cross-format evaluation, CLI and GUI
- ngram-ocr-eval - Brute and simple OCR evaluation using ngrams
- quack - Quality-Assurance-tool for scans with corresponding ALTO-files
- tesseract-ocr - A Crystal wrapper for tesseract-ocr.
- tesseract_ocr - Elixir library wrapping the tesseract executable.
- gosseract - Go package for OCR (Optical Character Recognition), by using Tesseract C++ library.
- Tess4J - Java Native Access bindings to Tesseract.
- tess-two - Tools for compiling Tesseract on Android and Java API.
- tesseract for .net - A .Net wrapper for tesseract-ocr.
- TTesseractOCR4 - Object Pascal binding for tesseract-ocr 4.x.
- Tesseract OCR for PHP - Tesseract PHP bindings.
- pytesseract - A Python wrapper for Google Tesseract.
- pyocr - A Python wrapper for Tesseract and Cuneiform.
- ocrodjvu - A library and standalone tool for doing OCR on DjVu documents, wrapping Cuneiform, gocr, ocrad, ocropus and tesseract
- tesserocr - A Python wrapper for the tesseract-ocr API
- ocracy - pure javascript lstm rnn implementation based on ocropus
- gocr.js - Javascript port (emscripten) of gocr
- ocrad.js - Javascript port (emscripten) of ocrad
- tesseract.js
- node-tesseract-ocr - A simple wrapper for the Tesseract OCR package.
- node-tesseract-native - C++ module for node providing OCR with tesseract and leptonica.
- rtesseract - Ruby library wrapping the tesseract and imagemagick executables.
- ruby-tesseract - Native Tesseract bindings for Ruby MRI and JRuby
- ocr_space - API wrapper for free ocr service ocr.space. Includes CLI
- tesseract.rs - Rust bindings for tesseract OCR.
- leptess - Productive and safe Rust bindings/wrappers for tesseract and leptonica.
- tesseract - R bindings for tesseract OCR.
- Tesseract OCR iOS - Swift and Objective-C wrapper for Tesseract OCR.
- SwiftOCR - Fast and simple OCR library written in Swift. Optimized for recognizing short, one line long alphanumeric codes.
- glyph-miner - A system for extracting glyphs from early typeset prints
- ocrodeg - a small Python library implementing document image degradation for data augmentation for handwriting recognition and OCR applications.
- archiscribe-corpus - >4,200 lines transcribed from 19th Century German prints via [archiscribe](https://archiscribe.jbaiter.de/) `CC-BY 4.0`
- CIS OCR Test Set - 2 example documents each in German/Latin/Greek with ground truth for [PoCoTo](https://github.com/cisocrgroup/PoCoTo)
- Rescribe - Transcriptions of Caroline Minuscule Manuscripts `PDM 1.0`
- CLTK - Corpora from [Classical Language Toolkit](http://cltk.org/) `PDM 1.0`
- DIVA-HisDB - 150 pages<sup>[PAGE-XML](https://github.com/PRImA-Research-Lab/PAGE-XML)</sup> of three medieval manuscripts `CC-BY-NC 3.0`
- EarlyPrintedBooks - ~8,800 lines from several early printed books `CC-BY-NC-SA 4.0`
- EEBO-TCP - 25,363 EEBO documents transcribed by [TCP](http://www.textcreationpartnership.org/tcp-eebo/) `PDM 1.0`
- ECCO-TCP - 2,188 ECCO documents transcribed by [TCP](http://www.textcreationpartnership.org/tcp-ecco/) `PDM 1.0`
- eMOP-TCP - 2,188 ECCO-TCP documents, cleaned up by [eMOP](http://emop.tamu.edu/) `PDM 1.0`
- Evans-TCP - 4,977 Evans documents transcribed by [TCP](http://www.textcreationpartnership.org/tcp-evans/)
- FDHN - Finnish Digitised Historical Newspapers, [Paper](http://doi.org/10.1045/july2016-paakkonen), (free) [registration](https://digi.kansalliskirjasto.fi/opendata/submit?set_language=en) required, [Terms of Use](https://digi.kansalliskirjasto.fi/terms)
- FROC-MSS - 4 Old French Medieval Manuscripts `CC-BY 4.0`
- GERMANA - 764 Spanish manuscript pages, (free) [registration](https://www.prhlt.upv.es/wp/resource/the-germana-corpus) required `non-commercial use only`
- GT4HistOCR - Ground Truth for German Fraktur and Early Modern Latin `CC-BY 4.0`
- imagessan - Sanskrit images & ground truth (Devanagari script)
- IMPACT-BHL - 2,418 pages<sup>[PAGE-XML](https://github.com/PRImA-Research-Lab/PAGE-XML)</sup> from the Biodiversity Heritage Library, [XML@GitHub](https://github.com/impactcentre/groundtruth-bhl) `CC-BY 3.0`
- IMPACT-BL - 294 pages<sup>[PAGE-XML](https://github.com/PRImA-Research-Lab/PAGE-XML)</sup> from the British Library, (free) [registration](https://www.digitisation.eu/wp-login.php?action=register) required `PDM 1.0`
- IMPACT-BNE - 215 pages<sup>[PAGE-XML](https://github.com/PRImA-Research-Lab/PAGE-XML)</sup> from the National Library of Spain, (free) [registration](https://www.digitisation.eu/wp-login.php?action=register) required, [XML@GitHub](https://github.com/impactcentre/groundtruth-spa) `CC-BY-NC-SA 4.0`
- IMPACT-BNF - 151 pages<sup>[PAGE-XML](https://github.com/PRImA-Research-Lab/PAGE-XML)</sup> from the National Library of France, (free) [registration](https://www.digitisation.eu/wp-login.php?action=register) required `CC-BY-NC-SA 4.0`
- IMPACT-KB - 142 pages<sup>[PAGE-XML](https://github.com/PRImA-Research-Lab/PAGE-XML)</sup> from the National Library of the Netherlands `CC-BY 4.0`
- IMPACT-NKC - 187 pages<sup>[PAGE-XML](https://github.com/PRImA-Research-Lab/PAGE-XML)</sup> from the Czech National Library, (free) [registration](https://www.digitisation.eu/wp-login.php?action=register) required `CC-BY-NC-SA 4.0`
- IMPACT-NLB - 19 pages<sup>[PAGE-XML](https://github.com/PRImA-Research-Lab/PAGE-XML)</sup> from the National Library of Bulgaria, (free) [registration](https://www.digitisation.eu/wp-login.php?action=register) required `CC-BY-NC-ND 4.0`
- IMPACT-NUK - 209 pages<sup>[PAGE-XML](https://github.com/PRImA-Research-Lab/PAGE-XML)</sup> from the National Library of Slovenia, (free) [registration](https://www.digitisation.eu/wp-login.php?action=register) required `CC-BY-NC-SA 4.0`
- IMPACT-PSNC - 478 pages<sup>[PAGE-XML](https://github.com/PRImA-Research-Lab/PAGE-XML)</sup> from four Polish digital libraries, [XML@GitHub](https://github.com/impactcentre/groundtruth-pol) `CC-BY 3.0`
- LascivaRoma/lexical - Transcription of 19th century lexical resources for Latin learning
- MJSynth - 9m synthetic images covering 90k English words
- OCR19thSAC - 19,000 pages Swiss Alpine Club yearbooks transcribed via [Text+Berg digital](http://textberg.ch/site/en/welcome/) `CC-BY 4.0`
- OCR-D - 180 pages<sup>[PAGE-XML](https://github.com/PRImA-Research-Lab/PAGE-XML)</sup> of German historical prints from [OCR-D](http://ocr-d.de/) `CC-BY-SA 4.0`
- OCR_GS_Data - Double-checked Arabic Gold Standard from [OpenITI](https://github.com/OpenITI)
- old-books - 322 old books from [Project Gutenberg](https://www.gutenberg.org/) `GPL 3.0`
- PRImA-ENP - 528 pages<sup>[PAGE-XML](https://github.com/PRImA-Research-Lab/PAGE-XML)</sup> historic newspapers from [Europeana Newspapers](http://www.europeana-newspapers.eu/), (free) [registration](http://www.primaresearch.org/register) required `PDM 1.0`
- RODRIGO - 853 Spanish manuscript pages, (free) [registration](https://www.prhlt.upv.es/wp/resource/the-rodrigo-corpus) required `non-commercial use only`
- Toebler-OCR - (Kraken) Ground Truth transcription of few pages of the Tobler-Lommatzsch: Altfranzösisches Wörterbuch
- VideoTextSCM
- TransDETR
- YORO
- typefont - The first open-source library that detects the font of a text in a image.
- DAVAR-lab-OCR
- CRNN.tf2
- ocr.pytorch
- PytorchOCR
- MMOCR
- doctr
- Master OCR
- xiaofengShi/CHINESE-OCR
- PaddleOCR
- Urdu-Ocr
- ocular - Ocular is a state-of-the-art historical OCR system.
- OCR++
- pytextrator - Python OCR using Tesseract with the EAST OpenCV detector
- OCR-D
- ocrd_tesserocr
- Deeplearning-OCR
- PICCL
- cnn_lstm_ctc_ocr - Tensorflow-based CNN+LSTM trained with CTC-loss for OCR.
- PassportScanner - Scan the MRZ code of a passport and extract the first name, last name, passport number, nationality, date of birth, expiration date and personal number.
- pannous/tensorflow-ocr - OCR using tensorflow with attention.
- BowieHsu/tensorflow_ocr - OCR detection implemented with TensorFlow v1.4.
- GRCNN-for-OCR - This is the implementation of the paper "Gated Recurrent Convolution Neural Network for OCR"
- go-ocr - A tool for extracting text from scanned documents (via OCR), with user-defined post-processing.
- insightocr - MXNet OCR implementation. Including text recognition and detection.
- ocr_densenet - First place in the first Xi'an Jiaotong University Artificial Intelligence Practice Contest (2018 AI Practice Contest - Picture Text Recognition); uses only DenseNet to recognize the Chinese characters
- CNN_LSTM_CTC_Tensorflow - CNN+LSTM+CTC based OCR implemented using tensorflow.
- tmbdev/clstm - A small C++ implementation of LSTM networks, focused on OCR.
- VistaOCR
- tesseract.js
- Tesseract
- kaldi
- ocropus3 - Repository collecting all the submodules for the new PyTorch-based OCR System.
- calamari
- ocropy - Python-based tools for document analysis and OCR
- chinese_ocr
- deep_ocr - make a better chinese character recognition OCR than tesseract.
- ocular - Ocular is a state-of-the-art historical OCR system.
- textDetectionWithScriptID
- transcribus
- FastText - Library for efficient text classification and representation learning
- GOCR
- Ocrad
- franc - Natural language detection
- ocrfeeder
- emedvedev/attention-ocr - A Tensorflow model for text recognition (CNN + seq2seq with visual attention) available as a Python package and compatible with Google Cloud ML Engine.
- da03/attention-ocr - Visual Attention based OCR
- dhlab-epfl/dhSegment - Generic framework for historical document processing
- grobid - A machine learning software for extracting information from scholarly documents
- lapdftext - LA-PDFText is a system for extracting accurate text from PDF-based research articles
- kraken
- gosseract - Go package for OCR (Optical Character Recognition), by using Tesseract C++ library.
- EasyOCR - Ready-to-use OCR with 40+ languages supported including Chinese, Japanese, Korean and Thai.
- invoice-scanner-react-native
- Arabic-OCR
- Awesome-Scene-Text-Recognition
- awesome-deep-text-detection-recognition
- kba/awesome-ocr
- perfectspr/awesome-ocr
- chongyangtao/Awesome-Scene-Text-Recognition - Papers and datasets
- ABBYY
- Omnipage
- Clova.ai
- Konfuzio
- thehive.ai
- impira
- AWS Textract
- Nanonets
- docparser
- ocrolus
- Butler Labs
- nw-page-editor - Simple app for visual editing of Page XML files
- hocr
- alto
- PageXML
- ocr-fileformat - Validate and transform various OCR file formats
- hocr-tools - Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.
- scenetext - This is a synthetically generated dataset, in which word instances are placed in natural scene images, while taking into account the scene layout.
- Total-Text-Dataset
- ocr-open-dataset
- DocCreator - DIAR software for synthetic document image and groundtruth generation, with various degradation models for data augmentation.
- Scene-Text-Image-Transformer - Scene Text Image Transformer
- Belval/TextRecognitionDataGenerator - A synthetic data generator for text recognition
- Sanster/text_renderer
- awesome-SynthText
- Text-Image-Augmentation
- UnrealText
- SynthText_Chinese_version
- ajgalleo/document-image-binarization
- PRLib - Pre-Recognize Library - library with algorithms for improving OCR quality.
- sbb_binarization
- KBNLresearch/ochre - Toolbox for OCR post-correction
- cisocrgroup/PoCoTo - The CIS OCR PostCorrectionTool
- afterscan
- TedEval
- clovaai/deep-text-recognition-benchmark - Text recognition (optical character recognition) with deep learning methods.
- dinglehopper - dinglehopper is an OCR evaluation tool and reads ALTO, PAGE and text files.
- CLEval
- ocrodeg - a small Python library implementing document image degradation for data augmentation for handwriting recognition and OCR applications.
- scantailor - Scan Tailor is an interactive post-processing tool for scanned pages.
- jlsutherland/doc2text - Helps researchers fix errors and extract the highest-quality text possible from their PDFs.
- mauvilsa/nw-page-editor - Simple app for visual editing of Page XML files.
- Transkribus - Transkribus is a comprehensive platform for the digitisation, AI-powered recognition, transcription and searching of historical documents.
- open-semantic-search - Open Semantic Search Engine and Open Source Text Mining & Text Analytics platform (Integrates ETL for document processing, OCR for images & PDF, named entity recognition for persons, organizations & locations, metadata management by thesaurus & ontologies, search user interface & search apps for fulltext search, faceted search & knowledge graph)
- ocrserver - A simple OCR API server, seriously easy to be deployed by Docker, on Heroku as well
- cosc428-structor - ~1000 book pages + OpenCV + python = page regions identified as paragraphs, lines, images, captions, etc.
- nidaba - An expandable and scalable OCR pipeline
- OCRmyPDF
- IMPACT: Tools for text digitisation - List of tools and software projects, some related to OCR
- OCR-D - List of OCR-related academic articles in the context of the [OCR-D](http://www.ocr-d.de/) project (in German).
- Mendeley Group "OCR - Optical Character Recognition" - Collection of 34 papers on OCR
- eadh.org projects - List of Digital Humanities-related projects in Europe, some related to OCR
- Wikipedia: Comparison of optical character recognition software
OCR and Deep Learning
- Ocropus Wiki: Publications
- Tesseract Blends Old and New OCR Technology
- What You Always Wanted To Know About Tesseract
- Extracting text from an image using Ocropus
- Training an Ocropus OCR model
- Ocropus Wiki: Compute errors and confusions
- Ocropus Wiki: Working with Ground Truth
- OCRopus
- 10 Tips for making your OCR project succeed
- Overview of LEADTOOLS Image Cleanup and Pre-processing SDK Technology
- Extracting Text from PDFs; Doing OCR; all within R
- R programming environment
- Tutorial: Command-line OCR on a Mac
Practical Experience with OCRopus Model Training
Homemade Manuscript OCR (1): OCRopy - by [@Jean-Baptiste-Camps](https://github.com/Jean-Baptiste-Camps)
- Optimizing Binarization for OCRopus
- Prototype demo for OCR postfix in Danish Newspapers
- How Can I OCR My Dictionary?
"Needlessly complex" blog - how-tos (Python based), particularly:
- Page dewarping
- Compressing and enhancing hand-written notes
- Unprojecting text with ellipses
(Open-Source-)OCR-Workflows - from the [OCR-D](https://github.com/OCR-D) project.
- A gentle introduction to OCR
- Worauf kann ich mich verlassen? Arbeiten mit digitalisierten Quellen, Teil 1: OCR
- abbyy-finereader-ocr-senate - Using OCR to parse scanned Senate Financial Disclosure forms.
- cvOCR - An OCR system for recognizing resume or cv text, implemented in Python and C and based on tesseract
- MathOCR - A printed scientific document recognition system, **pre-alpha**
- High performance document layout analysis
- Adaptive degraded document image binarization
Internship Report
- OCRopus Addons (Internship Report)
- Local Logistic Classifiers for Large Scale Learning
- High Performance OCR for Printed English and Fraktur using LSTM Networks - Hasan, Mayce Al Azawi. Shafait
- Can we build language-independent OCR using LSTM networks? - Hasan, Breuel
- Offline Printed Urdu Nastaleeq Script Recognition with Bidirectional LSTM Networks - Hasan, Ahmed, Rashid, Shafait, Breuel
- OCR of historical printings of Latin texts: Problems, Prospects, Progress.
- Correcting Noisy OCR: Context beats Confusion
- TypeWright: An Experiment in Participatory Curation
- Benchmarking of LSTM Networks
- Recognition of Historical Greek Polytonic Scripts Using LSTM - Hassan, Papavassiliou, Basilis Gatos, Katsouros, Liwicki
- A Segmentation-Free Approach for Printed Devanagari Script Recognition - Hasan, Breuel
A Sequence Learning Approach for Multiple Script Identification - Hasan, Afzal, Shafait, Liwicki, Breuel
- Important New Developments in Arabographic Optical Character Recognition (OCR)
- OpenArabic/OCR_GS_Data
- OCR of historical printings with an application to building diachronic corpora: A case study using the RIDGES herbal corpus
- Automatic quality evaluation and (semi-) automatic improvement of mixed models for OCR on historical documents
- Generic Text Recognition using Long Short-Term Memory Networks - Hasan -- Ph.D Thesis
- OCRoRACT: A Sequence Learning OCR System Trained on Isolated Characters - Hasan, Bukhari
- Recursive Recurrent Nets with Attention Modeling for OCR in the Wild
- paper:2016
- text-detection-ctpn
- yizt/keras-ctpn
- tianzhi0549/CTPN - Detecting Text in Natural Image with Connectionist Text Proposal Network
- paper:2016
- TextBoxes (official)
- TextBoxes-TensorFlow - TextBoxes re-implementation using TensorFlow
- zj463261929/TextBoxes - TextBoxes: A Fast Text Detector with a Single Deep Neural Network
- shinjayne/textboxes - Textboxes implementation with Tensorflow (python)
- shinTB - Textboxes : Image Text Detection Model : python package (tensorflow)
- Telugu OCR Framework using Deep Learning
- TeluguOCR
- EAST - (tf1/py2) A tensorflow implementation of EAST text detector
- AdvancedEAST - (tf1/py2) AdvancedEAST is a scene-image text detection algorithm based primarily on EAST, with significant improvements that make long-text predictions more accurate.
- kurapan/EAST
- songdejia/EAST - This is a pytorch re-implementation of EAST: An Efficient and Accurate Scene Text Detector.
- HaozhengLi/EAST_ICPR - Forked from argman/EAST for the ICPR MTWI 2018 CHALLENGE
- deepthinking-qichao/EAST_ICPR2018
- SakuraRiven/EAST
- EAST-Detector-for-text-detection-using-OpenCV - Text Detection from images using OpenCV
- easy-EAST
A Two-Stage Method for Text Line Detection in Historical Documents - ARU-Net
- tensorflow_PSENet - This is a tensorflow re-implementation of PSENet: Shape Robust Text Detection with Progressive Scale Expansion Network
- PAN-PSEnet
- PSENet - Shape Robust Text Detection with Progressive Scale Expansion Network.
- paper:2018
- FOTS - An Implementation of the FOTS: Fast Oriented Text Spotting with a Unified Network.
- FOTS_OCR
- paper:2018
TextBoxes_plusplus (official) - A Single-Shot Oriented Scene Text Detector
- Shun14/TextBoxes_plusplus_Tensorflo - Textboxes_plusplus implementation with Tensorflow (python)
- paper:2019
- CRAFT-pytorch (official) - Pytorch implementation of CRAFT text detector.
- autonise/CRAFT-Remade
- s3nh/pytorch-text-recognition
- backtime92/CRAFT-Reimplementation
- fcakyon/craft-text-detector
- YongWookHa/craft-text-detector
- faustomorales/keras-ocr - A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.
- paper:2020
- AdelaiDet
Keywords
- ocr (76)
- deep-learning (28)
- text-detection (27)
- tensorflow (21)
- text-recognition (17)
- scene-text-recognition (15)
- scene-text (15)
- pytorch (13)
- tesseract (13)
- scene-text-detection (13)
- python (12)
- detection (10)
- computer-vision (10)
- machine-learning (10)
- crnn (9)
- optical-character-recognition (9)
- tesseract-ocr (8)
- ocr-recognition (7)
- text (6)
- keras (6)
- dataset (5)
- image-processing (5)
- craft (5)
- text-detection-recognition (5)
- cnn (5)
- nlp (5)
- hocr (4)
- opencv (4)
- icdar (4)
- lstm (4)
- page-xml (4)
- pdf (4)
- ocr-d (4)
- recognition (4)
- handwritten-text-recognition (4)
- qurator (4)
- alto-xml (4)
- alto (4)
- ctpn (4)
- neural-network (4)
- segmentation (4)
- end-to-end (3)
- east (3)
- language-detection (3)
- transformer (3)
- ctc (3)
- scanner (3)
- psenet (3)
- deep-neural-networks (3)
- ocr-detection (3)