Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists tagged with ocr

A curated list of projects in awesome lists tagged with ocr .

https://github.com/junhoyeo/betterocr

🔍 Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with 🧠 LLM.

ai chatgpt chatgpt-api easyocr llm ocr openai openai-api tesseract tesseract-ocr

Last synced: 02 Aug 2024

https://github.com/ahmetozlu/signature_extractor

A super lightweight image processing algorithm for detection and extraction of overlapped handwritten signatures on scanned documents using OpenCV and scikit-image.

handwritten-signatures image-processing image-segmentation ocr ocr-engine optical-character-recognition scanned-documents signature-detection signature-extraction-algorithm signature-extractor signature-recognition signature-verification

Last synced: 05 Aug 2024

https://github.com/TalkUHulk/ai.deploy.box

A toolbox for deep learning model deployment using C++ YoloX | YoloV7 | YoloV8 | Gan | OCR | MobileVit | Scrfd | MobileSAM | StableDiffusion

controlnet cpp face gan lora mnn mobilesam ncnn ocr onnx paddlelite scrfd stablediffusion tnn webassembly yolov7 yolov8 yolox

Last synced: 31 Jul 2024

https://github.com/Holmeyoung/crnn-pytorch

Pytorch implementation of CRNN (CNN + RNN + CTCLoss) for all language OCR.

cnn ctc-loss ocr rnn

Last synced: 01 Aug 2024

https://github.com/watsonyanghx/CNN_LSTM_CTC_Tensorflow

CNN+LSTM+CTC based OCR implemented using tensorflow.

cnn ctc lstm ocr tensorflow

Last synced: 01 Aug 2024

https://github.com/lucasrla/remarks

Extract annotations (highlights and scribbles) from PDF, EPUB, and notebooks marked with reMarkable tablets. Export to Markdown, PDF, PNG, SVG

annotations epub highlighting markdown obsidian ocr ocrmypdf pdf pdf-converter pymupdf remarkable-tablet roamresearch svg-images zotero

Last synced: 30 Sep 2024

https://github.com/Megvii-CSG/MegReader

A research project for text detection and recognition using PyTorch 1.2.

ctc deep-learning ocr pytorch text-detection text-detection-recognition text-recognition

Last synced: 01 Aug 2024

https://github.com/danschultzer/receipt-scanner

Receipt scanner extracts information from your PDF or image receipts - built in NodeJS

extract-data extract-information ocr optical-character-recognition receipt-scanner receipts

Last synced: 01 Oct 2024

https://github.com/enoch3712/ExtractThinker

ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.

ai llm nlp ocr openai python

Last synced: 01 Aug 2024

https://github.com/pd3f/pd3f

🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based

extract-text language-model machine-learning ocr parsr pd3f pdf pdf-to-text pipeline python text-extraction

Last synced: 01 Aug 2024

https://github.com/scambier/obsidian-text-extractor

A (companion) plugin to facilitate the extraction of text from images (OCR) and PDFs.

obsidian obsidian-plugin ocr pdf

Last synced: 01 Aug 2024

https://github.com/blueaxis/Poricom

Optical character recognition in manga images. Manga OCR desktop application

manga-ocr manga-reader ocr pyqt5-gui

Last synced: 30 Jul 2024

https://github.com/SWHL/RapidVideOCR

Extract video hard subtitles and automatically generate corresponding srt files.

ocr subtitle video videosubfinder

Last synced: 09 Aug 2024

https://github.com/matiastucci/ionic-ocr-example

:camera: Simple Ionic app using ocrad.js

camera ionic ocr ocrad

Last synced: 29 Sep 2024

https://github.com/ExtractTable/ExtractTable-py

Python library to extract tabular data from images and scanned PDFs

extracttable image-table-recognition ocr pdf-table-extract table-extraction tabular-data

Last synced: 01 Aug 2024

https://github.com/cseas/ocr-table

Extract tables from scanned image PDFs using Optical Character Recognition.

extract-tables ocr ocr-table optical-character-recognition pdfminer python scanned-image-pdfs shell tesseract

Last synced: 01 Aug 2024

https://github.com/LeoFCardoso/pdf2pdfocr

A free tool to OCR a PDF and add a text "layer" in the original file, making a searchable PDF. Use only open source tools. Please tip!

docker ocr pdf pdftk python tesseract

Last synced: 30 Jul 2024

https://github.com/fcakyon/craft-text-detector

Packaged, Pytorch-based, easy to use, cross-platform version of the CRAFT text detector

actions anaconda computer-vision craft deep-learning document hacktoberfest linux macos neural-network ocr pypi python pytorch text text-detection vision windows workflow

Last synced: 01 Aug 2024

https://github.com/HusseinYoussef/Arabic-OCR

OCR system for Arabic language that converts images of typed text to machine-encoded text.

arabic character-segmentation computer-vision dataset image-processing machine-learning neural-network ocr opencv-python scikit-learn segmentation

Last synced: 01 Aug 2024

https://github.com/ropensci/tesseract

Bindings to Tesseract OCR engine for R

ocr r r-package rstats tesseract tesseract-ocr

Last synced: 02 Aug 2024

https://github.com/SwiftyTesseract/SwiftyTesseract

A Swift wrapper around Tesseract for use in iOS, macOS, and Linux applications

ios ocr optical-character-recognition swift tesseract

Last synced: 09 Aug 2024

https://github.com/epsylon/cintruder

Captcha Intruder (CIntrud3r) is an automatic pentesting tool to bypass captchas.

bruteforcer captcha cintruder ocr pentesting

Last synced: 04 Aug 2024

https://github.com/scott0123/Tesseract-macOS

Objective C wrapper for the open source OCR Engine Tesseract (macOS)

mac macos objective-c ocr screenshot tesseract tesseract-mac tesseract-macos xcode

Last synced: 31 Jul 2024

https://github.com/Dicklesworthstone/llama2_aided_tesseract

Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections, complete with options for text validation and hallucination filtering.

ai-assist hallucinations llama2 llm ocr tesseract

Last synced: 01 Aug 2024

https://github.com/Chandler-Lu/alfred-ocr

OCR & Translate using multiple interfaces for multi platform.

alfred cnocr ocr python quicker zxing

Last synced: 01 Aug 2024

https://github.com/rakeshvar/rnn_ctc

Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.

captcha ctc ctc-loss deep-learning gru lstm neural-network ocr python recurrent-neural-networks rnn rnn-ctc speech-recognition speech-to-text theano

Last synced: 02 Aug 2024

https://github.com/zelon88/hrcloud2

A full-featured home hosted Cloud Drive, Personal Assistant, App Launcher, File Converter, Streamer, Share Tool & More!

antivirus applauncher cloud-drive cloud-platform cloud-storage cms editor enterprise file-converter nextcloud ocr owncloud paas personal-assistants security self-hosted server share-tool streamer wordpress

Last synced: 26 Sep 2024

https://github.com/zelon88/HRCloud2

A full-featured home hosted Cloud Drive, Personal Assistant, App Launcher, File Converter, Streamer, Share Tool & More!

antivirus applauncher cloud-drive cloud-platform cloud-storage cms editor enterprise file-converter nextcloud ocr owncloud paas personal-assistants security self-hosted server share-tool streamer wordpress

Last synced: 31 Jul 2024

https://github.com/robertknight/tesseract-wasm

JS/WebAssembly build of the Tesseract OCR engine for use in browsers and Node

js ocr webassembly

Last synced: 01 Oct 2024

https://github.com/the-black-knight-01/Tabulo

Table Detection and Extraction Using Deep Learning ( It is built in Python, using Luminoth, TensorFlow<2.0 and Sonnet.)

deep-learning detection faster-r-cnn luminoth ocr pdf-table-extraction python sonnet ssd table-data-extraction table-detection table-detection-using-deep-learning table-recognition tabulo tensorflow tesseract

Last synced: 03 Aug 2024

https://github.com/auerswal/ssocr

Seven Segment Optical Character Recognition

image-processing ocr optical-character-recognition seven-segment

Last synced: 17 Aug 2024

https://github.com/googlecloudplatform/document-ai-samples

Sample applications and demos for Document AI, the end-to-end document processing platform on Google Cloud

document-understanding machine-learning ocr pdf python samples

Last synced: 28 Sep 2024

https://github.com/imistyrain/caffe-oneclick

Use caffe to train your own data in just one click

caffe caffe-python caffe-windows ocr

Last synced: 30 Jul 2024

https://github.com/mathewthe2/game2text

Complete toolbox for gamifying language learning

anki language-learning languages ocr yomichan

Last synced: 26 Sep 2024

https://github.com/trekhleb/links-detector

📖 👆🏻 Links Detector makes printed links clickable via your smartphone camera. No need to type a link in, just scan and click on it.

computer-vision javascript machine-learning object-detection ocr tensorflowjs tesnorflow tesseract typescript

Last synced: 03 Oct 2024

https://github.com/matt-m-o/YomiNinja

Open-source OCR and dictionary tool.

dictonary language-learning languages ocr overlay

Last synced: 30 Jul 2024

https://github.com/mindee/react-mindee-js

Front-End Computer Vision SDK for React

computer-vision javascript ocr react reactjs sdk

Last synced: 04 Aug 2024

https://github.com/clean-apps/CleanSCAN

A simple, smart and efficient document scanner for Android

android app document ocr pdf productivity scanner

Last synced: 01 Aug 2024

https://github.com/UB-Mannheim/ocr-fileformat

Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)

alto finereader hocr ocr ocr-d page-xml transformation validation

Last synced: 30 Jul 2024

https://github.com/skylander86/lambda-text-extractor

AWS Lambda functions to extract text from various binary formats.

aws-lambda lambda-functions ocr pdf pdf-ocr-extraction searchable-pdfs tesseract text-extraction

Last synced: 04 Aug 2024

https://github.com/agoldis/react-native-mlkit-ocr

Google on-device MLKit Text Recognition for React Native

mlkit ocr react-native text-recognition

Last synced: 03 Oct 2024

https://github.com/mathewthe2/Game2Text

Complete toolbox for gamifying language learning

anki language-learning languages ocr yomichan

Last synced: 30 Jul 2024

https://github.com/RhetTbull/textinator

Simple MacOS StatusBar / Menu Bar app to automatically detect text in screenshots

macos menubar menubar-app menubarapp ocr osx screenshot utility

Last synced: 01 Aug 2024

https://github.com/xuannianz/differentiablebinarization

DB (Real-time Scene Text Detection with Differentiable Binarization) implementation in Keras and Tensorflow

db differentiable-binarization keras-db ocr text-detection tf-db

Last synced: 01 Oct 2024

https://github.com/AlejandroAkbal/Image-to-Text-OCR

Image to Text is a web tool to extract text from any image using OCR

image nuxt ocr ocr-recognition vite vue web

Last synced: 31 Jul 2024

https://github.com/flming/crnn.tf2

Convolutional Recurrent Neural Network(CRNN) for End-to-End Text Recognition - TensorFlow 2

crnn ctc keras ocr scene-text-recognition tensorflow-lite tensorflow2 tf2

Last synced: 26 Sep 2024

https://github.com/FLming/CRNN.tf2

Convolutional Recurrent Neural Network(CRNN) for End-to-End Text Recognition - TensorFlow 2

crnn ctc keras ocr scene-text-recognition tensorflow-lite tensorflow2 tf2

Last synced: 01 Aug 2024

https://github.com/alimranahmed/laraocr

Laravel Optical Character Reader(OCR) package using ocr engines(Tesseract)

laravel55 ocr tesseract

Last synced: 27 Sep 2024

https://github.com/wzx54321/LockDemo

指纹识别、图形识别、aliOCR识别

demo fingerprint fingerprint-lock gesture-lock gesturelock ocr pattern pattern-lock

Last synced: 31 Jul 2024

https://github.com/yardstick17/image_text_reader

The module extracts text from image using the tesseract-OCR engine. Generally, text present in the images are blur or are of uneven sizes. The image is pre-processed for better comprehension by OCR. This module first makes bounding box for text in images and then normalizes it to 300 dpi, suitable for OCR engine to read.

image-reader image-to-text ocr ocr-text-reader read-image tesseract-ocr

Last synced: 01 Aug 2024

https://github.com/HenryLulu/video-to-text-ocr-demo

视频硬字幕提取

baiducloud ocr opencv

Last synced: 06 Aug 2024

https://github.com/zhiqwang/sightseq

Computer vision tools for fairseq, containing PyTorch implementation of text recognition and object detection

attention crnn ctc densenet faster-rcnn image-captioning mobilenet object-detection ocr pytorch scene-texts text-recognition transformer

Last synced: 07 Aug 2024

https://github.com/rocketraman/sane-scan-pdf

Sane command-line scan-to-pdf script on Linux with OCR and deskew support

deskew linux ocr sane scanner scanning unpaper

Last synced: 01 Oct 2024

https://github.com/thomasjhuang/deep-learning-for-document-dewarping

An application of high resolution GANs to dewarp images of perturbed documents

document gan ocr ocr-recognition pix2pix

Last synced: 30 Jul 2024

https://github.com/dilippuri/PAN-Card-OCR

Retrive meaningful information from PAN Card image using tesseract-ocr :sunglasses:

aadhar ocr optical-character-recognition pan-card

Last synced: 12 Aug 2024

https://github.com/devmaxxing/videocr-PaddleOCR

Extract hardcoded subtitles from videos using machine learning

machine-learning ocr paddleocr paddlepaddle subtitles

Last synced: 04 Aug 2024

https://github.com/simonw/s3-ocr

Tools for running OCR against files stored in S3

ocr s3 textract

Last synced: 01 Oct 2024

https://github.com/AgentMaker/AgentOCR

一个多语言支持、易使用的 OCR 项目。An easy-to-use OCR project with multilingual support.

easy-deploy multilingual ocr onnx

Last synced: 04 Aug 2024

https://github.com/kaelzhang/penteract-ocr

⭐️ The native node.js bindings to the Tesseract OCR project.

n-api nodejs ocr ocr-engine optical-character-recognition penteract tesseract

Last synced: 03 Oct 2024

https://github.com/yizt/keras-ctpn

keras复现场景文本检测网络CPTN: 《Detecting Text in Natural Image with Connectionist Text Proposal Network》;欢迎试用,关注,并反馈问题...

ctpn deep-learning keras ocr text-detection

Last synced: 01 Aug 2024

https://github.com/deepinsight/insightocr

MXNet OCR implementation. Including text recognition and detection.

crnn mxnet ocr text-recognition

Last synced: 01 Aug 2024

https://github.com/dhondta/webgrep

Grep Web pages with extra features like JS deobfuscation and OCR

css-unminifier ctf-tools exif-metadata grep js-deobfuscator ocr python security-tools webpage

Last synced: 04 Aug 2024

https://github.com/Dadangdut33/Screen-Translate

A Screen Translator/OCR Translator made by using Python and Tesseract, the user interface are made using Tkinter. All code written in python.

ocr opencv-python python tesseract-ocr tkinter translate

Last synced: 04 Aug 2024

https://github.com/Kocarus/Manga-Translator-TesseractOCR

Automatically translates manga pages with Tesseract-OCR and Google Translate API for Python

google-translate-api manga ocr opencv-python pytesseract python27 tesseract-ocr translator

Last synced: 09 Aug 2024

https://github.com/kocarus/manga-translator-tesseractocr

Automatically translates manga pages with Tesseract-OCR and Google Translate API for Python

google-translate-api manga ocr opencv-python pytesseract python27 tesseract-ocr translator

Last synced: 01 Oct 2024

https://github.com/opensemanticsearch/open-semantic-search-apps

Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations and named entities) and data import (ETL like text extraction, OCR and crawling filesystems or websites)

django django-application named-entities named-entity-recognition ocr ontologies ontology python research-data-management research-tool search search-interface skos solr solr-client solr-dataimporter thesaurus ui user-interface

Last synced: 27 Sep 2024

https://github.com/008karan/PAN_OCR

Building OCR using YOLO and Tesseract

deep-learning ocr tesseract yolo

Last synced: 02 Aug 2024

https://github.com/rerender2021/language-shadow

A simple ocr translator powered by avernakis react.

avernakis chinese desktop english ocr translation

Last synced: 01 Aug 2024

https://github.com/SkeathyTomas/genshin_artifact_auxiliary

A Genshin Impact artifact rater sticking upon artifacts inside the game window. 刻晴办公桌 | 原神 | 圣遗物评分。集成在游戏窗口之上的原神圣遗物导出、评分工具,无需游戏内外来回切换对比,游戏中快速计算与查阅结果。

genshin-impact ocr paddleocr pyside6 python rapidocr tesseract

Last synced: 02 Aug 2024

https://github.com/louisbrulenaudet/apple-ocr

Easy-to-Use Apple Vision wrapper for text extraction, scalar representation and clustering using K-means.

apple clustering kmeans nlp ocr ocr-recognition pyobjc python scatter-plot sklearn

Last synced: 26 Sep 2024

https://github.com/xuguodong1999/COCR

OCR/OCSR on handwritting ⏣/chemical-structural-formulas with YOLO & CRNN models.

chemical-formula chemistry handwriting-recognition ocr ocsr

Last synced: 02 Aug 2024

https://github.com/beacandler/R2CNN

caffe re-implementation of R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection

caffe deep-learning ocr scene-text-detection

Last synced: 01 Aug 2024

https://github.com/shelfio/aws-lambda-tesseract

6 MB Tesseract (with English training data) to fit inside AWS Lambda

aws-lambda node-module nodejs npm-package ocr optical-character-recognition serverless tesseract

Last synced: 02 Oct 2024

https://github.com/Monogramm/erpnext_ocr

:snake: :alembic: Optical Character Recognition using tesseract within Frappe.

erpnext frappe ocr python tesseract

Last synced: 01 Aug 2024

https://github.com/lexmartinez/ocr-electron-vue

:card_index: A Simple OCR Application built on Electron, Vue.js & Tesseract.js

electron ocr tesseract vuejs

Last synced: 01 Aug 2024

https://github.com/xiaomaxiao/PSENET

keras implementation of PSENET

keras ocr

Last synced: 02 Aug 2024

https://github.com/R0Wi-DEV/workflow_ocr

This is a Nextcloud Workflow App which enables you to process files via OCR on serverside.

nextcloud nextcloud-workflow-ocr ocr pdf-files

Last synced: 01 Aug 2024

https://github.com/Arthelon/imgclip

Command line utility that extracts text from an image into the system clipboard.

cli command-line javascript ocr tesseract

Last synced: 01 Aug 2024

https://github.com/blueaxis/Cloe

Manga OCR snipping application for desktop

manga-ocr ocr ocr-python pyqt5 snipping-tool

Last synced: 30 Jul 2024

https://github.com/nmapx/revolut-stocks-list

Extract Revolut stocks list from the list screenshot(s).

extract image list ocr revolut screenshot stocks tesseract

Last synced: 09 Aug 2024

https://github.com/iron-software/iron-ocr-image-to-text-in-csharp

Image to Text Tutorial in C# - See https://ironsoftware.com/csharp/ocr/tutorials/how-to-read-text-from-an-image-in-csharp-net/

csharp csharp-code imagetotext ocr pdftotext

Last synced: 28 Sep 2024

https://github.com/farhanchoudhary/PAN_Card_OCR_Project

To extract details from Indian National Identification Cards such as PAN (completed) & Aadhar, Passport, Driving License (WIP) in a structured format

image-processing image-to-text ocr optical-character-recognition pan pan-card pytesseract tesseract

Last synced: 12 Aug 2024