Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with ocr
A curated list of projects in awesome lists tagged with ocr .
https://github.com/junhoyeo/betterocr
🔍 Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with 🧠 LLM.
ai chatgpt chatgpt-api easyocr llm ocr openai openai-api tesseract tesseract-ocr
Last synced: 02 Aug 2024
https://github.com/ahmetozlu/signature_extractor
A super lightweight image processing algorithm for detection and extraction of overlapped handwritten signatures on scanned documents using OpenCV and scikit-image.
handwritten-signatures image-processing image-segmentation ocr ocr-engine optical-character-recognition scanned-documents signature-detection signature-extraction-algorithm signature-extractor signature-recognition signature-verification
Last synced: 05 Aug 2024
https://github.com/TalkUHulk/ai.deploy.box
A toolbox for deep learning model deployment using C++ YoloX | YoloV7 | YoloV8 | Gan | OCR | MobileVit | Scrfd | MobileSAM | StableDiffusion
controlnet cpp face gan lora mnn mobilesam ncnn ocr onnx paddlelite scrfd stablediffusion tnn webassembly yolov7 yolov8 yolox
Last synced: 31 Jul 2024
https://github.com/Haste171/langchain-chatbot
AI Chatbot for analyzing/extracting information from data in conversational format.
ai artificial-intelligence bot chromadb discord discord-bot embeddings extractive-question-answering gpt-3 gpt-4 langchain ocr openai openai-api openai-api-chatbot pdf pdf-chat-bot pdf-ocr pinecone vector-database
Last synced: 31 Jul 2024
https://github.com/Holmeyoung/crnn-pytorch
Pytorch implementation of CRNN (CNN + RNN + CTCLoss) for all language OCR.
Last synced: 01 Aug 2024
https://github.com/watsonyanghx/CNN_LSTM_CTC_Tensorflow
CNN+LSTM+CTC based OCR implemented using tensorflow.
Last synced: 01 Aug 2024
https://github.com/lucasrla/remarks
Extract annotations (highlights and scribbles) from PDF, EPUB, and notebooks marked with reMarkable tablets. Export to Markdown, PDF, PNG, SVG
annotations epub highlighting markdown obsidian ocr ocrmypdf pdf pdf-converter pymupdf remarkable-tablet roamresearch svg-images zotero
Last synced: 30 Sep 2024
https://github.com/Megvii-CSG/MegReader
A research project for text detection and recognition using PyTorch 1.2.
ctc deep-learning ocr pytorch text-detection text-detection-recognition text-recognition
Last synced: 01 Aug 2024
https://github.com/SubhamTyagi/android-ocr
Tesseract based OCR for android
android fdroid foss image-reader math-ocr ocr ocr-android ocr-recognition ocr-text-reader reader tesseract
Last synced: 06 Aug 2024
https://github.com/qurator-spk/eynollah
Document Layout Analysis
document-layout-analysis ocr qurator segmentation textline-detection
Last synced: 01 Aug 2024
https://github.com/danschultzer/receipt-scanner
Receipt scanner extracts information from your PDF or image receipts - built in NodeJS
extract-data extract-information ocr optical-character-recognition receipt-scanner receipts
Last synced: 01 Oct 2024
https://github.com/pd3f/pd3f
🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based
extract-text language-model machine-learning ocr parsr pd3f pdf pdf-to-text pipeline python text-extraction
Last synced: 01 Aug 2024
https://github.com/scambier/obsidian-text-extractor
A (companion) plugin to facilitate the extraction of text from images (OCR) and PDFs.
obsidian obsidian-plugin ocr pdf
Last synced: 01 Aug 2024
https://github.com/blueaxis/Poricom
Optical character recognition in manga images. Manga OCR desktop application
manga-ocr manga-reader ocr pyqt5-gui
Last synced: 30 Jul 2024
https://github.com/SWHL/RapidVideOCR
Extract video hard subtitles and automatically generate corresponding srt files.
ocr subtitle video videosubfinder
Last synced: 09 Aug 2024
https://github.com/matiastucci/ionic-ocr-example
:camera: Simple Ionic app using ocrad.js
Last synced: 29 Sep 2024
https://github.com/brakmic/OpenCV
:camera: Computer-Vision Demos
computer-vision ocr ocr-recognition opencv scanimage scanned-documents scanning vision
Last synced: 01 Aug 2024
https://github.com/ExtractTable/ExtractTable-py
Python library to extract tabular data from images and scanned PDFs
extracttable image-table-recognition ocr pdf-table-extract table-extraction tabular-data
Last synced: 01 Aug 2024
https://github.com/Psarpei/Multi-Type-TD-TSR
Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition:
algorithms computer-science computer-vision computer-vision-algorithms computer-vision-opencv deep-learning image-processing machine-learning machine-learning-algorithms natural-language-processing nlp nlp-machine-learning ocr ocr-python ocr-recognition table-detection table-detection-using-deep-learning table-structure-recognition
Last synced: 01 Aug 2024
https://github.com/cseas/ocr-table
Extract tables from scanned image PDFs using Optical Character Recognition.
extract-tables ocr ocr-table optical-character-recognition pdfminer python scanned-image-pdfs shell tesseract
Last synced: 01 Aug 2024
https://github.com/fcakyon/craft-text-detector
Packaged, Pytorch-based, easy to use, cross-platform version of the CRAFT text detector
actions anaconda computer-vision craft deep-learning document hacktoberfest linux macos neural-network ocr pypi python pytorch text text-detection vision windows workflow
Last synced: 01 Aug 2024
https://github.com/HusseinYoussef/Arabic-OCR
OCR system for Arabic language that converts images of typed text to machine-encoded text.
arabic character-segmentation computer-vision dataset image-processing machine-learning neural-network ocr opencv-python scikit-learn segmentation
Last synced: 01 Aug 2024
https://github.com/ropensci/tesseract
Bindings to Tesseract OCR engine for R
ocr r r-package rstats tesseract tesseract-ocr
Last synced: 02 Aug 2024
https://github.com/SwiftyTesseract/SwiftyTesseract
A Swift wrapper around Tesseract for use in iOS, macOS, and Linux applications
ios ocr optical-character-recognition swift tesseract
Last synced: 09 Aug 2024
https://github.com/mkucej/i-librarian-free
I, Librarian - open-source version of a PDF managing SaaS.
arxiv crossref document-management doi groupware ieee nasa ocr officedocs pdf pdf-viewer php pubmed remote-work research-and-development saas web-based
Last synced: 01 Aug 2024
https://github.com/epsylon/cintruder
Captcha Intruder (CIntrud3r) is an automatic pentesting tool to bypass captchas.
bruteforcer captcha cintruder ocr pentesting
Last synced: 04 Aug 2024
https://github.com/scott0123/Tesseract-macOS
Objective C wrapper for the open source OCR Engine Tesseract (macOS)
mac macos objective-c ocr screenshot tesseract tesseract-mac tesseract-macos xcode
Last synced: 31 Jul 2024
https://github.com/Dicklesworthstone/llama2_aided_tesseract
Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections, complete with options for text validation and hallucination filtering.
ai-assist hallucinations llama2 llm ocr tesseract
Last synced: 01 Aug 2024
https://github.com/rakeshvar/rnn_ctc
Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.
captcha ctc ctc-loss deep-learning gru lstm neural-network ocr python recurrent-neural-networks rnn rnn-ctc speech-recognition speech-to-text theano
Last synced: 02 Aug 2024
https://github.com/maxent-ai/ocrpy
OCR, Archive, Index and Search: Implementation agnostic OCR framework.
aws azure computer-vision cv deep-learning google-vision-api image-processing information-retrieval nlp ocr ocr-python python semantic-search tesseract-ocr transformers
Last synced: 02 Aug 2024
https://github.com/zelon88/hrcloud2
A full-featured home hosted Cloud Drive, Personal Assistant, App Launcher, File Converter, Streamer, Share Tool & More!
antivirus applauncher cloud-drive cloud-platform cloud-storage cms editor enterprise file-converter nextcloud ocr owncloud paas personal-assistants security self-hosted server share-tool streamer wordpress
Last synced: 26 Sep 2024
https://github.com/zelon88/HRCloud2
A full-featured home hosted Cloud Drive, Personal Assistant, App Launcher, File Converter, Streamer, Share Tool & More!
antivirus applauncher cloud-drive cloud-platform cloud-storage cms editor enterprise file-converter nextcloud ocr owncloud paas personal-assistants security self-hosted server share-tool streamer wordpress
Last synced: 31 Jul 2024
https://github.com/robertknight/tesseract-wasm
JS/WebAssembly build of the Tesseract OCR engine for use in browsers and Node
Last synced: 01 Oct 2024
https://github.com/the-black-knight-01/Tabulo
Table Detection and Extraction Using Deep Learning ( It is built in Python, using Luminoth, TensorFlow<2.0 and Sonnet.)
deep-learning detection faster-r-cnn luminoth ocr pdf-table-extraction python sonnet ssd table-data-extraction table-detection table-detection-using-deep-learning table-recognition tabulo tensorflow tesseract
Last synced: 03 Aug 2024
https://github.com/auerswal/ssocr
Seven Segment Optical Character Recognition
image-processing ocr optical-character-recognition seven-segment
Last synced: 17 Aug 2024
https://github.com/autonise/CRAFT-Remade
Implementation of CRAFT Text Detection
craft detection ocr pytorch pytorch-implementation text-detection weak-supervision
Last synced: 01 Aug 2024
https://github.com/godruoyi/ocr
The Best Image OCR SDK For BAT
aliyun-ocr baidu-ocr image-recognition laravel-ocr ocr php-ocr tencent-ocr
Last synced: 03 Oct 2024
https://github.com/googlecloudplatform/document-ai-samples
Sample applications and demos for Document AI, the end-to-end document processing platform on Google Cloud
document-understanding machine-learning ocr pdf python samples
Last synced: 28 Sep 2024
https://github.com/imistyrain/caffe-oneclick
Use caffe to train your own data in just one click
caffe caffe-python caffe-windows ocr
Last synced: 30 Jul 2024
https://github.com/mathewthe2/game2text
Complete toolbox for gamifying language learning
anki language-learning languages ocr yomichan
Last synced: 26 Sep 2024
https://github.com/trekhleb/links-detector
📖 👆🏻 Links Detector makes printed links clickable via your smartphone camera. No need to type a link in, just scan and click on it.
computer-vision javascript machine-learning object-detection ocr tensorflowjs tesnorflow tesseract typescript
Last synced: 03 Oct 2024
https://github.com/matt-m-o/YomiNinja
Open-source OCR and dictionary tool.
dictonary language-learning languages ocr overlay
Last synced: 30 Jul 2024
https://github.com/mindee/react-mindee-js
Front-End Computer Vision SDK for React
computer-vision javascript ocr react reactjs sdk
Last synced: 04 Aug 2024
https://github.com/stb-tester/stb-tester
Automated Testing for Set-Top Boxes and Smart TVs
computer-vision gstreamer hdmi hdmi-cec lirc numpy ocr opencv page-object power-control pylint python remote-control set-top-box template-matching test-automation test-framework testing video video-processing
Last synced: 28 Sep 2024
https://github.com/clean-apps/CleanSCAN
A simple, smart and efficient document scanner for Android
android app document ocr pdf productivity scanner
Last synced: 01 Aug 2024
https://github.com/ReceiptManager/receipt-manager-app
Receipt parser application written in dart.
android android-studio application flutter flutter-apps ocr ocr-recognition receipt receipt-parser receipt-scanner tesseract-ocr
Last synced: 02 Aug 2024
https://github.com/UB-Mannheim/ocr-fileformat
Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)
alto finereader hocr ocr ocr-d page-xml transformation validation
Last synced: 30 Jul 2024
https://github.com/skylander86/lambda-text-extractor
AWS Lambda functions to extract text from various binary formats.
aws-lambda lambda-functions ocr pdf pdf-ocr-extraction searchable-pdfs tesseract text-extraction
Last synced: 04 Aug 2024
https://github.com/agoldis/react-native-mlkit-ocr
Google on-device MLKit Text Recognition for React Native
mlkit ocr react-native text-recognition
Last synced: 03 Oct 2024
https://github.com/mathewthe2/Game2Text
Complete toolbox for gamifying language learning
anki language-learning languages ocr yomichan
Last synced: 30 Jul 2024
https://github.com/RhetTbull/textinator
Simple MacOS StatusBar / Menu Bar app to automatically detect text in screenshots
macos menubar menubar-app menubarapp ocr osx screenshot utility
Last synced: 01 Aug 2024
https://github.com/HsiehYiChia/Scene-text-recognition
Scene text detection and recognition based on Extremal Region(ER)
adaboost algorithm canny cascade-classifier chaincode classifier computer-vision detection image-processing lbp machine-learning mser non-maximum-suppression ocr opencv scene-text-detection scene-text-recognition spelling-checker svm text-recognition
Last synced: 01 Aug 2024
https://github.com/LexPredict/lexpredict-contraxsuite
LexPredict ContraxSuite
analytics contracts documents legal legaltech machine-learning natural-language-processing ocr
Last synced: 31 Jul 2024
https://github.com/caltechlibrary/handprint
Apply different text recognition services to images of handwritten documents.
amazon-rekognition amazon-textract google-api google-cloud google-vision-api handwritten-text-recognition htr library-automation machine-learning microsoft-azure ocr optical-character-recognition python
Last synced: 01 Aug 2024
https://github.com/xuannianz/differentiablebinarization
DB (Real-time Scene Text Detection with Differentiable Binarization) implementation in Keras and Tensorflow
db differentiable-binarization keras-db ocr text-detection tf-db
Last synced: 01 Oct 2024
https://github.com/AlejandroAkbal/Image-to-Text-OCR
Image to Text is a web tool to extract text from any image using OCR
image nuxt ocr ocr-recognition vite vue web
Last synced: 31 Jul 2024
https://github.com/HaozhengLi/EAST_ICPR
Forked from argman/EAST for the ICPR MTWI 2018 CHALLENGE
detect detecting detection detector east icdar icdar2015 icpr icpr2018 mtwi mtwi2018 ocr tensorflow text text-detect text-detecting text-detection text-detector
Last synced: 01 Aug 2024
https://github.com/flming/crnn.tf2
Convolutional Recurrent Neural Network(CRNN) for End-to-End Text Recognition - TensorFlow 2
crnn ctc keras ocr scene-text-recognition tensorflow-lite tensorflow2 tf2
Last synced: 26 Sep 2024
https://github.com/FLming/CRNN.tf2
Convolutional Recurrent Neural Network(CRNN) for End-to-End Text Recognition - TensorFlow 2
crnn ctc keras ocr scene-text-recognition tensorflow-lite tensorflow2 tf2
Last synced: 01 Aug 2024
https://github.com/alimranahmed/laraocr
Laravel Optical Character Reader(OCR) package using ocr engines(Tesseract)
Last synced: 27 Sep 2024
https://github.com/wzx54321/LockDemo
指纹识别、图形识别、aliOCR识别
demo fingerprint fingerprint-lock gesture-lock gesturelock ocr pattern pattern-lock
Last synced: 31 Jul 2024
https://github.com/yardstick17/image_text_reader
The module extracts text from image using the tesseract-OCR engine. Generally, text present in the images are blur or are of uneven sizes. The image is pre-processed for better comprehension by OCR. This module first makes bounding box for text in images and then normalizes it to 300 dpi, suitable for OCR engine to read.
image-reader image-to-text ocr ocr-text-reader read-image tesseract-ocr
Last synced: 01 Aug 2024
https://github.com/jiangxiluning/MASTER-TF
MASTER
cv deep-learning ocr ocr-recognition scene-text-recognition transformer
Last synced: 01 Aug 2024
https://github.com/zeitlings/alfred-workflows
Collected Alfred Workflows & Proof of Concept
alfred alfred-workflow automation converter devonthink dictionary filesystem macos ocr pdf pdf-compression text-recognition window-switcher
Last synced: 31 Jul 2024
https://github.com/zhiqwang/sightseq
Computer vision tools for fairseq, containing PyTorch implementation of text recognition and object detection
attention crnn ctc densenet faster-rcnn image-captioning mobilenet object-detection ocr pytorch scene-texts text-recognition transformer
Last synced: 07 Aug 2024
https://github.com/thomasjhuang/deep-learning-for-document-dewarping
An application of high resolution GANs to dewarp images of perturbed documents
document gan ocr ocr-recognition pix2pix
Last synced: 30 Jul 2024
https://github.com/dilippuri/PAN-Card-OCR
Retrive meaningful information from PAN Card image using tesseract-ocr :sunglasses:
aadhar ocr optical-character-recognition pan-card
Last synced: 12 Aug 2024
https://github.com/devmaxxing/videocr-PaddleOCR
Extract hardcoded subtitles from videos using machine learning
machine-learning ocr paddleocr paddlepaddle subtitles
Last synced: 04 Aug 2024
https://github.com/simonw/s3-ocr
Tools for running OCR against files stored in S3
Last synced: 01 Oct 2024
https://github.com/AgentMaker/AgentOCR
一个多语言支持、易使用的 OCR 项目。An easy-to-use OCR project with multilingual support.
easy-deploy multilingual ocr onnx
Last synced: 04 Aug 2024
https://github.com/kaelzhang/penteract-ocr
⭐️ The native node.js bindings to the Tesseract OCR project.
n-api nodejs ocr ocr-engine optical-character-recognition penteract tesseract
Last synced: 03 Oct 2024
https://github.com/yizt/keras-ctpn
keras复现场景文本检测网络CPTN: 《Detecting Text in Natural Image with Connectionist Text Proposal Network》;欢迎试用,关注,并反馈问题...
ctpn deep-learning keras ocr text-detection
Last synced: 01 Aug 2024
https://github.com/deepinsight/insightocr
MXNet OCR implementation. Including text recognition and detection.
crnn mxnet ocr text-recognition
Last synced: 01 Aug 2024
https://github.com/dhondta/webgrep
Grep Web pages with extra features like JS deobfuscation and OCR
css-unminifier ctf-tools exif-metadata grep js-deobfuscator ocr python security-tools webpage
Last synced: 04 Aug 2024
https://github.com/Dadangdut33/Screen-Translate
A Screen Translator/OCR Translator made by using Python and Tesseract, the user interface are made using Tkinter. All code written in python.
ocr opencv-python python tesseract-ocr tkinter translate
Last synced: 04 Aug 2024
https://github.com/Kocarus/Manga-Translator-TesseractOCR
Automatically translates manga pages with Tesseract-OCR and Google Translate API for Python
google-translate-api manga ocr opencv-python pytesseract python27 tesseract-ocr translator
Last synced: 09 Aug 2024
https://github.com/kocarus/manga-translator-tesseractocr
Automatically translates manga pages with Tesseract-OCR and Google Translate API for Python
google-translate-api manga ocr opencv-python pytesseract python27 tesseract-ocr translator
Last synced: 01 Oct 2024
https://github.com/opensemanticsearch/open-semantic-search-apps
Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations and named entities) and data import (ETL like text extraction, OCR and crawling filesystems or websites)
django django-application named-entities named-entity-recognition ocr ontologies ontology python research-data-management research-tool search search-interface skos solr solr-client solr-dataimporter thesaurus ui user-interface
Last synced: 27 Sep 2024
https://github.com/fauu/Kamite
Japanese immersion assistant for learners (Windows/Linux)
anime deepl immersion japanese japanese-language japanese-study java language-learning languages linux manga mpv ocr solidjs tesseract textractor typescript visual-novel visual-novels yomichan
Last synced: 30 Jul 2024
https://github.com/008karan/PAN_OCR
Building OCR using YOLO and Tesseract
deep-learning ocr tesseract yolo
Last synced: 02 Aug 2024
https://github.com/rerender2021/language-shadow
A simple ocr translator powered by avernakis react.
avernakis chinese desktop english ocr translation
Last synced: 01 Aug 2024
https://github.com/SkeathyTomas/genshin_artifact_auxiliary
A Genshin Impact artifact rater sticking upon artifacts inside the game window. 刻晴办公桌 | 原神 | 圣遗物评分。集成在游戏窗口之上的原神圣遗物导出、评分工具,无需游戏内外来回切换对比,游戏中快速计算与查阅结果。
genshin-impact ocr paddleocr pyside6 python rapidocr tesseract
Last synced: 02 Aug 2024
https://github.com/louisbrulenaudet/apple-ocr
Easy-to-Use Apple Vision wrapper for text extraction, scalar representation and clustering using K-means.
apple clustering kmeans nlp ocr ocr-recognition pyobjc python scatter-plot sklearn
Last synced: 26 Sep 2024
https://github.com/xuguodong1999/COCR
OCR/OCSR on handwritting ⏣/chemical-structural-formulas with YOLO & CRNN models.
chemical-formula chemistry handwriting-recognition ocr ocsr
Last synced: 02 Aug 2024
https://github.com/beacandler/R2CNN
caffe re-implementation of R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection
caffe deep-learning ocr scene-text-detection
Last synced: 01 Aug 2024
https://github.com/shelfio/aws-lambda-tesseract
6 MB Tesseract (with English training data) to fit inside AWS Lambda
aws-lambda node-module nodejs npm-package ocr optical-character-recognition serverless tesseract
Last synced: 02 Oct 2024
https://github.com/lexmartinez/ocr-electron-vue
:card_index: A Simple OCR Application built on Electron, Vue.js & Tesseract.js
Last synced: 01 Aug 2024
https://github.com/R0Wi-DEV/workflow_ocr
This is a Nextcloud Workflow App which enables you to process files via OCR on serverside.
nextcloud nextcloud-workflow-ocr ocr pdf-files
Last synced: 01 Aug 2024
https://github.com/Arthelon/imgclip
Command line utility that extracts text from an image into the system clipboard.
cli command-line javascript ocr tesseract
Last synced: 01 Aug 2024
https://github.com/blueaxis/Cloe
Manga OCR snipping application for desktop
manga-ocr ocr ocr-python pyqt5 snipping-tool
Last synced: 30 Jul 2024
https://github.com/nmapx/revolut-stocks-list
Extract Revolut stocks list from the list screenshot(s).
extract image list ocr revolut screenshot stocks tesseract
Last synced: 09 Aug 2024
https://github.com/iron-software/iron-ocr-image-to-text-in-csharp
Image to Text Tutorial in C# - See https://ironsoftware.com/csharp/ocr/tutorials/how-to-read-text-from-an-image-in-csharp-net/
csharp csharp-code imagetotext ocr pdftotext
Last synced: 28 Sep 2024
https://github.com/farhanchoudhary/PAN_Card_OCR_Project
To extract details from Indian National Identification Cards such as PAN (completed) & Aadhar, Passport, Driving License (WIP) in a structured format
image-processing image-to-text ocr optical-character-recognition pan pan-card pytesseract tesseract
Last synced: 12 Aug 2024