Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with tesseract
A curated list of projects in awesome lists tagged with tesseract .
https://github.com/tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository)
hacktoberfest lstm machine-learning ocr ocr-engine tesseract tesseract-ocr
Last synced: 16 Dec 2024
https://github.com/naptha/tesseract.js
Pure Javascript OCR for more than 100 Languages 📖🎉🖥
deep-learning javascript ocr tesseract webassembly
Last synced: 16 Dec 2024
https://github.com/ocrmypdf/ocrmypdf
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
image-processing ocr pdf python tesseract
Last synced: 16 Dec 2024
https://github.com/jbarlow83/OCRmyPDF
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
image-processing ocr pdf python tesseract
Last synced: 16 Nov 2024
https://github.com/ocrmypdf/OCRmyPDF
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
image-processing ocr pdf python tesseract
Last synced: 26 Oct 2024
https://github.com/tesseract-ocr/tessdata
Trained models with fast variant of the "best" LSTM models + legacy models
Last synced: 01 Dec 2024
https://github.com/pymupdf/PyMuPDF
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
data-science epub extract-data font mupdf ocr pdf pdf-documents pymupdf python table-extraction tesseract text-processing text-shaping xps
Last synced: 06 Nov 2024
https://github.com/pymupdf/pymupdf
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
data-science epub extract-data font mupdf ocr pdf pdf-documents pymupdf python table-extraction tesseract text-processing text-shaping xps
Last synced: 16 Dec 2024
https://github.com/tebelorg/rpa-python
Python package for doing RPA
cross-platform opencv python rpa sikuli tagui tesseract
Last synced: 16 Dec 2024
https://github.com/tebelorg/RPA-Python
Python package for doing RPA
cross-platform opencv python rpa sikuli tagui tesseract
Last synced: 29 Oct 2024
https://github.com/thiagoalessio/tesseract-ocr-for-php
A wrapper to work with Tesseract OCR inside PHP.
image-to-text ocr php tesseract text-recognition
Last synced: 17 Dec 2024
https://github.com/otiai10/gosseract
Go package for OCR (Optical Character Recognition), by using Tesseract C++ library
go ocr ocr-server tesseract tesseract-ocr
Last synced: 17 Dec 2024
https://github.com/dicklesworthstone/llm_aided_ocr
Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.
ai-assist llama2 llm ocr ocr-correction tesseract
Last synced: 19 Dec 2024
https://github.com/rmtheis/android-ocr
Experimental optical character recognition app
android ocr optical-character-recognition tesseract
Last synced: 25 Sep 2024
https://github.com/sirfz/tesserocr
A Python wrapper for the tesseract-ocr API
cython ocr optical-character-recognition python-library tesseract
Last synced: 17 Dec 2024
https://github.com/ianzhao05/textshot
Python tool for grabbing text via screenshot
ocr ocr-recognition python python-3 python-script python3 screenshot script tesseract tesseract-ocr
Last synced: 20 Dec 2024
https://github.com/Pulover/PuloversMacroCreator
Automation Utility - Recorder & Script Generator
ahk autohotkey automation robot rpa tesseract
Last synced: 28 Oct 2024
https://github.com/Dicklesworthstone/llm_aided_ocr
Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.
ai-assist llama2 llm ocr ocr-correction tesseract
Last synced: 31 Aug 2024
https://github.com/gauravsingh9356/j.a.r.v.i.s
Personal Assistant built using python libraries. It does almost anything which includes sending emails, Optical Text Recognition, Dynamic News Reporting at any time with API integration, Todo list generator, Opens any website with just a voice command, Plays Music, Wikipedia searching, Dictionary with Intelligent Sensing i.e. auto spell checking, Weather Reporting i.e. temp, wind speed, humidity, YouTube searching, Google Map searching, Youtube Downloading, etc.
chatgpt dictionary-application difflib hacktoberfest hactober-accepted hactoberfest2021 jarvis jarvis-ai newsapi opencv optical-character-recognition optical-text-recognition python python3 pyttsx3 speech-recognition tesseract tesseract-ocr weather-api webbrowser
Last synced: 15 Dec 2024
https://github.com/GauravSingh9356/J.A.R.V.I.S
Personal Assistant built using python libraries. It does almost anything which includes sending emails, Optical Text Recognition, Dynamic News Reporting at any time with API integration, Todo list generator, Opens any website with just a voice command, Plays Music, Wikipedia searching, Dictionary with Intelligent Sensing i.e. auto spell checking, Weather Reporting i.e. temp, wind speed, humidity, YouTube searching, Google Map searching, Youtube Downloading, etc.
chatgpt dictionary-application difflib hacktoberfest hactober-accepted hactoberfest2021 jarvis jarvis-ai newsapi opencv optical-character-recognition optical-text-recognition python python3 pyttsx3 speech-recognition tesseract tesseract-ocr weather-api webbrowser
Last synced: 18 Nov 2024
https://github.com/dannnylo/rtesseract
Ruby library for working with the Tesseract OCR.
hacktoberfest rtesseract ruby tesseract tesseract-ocr
Last synced: 20 Dec 2024
https://github.com/akylas/oss-documentscanner
Android document document scanning app
android document document-scan document-scan-to-text document-scanner document-scanner-app document-scanning image-processing opencv pdf scanner tesseract zxingcpp
Last synced: 20 Dec 2024
https://github.com/Akylas/OSS-DocumentScanner
Android document document scanning app
android document document-scan document-scan-to-text document-scanner document-scanner-app document-scanning image-processing opencv pdf scanner tesseract zxingcpp
Last synced: 09 Nov 2024
https://github.com/ccextractor/ccextractor
CCExtractor - Official version maintained by the core team
c cea-608 cea-708 dvb hacktoberfest hacktoberfest2021 image image-processing ocr rust subtitles teletext tesseract tesseract-ocr video
Last synced: 21 Dec 2024
https://github.com/CCExtractor/ccextractor
CCExtractor - Official version maintained by the core team
c cea-608 cea-708 dvb hacktoberfest hacktoberfest2021 image image-processing ocr rust subtitles teletext tesseract tesseract-ocr video
Last synced: 05 Nov 2024
https://github.com/jonathanpalma/react-native-tesseract-ocr
Tesseract OCR wrapper for React Native
android ocr optical-character-recognition react-native tesseract tesseract-ocr text-detection text-detector text-recognition
Last synced: 21 Dec 2024
https://github.com/junhoyeo/betterocr
🔍 Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with 🧠 LLM.
ai chatgpt chatgpt-api easyocr llm ocr openai openai-api tesseract tesseract-ocr
Last synced: 09 Nov 2024
https://github.com/tesseract-ocr/tessdata_fast
Fast integer versions of trained LSTM models
Last synced: 17 Nov 2024
https://github.com/SubhamTyagi/android-ocr
Tesseract based OCR for android
android fdroid foss image-reader math-ocr ocr ocr-android ocr-recognition ocr-text-reader reader tesseract
Last synced: 25 Nov 2024
https://github.com/prabhakar267/image2text
:clipboard: Python wrapper to grab text from images and save as text files using Tesseract Engine
image2text ocr optical-character-recognition python-wrapper tesseract tesseract-engine tesseract-installation tesseract-ocr
Last synced: 21 Dec 2024
https://github.com/zapolnoch/node-tesseract-ocr
A Node.js wrapper for the Tesseract OCR API
image-to-text ocr tesseract text-recognition
Last synced: 16 Dec 2024
https://github.com/cseas/ocr-table
Extract tables from scanned image PDFs using Optical Character Recognition.
extract-tables ocr ocr-table optical-character-recognition pdfminer python scanned-image-pdfs shell tesseract
Last synced: 18 Dec 2024
https://github.com/ropensci/tesseract
Bindings to Tesseract OCR engine for R
ocr r r-package rstats tesseract tesseract-ocr
Last synced: 10 Nov 2024
https://github.com/SwiftyTesseract/SwiftyTesseract
A Swift wrapper around Tesseract for use in iOS, macOS, and Linux applications
ios ocr optical-character-recognition swift tesseract
Last synced: 29 Nov 2024
https://github.com/scott0123/Tesseract-macOS
Objective C wrapper for the open source OCR Engine Tesseract (macOS)
mac macos objective-c ocr screenshot tesseract tesseract-mac tesseract-macos xcode
Last synced: 30 Oct 2024
https://github.com/scherroman/mugen
A command-line music video generator based on rhythm
amv audio command-line librosa montage moviepy mugen music-video python remix rhythm tesseract video
Last synced: 14 Oct 2024
https://github.com/the-black-knight-01/tabulo
Table Detection and Extraction Using Deep Learning ( It is built in Python, using Luminoth, TensorFlow<2.0 and Sonnet.)
deep-learning detection faster-r-cnn luminoth ocr pdf-table-extraction python sonnet ssd table-data-extraction table-detection table-detection-using-deep-learning table-recognition tabulo tensorflow tesseract
Last synced: 19 Dec 2024
https://github.com/the-black-knight-01/Tabulo
Table Detection and Extraction Using Deep Learning ( It is built in Python, using Luminoth, TensorFlow<2.0 and Sonnet.)
deep-learning detection faster-r-cnn luminoth ocr pdf-table-extraction python sonnet ssd table-data-extraction table-detection table-detection-using-deep-learning table-recognition tabulo tensorflow tesseract
Last synced: 17 Nov 2024
https://github.com/trekhleb/links-detector
📖 👆🏻 Links Detector makes printed links clickable via your smartphone camera. No need to type a link in, just scan and click on it.
computer-vision javascript machine-learning object-detection ocr tensorflowjs tesnorflow tesseract typescript
Last synced: 19 Dec 2024
https://github.com/skylander86/lambda-text-extractor
AWS Lambda functions to extract text from various binary formats.
aws-lambda lambda-functions ocr pdf pdf-ocr-extraction searchable-pdfs tesseract text-extraction
Last synced: 22 Nov 2024
https://github.com/alimranahmed/laraocr
Laravel Optical Character Reader(OCR) package using ocr engines(Tesseract)
Last synced: 21 Dec 2024
https://github.com/dilippuri/aadhaar-card-ocr
Extract text information from Aadhaar Card using tesseract-ocr :sunglasses:
aadhaar ocr tesseract tesseract-ocr
Last synced: 27 Oct 2024
https://github.com/kaelzhang/penteract-ocr
⭐️ The native node.js bindings to the Tesseract OCR project.
n-api nodejs ocr ocr-engine optical-character-recognition penteract tesseract
Last synced: 01 Nov 2024
https://github.com/ndavd/ncube
Generalized Hypercube Visualizer
bevy hypercube mathematics rust simulation tesseract
Last synced: 16 Dec 2024
https://github.com/bweigel/aws-lambda-tesseract-layer
A layer for AWS Lambda containing the tesseract C libraries and tesseract executable.
amazon-linux aws-lambda lambda lambda-layer serverless serverless-framework tesseract
Last synced: 02 Nov 2024
https://github.com/fauu/Kamite
Japanese immersion assistant for learners (Windows/Linux)
anime deepl immersion japanese japanese-language japanese-study java language-learning languages linux manga mpv ocr solidjs tesseract textractor typescript visual-novel visual-novels yomichan
Last synced: 25 Oct 2024
https://github.com/fauu/kamite
Japanese immersion assistant for learners (Windows/Linux)
anime deepl immersion japanese japanese-language japanese-study java language-learning languages linux manga mpv ocr solidjs tesseract textractor typescript visual-novel visual-novels yomichan
Last synced: 11 Nov 2024
https://github.com/victorqribeiro/ocr
Simple app to extract text from pictures using Tesseract
image-recognition ocr tesseract text-extraction text-recognition
Last synced: 17 Nov 2024
https://github.com/008karan/PAN_OCR
Building OCR using YOLO and Tesseract
deep-learning ocr tesseract yolo
Last synced: 09 Nov 2024
https://github.com/SkeathyTomas/genshin_artifact_auxiliary
A Genshin Impact artifact rater sticking upon artifacts inside the game window. 刻晴办公桌 | 原神 | 圣遗物评分。集成在游戏窗口之上的原神圣遗物导出、评分工具,无需游戏内外来回切换对比,游戏中快速计算与查阅结果。
genshin-impact ocr paddleocr pyside6 python rapidocr tesseract
Last synced: 10 Nov 2024
https://github.com/onmyway133/mathsolver
⌨️Camera calculator with Vision
reverse-polish-notation shunting-yard-algorithm swift tesseract vision
Last synced: 02 Nov 2024
https://github.com/testica/text-scanner
OCR Android app using tesseract
android google-play java ocr-android tesseract
Last synced: 24 Nov 2024
https://github.com/hertzg/tesseract-server
A small lightweight HTTP server that converts photos, images and scanned documents to text using optical character recognition by utilizing the power of Google Tesseract.
api container containers docker docker-compose docker-image hacktoberfest http-server image-processing ocr rest-api tesseract tesseract-server typescript
Last synced: 16 Dec 2024
https://github.com/shelfio/aws-lambda-tesseract
6 MB Tesseract (with English training data) to fit inside AWS Lambda
aws-lambda node-module nodejs npm-package ocr optical-character-recognition serverless tesseract
Last synced: 20 Dec 2024
https://github.com/lexmartinez/ocr-electron-vue
:card_index: A Simple OCR Application built on Electron, Vue.js & Tesseract.js
Last synced: 04 Nov 2024
https://github.com/Arthelon/imgclip
Command line utility that extracts text from an image into the system clipboard.
cli command-line javascript ocr tesseract
Last synced: 07 Nov 2024
https://github.com/shreeshrii/tess5train-fonts
Files and Scripts to run Tesseract 5 LSTM Training using fonts
lstm lstm-neural-networks tesseract tesseract-ocr training tutorial
Last synced: 26 Nov 2024
https://github.com/farhanchoudhary/PAN_Card_OCR_Project
To extract details from Indian National Identification Cards such as PAN (completed) & Aadhar, Passport, Driving License (WIP) in a structured format
image-processing image-to-text ocr optical-character-recognition pan pan-card pytesseract tesseract
Last synced: 02 Dec 2024
https://github.com/tesseract-ocr/tesseract-ocr.github.io
Tesseract documentation
Last synced: 17 Nov 2024
https://github.com/nmapx/revolut-stocks-list
Extract Revolut stocks list from the list screenshot(s).
extract image list ocr revolut screenshot stocks tesseract
Last synced: 28 Nov 2024
https://github.com/vjgpt/vehicle-number-plate-reading
Read Vehicle Number Plate and store the data in a CSV file with date and time.
license-plate-recognition number-plate-recognition number-plates opencv recognition tesseract vehicle-identification-number
Last synced: 24 Nov 2024
https://github.com/marcincichocki/breach-protocol-autosolver
Solve breach protocol minigame in second(s). Windows/Linux/GeForce Now/Google Stadia. Every language.
cyberpunk2077 electron ocr react tesseract tesseract-ocr typescript
Last synced: 06 Nov 2024
https://github.com/hhhrrrttt222111/handreacting
Text to Handwriting converter made using React.
firebase font hacktoberfest handwriting javascript react react-speech reactjs tesseract tesseract-ocr text text-to-handwriting vanilla-javascript website
Last synced: 14 Dec 2024
https://github.com/dannnylo/tesseract-ocr-elixir
This package is a wrapper of Tesseract OCR. Helping to read characters on an image.
hacktoberfest tesseract tesseract-ocr
Last synced: 19 Dec 2024
https://github.com/aryaminus/saram
Get OCR in txt form from an image or pdf extension supporting multiple files from directory using pytesseract with auto rotation for wrong orientation. PYPI:
character-recognition chmod image ocr orientation-detection pdf pillow pyocr pytesseract python tesseract wand
Last synced: 27 Nov 2024
https://github.com/devforth/imagetotext.app
Copy text from the raster images online
image-analysis image-processing image-recognition imagetotext tesseract
Last synced: 10 Nov 2024
https://github.com/GerHobbelt/qiqqa-open-source
The open-sourced version of the award-winning Qiqqa research management tool for Windows (a bleeding edge dev fork) ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ☞☞☞ File any issues you find in the main repo issue tracker at https://github.com/jimmejardine/qiqqa-open-source/issues
citations document-classification document-management meta-analysis metadata mupdf pdf qiqqa tesseract
Last synced: 06 Nov 2024
https://github.com/leverylteam/leveryl
An Advanced & Feature Rich Server Software for MC:PE 1.1.x
mcpe mcpe-server php php7 pmmp pocketmine-mp spoon tesseract
Last synced: 09 Oct 2024
https://github.com/mauvilsa/tesseract-recognize
Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format
cli docker-image document-recognition ocr optical-character-recognition pagexml tesseract text-detection
Last synced: 02 Nov 2024
https://github.com/bandrel/ocyara
Performs OCR on image files and scans them for matches to YARA rules
ocr optical-character-recognition python python-3 tesseract tesseract-ocr-api yara yara-rules
Last synced: 12 Oct 2024
https://github.com/bandrel/OCyara
Performs OCR on image files and scans them for matches to YARA rules
ocr optical-character-recognition python python-3 tesseract tesseract-ocr-api yara yara-rules
Last synced: 13 Nov 2024
https://github.com/rinormaloku/textocry
Textocry - Copy text from Images (chrome extension)
chrome-extension javascript ocr tesseract
Last synced: 28 Oct 2024
https://github.com/eddyverbruggen/nativescript-ocr
:newspaper: :mag: Tesseract-powered OCR plugin for NativeScript
nativescript nativescript-plugin ocr optical-character-recognition tesseract tesseract-ocr
Last synced: 01 Oct 2024
https://github.com/itext/itext-pdfocr-dotnet
pdfOCR is an iText 7 add-on to recognize and extract text in scanned documents and images. It can also convert them into fully ISO-compliant PDF or PDF/A-3u files that are accessible, searchable, and suitable for archiving
archival character data diacritic extractable glyphs hindi image iso-compliant ligatures mandarin ocr optical pdf portuguese recognition scan searchable spanish tesseract
Last synced: 19 Dec 2024
https://github.com/stafazzoli/FarsiOCR
An OCR application for Farsi/ Persian documents.
farsi ocr python tesseract tesseract-ocr
Last synced: 29 Nov 2024
https://github.com/sivakumar-mahalingam/fastmrz
⚡Extracting the Machine Readable Zone (MRZ) from passport or any document images
identity-document mrz mrz-scanner ocr opencv opencv-python passport passport-mrz python tesseract tesseract-ocr text-recognition
Last synced: 17 Dec 2024
https://github.com/henrivain/tesseractocrmaui
Tesseract wrapper for Windows, Android and iOS for .NET MAUI
android csharp library maui nuget nuget-package tesseract tesseract-ocr windows windows-desktop
Last synced: 12 Oct 2024
https://github.com/googlecloudplatform/dlp-pdf-redaction
This solution provides an automated, serverless way to redact sensitive data from PDF files using Google Cloud Services like Data Loss Prevention (DLP), Cloud Workflows, and Cloud Run.
bigquery cloud cloudfunctions cloudrun cloudstorage cloudworkflows datalossprevention dlp documents gcp mask ocr pdf redaction serverless terraform tesseract workflows
Last synced: 07 Oct 2024
https://github.com/schwarzkopfb/tesseract-ocr
Node.js wrapper for Tesseract OCR CLI.
javascript nodejs ocr tesseract
Last synced: 04 Nov 2024
https://github.com/lucs1590/nkocr
🔎📝 This is a module to make specifics OCRs at food products and nutritional tables.
computer-vision east food-products hacktoberfest image-processing language ocr opencv opencv-python pytesseract python python3 specifics-ocrs spelling-correction symspell tesseract tesseract-ocr tesseract-ocr-engine tesseract-python
Last synced: 17 Dec 2024
https://github.com/davidegalilei/nimtesseract
A Tesseract OCR wrapper for Nim
nim ocr ocr-recognition tesseract text wrapper
Last synced: 14 Nov 2024
https://github.com/fourdigits/wagtail_textract
Text extraction for Wagtail document search
django search tesseract text-extraction textract wagtail
Last synced: 10 Dec 2024
https://github.com/maxim2266/ocr
A collection of tools for OCR (optical character recognition).
bash-script c extract-text linux ocr ocr-recognition tesseract
Last synced: 09 Oct 2024
https://github.com/itext/itext-pdfocr-java
pdfOCR is an iText 7 add-on to recognize and extract text in scanned documents and images. It can also convert them into fully ISO-compliant PDF or PDF/A-3u files that are accessible, searchable, and suitable for archiving
archival character data diacritic extractable glyphs hindi image iso-compliant ligatures mandarin ocr optical pdf portuguese recognition scan searchable spanish tesseract
Last synced: 14 Nov 2024
https://github.com/andreluizsecco/recogcaptcha
Técnica de reconhecimento de captcha utilizando o framework AForge.NET e o Tesseract
aforge-net dotnet image-recognition tesseract
Last synced: 22 Oct 2024
https://github.com/SimformSolutionsPvtLtd/tesseract-OCR-iOS-demo
This prototype is to recognize text inside the image and for that it uses Tesseract OCR. The underlying Tesseract engine will process the picture and return anything that it believes is text.
demo example-project ios ocr ocr-library optical-character-recognition sample swift tesseract
Last synced: 30 Oct 2024
https://github.com/luke161/unity-tesseract
Tesseract (4D Hypercube) implementation in Unity
Last synced: 06 Nov 2024
https://github.com/incubated-geek-cc/Text-To-Speech-App
A Fusion of OCR Technology (Tesseract.js) & Web Speech API. Standalone, portable and works offline.
data-science javascript machine-learning ocr ocr-recognition tesseract tesseract-ocr tesseract-ocr-api tesseractjs webapp
Last synced: 08 Nov 2024
https://github.com/keeferrourke/imgrep
A grep for images. Submission to Hack the North 2017.
cli golang grep grepping hackathon hackthenorth htn17 image-processing images imgrep tesseract waterloo
Last synced: 14 Oct 2024