Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists tagged with tesseract

A curated list of projects in awesome lists tagged with tesseract .

https://github.com/tesseract-ocr/tesseract

Tesseract Open Source OCR Engine (main repository)

hacktoberfest lstm machine-learning ocr ocr-engine tesseract tesseract-ocr

Last synced: 16 Dec 2024

https://github.com/naptha/tesseract.js

Pure Javascript OCR for more than 100 Languages 📖🎉🖥

deep-learning javascript ocr tesseract webassembly

Last synced: 16 Dec 2024

https://github.com/ocrmypdf/ocrmypdf

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

image-processing ocr pdf python tesseract

Last synced: 16 Dec 2024

https://github.com/jbarlow83/OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

image-processing ocr pdf python tesseract

Last synced: 16 Nov 2024

https://github.com/ocrmypdf/OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

image-processing ocr pdf python tesseract

Last synced: 26 Oct 2024

https://github.com/tesseract-ocr/tessdata

Trained models with fast variant of the "best" LSTM models + legacy models

ocr tesseract

Last synced: 01 Dec 2024

https://github.com/aisingapore/tagui

Free RPA tool by AI Singapore

ai nlp opencv rpa tesseract

Last synced: 17 Dec 2024

https://github.com/pymupdf/PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

data-science epub extract-data font mupdf ocr pdf pdf-documents pymupdf python table-extraction tesseract text-processing text-shaping xps

Last synced: 06 Nov 2024

https://github.com/pymupdf/pymupdf

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

data-science epub extract-data font mupdf ocr pdf pdf-documents pymupdf python table-extraction tesseract text-processing text-shaping xps

Last synced: 16 Dec 2024

https://github.com/aisingapore/TagUI

Free RPA tool by AI Singapore

ai nlp opencv rpa tesseract

Last synced: 25 Oct 2024

https://github.com/tebelorg/rpa-python

Python package for doing RPA

cross-platform opencv python rpa sikuli tagui tesseract

Last synced: 16 Dec 2024

https://github.com/tebelorg/RPA-Python

Python package for doing RPA

cross-platform opencv python rpa sikuli tagui tesseract

Last synced: 29 Oct 2024

https://github.com/thiagoalessio/tesseract-ocr-for-php

A wrapper to work with Tesseract OCR inside PHP.

image-to-text ocr php tesseract text-recognition

Last synced: 17 Dec 2024

https://github.com/otiai10/gosseract

Go package for OCR (Optical Character Recognition), by using Tesseract C++ library

go ocr ocr-server tesseract tesseract-ocr

Last synced: 17 Dec 2024

https://github.com/dicklesworthstone/llm_aided_ocr

Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.

ai-assist llama2 llm ocr ocr-correction tesseract

Last synced: 19 Dec 2024

https://github.com/rmtheis/android-ocr

Experimental optical character recognition app

android ocr optical-character-recognition tesseract

Last synced: 25 Sep 2024

https://github.com/sirfz/tesserocr

A Python wrapper for the tesseract-ocr API

cython ocr optical-character-recognition python-library tesseract

Last synced: 17 Dec 2024

https://github.com/Pulover/PuloversMacroCreator

Automation Utility - Recorder & Script Generator

ahk autohotkey automation robot rpa tesseract

Last synced: 28 Oct 2024

https://github.com/Dicklesworthstone/llm_aided_ocr

Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.

ai-assist llama2 llm ocr ocr-correction tesseract

Last synced: 31 Aug 2024

https://github.com/gauravsingh9356/j.a.r.v.i.s

Personal Assistant built using python libraries. It does almost anything which includes sending emails, Optical Text Recognition, Dynamic News Reporting at any time with API integration, Todo list generator, Opens any website with just a voice command, Plays Music, Wikipedia searching, Dictionary with Intelligent Sensing i.e. auto spell checking, Weather Reporting i.e. temp, wind speed, humidity, YouTube searching, Google Map searching, Youtube Downloading, etc.

chatgpt dictionary-application difflib hacktoberfest hactober-accepted hactoberfest2021 jarvis jarvis-ai newsapi opencv optical-character-recognition optical-text-recognition python python3 pyttsx3 speech-recognition tesseract tesseract-ocr weather-api webbrowser

Last synced: 15 Dec 2024

https://github.com/GauravSingh9356/J.A.R.V.I.S

Personal Assistant built using python libraries. It does almost anything which includes sending emails, Optical Text Recognition, Dynamic News Reporting at any time with API integration, Todo list generator, Opens any website with just a voice command, Plays Music, Wikipedia searching, Dictionary with Intelligent Sensing i.e. auto spell checking, Weather Reporting i.e. temp, wind speed, humidity, YouTube searching, Google Map searching, Youtube Downloading, etc.

chatgpt dictionary-application difflib hacktoberfest hactober-accepted hactoberfest2021 jarvis jarvis-ai newsapi opencv optical-character-recognition optical-text-recognition python python3 pyttsx3 speech-recognition tesseract tesseract-ocr weather-api webbrowser

Last synced: 18 Nov 2024

https://github.com/dannnylo/rtesseract

Ruby library for working with the Tesseract OCR.

hacktoberfest rtesseract ruby tesseract tesseract-ocr

Last synced: 20 Dec 2024

https://github.com/tesseract-ocr/tesstrain

Train Tesseract LSTM with make

ocr tesseract training

Last synced: 16 Dec 2024

https://github.com/junhoyeo/betterocr

🔍 Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with 🧠 LLM.

ai chatgpt chatgpt-api easyocr llm ocr openai openai-api tesseract tesseract-ocr

Last synced: 09 Nov 2024

https://github.com/tesseract-ocr/tessdata_fast

Fast integer versions of trained LSTM models

ocr tesseract

Last synced: 17 Nov 2024

https://github.com/prabhakar267/image2text

:clipboard: Python wrapper to grab text from images and save as text files using Tesseract Engine

image2text ocr optical-character-recognition python-wrapper tesseract tesseract-engine tesseract-installation tesseract-ocr

Last synced: 21 Dec 2024

https://github.com/zapolnoch/node-tesseract-ocr

A Node.js wrapper for the Tesseract OCR API

image-to-text ocr tesseract text-recognition

Last synced: 16 Dec 2024

https://github.com/cseas/ocr-table

Extract tables from scanned image PDFs using Optical Character Recognition.

extract-tables ocr ocr-table optical-character-recognition pdfminer python scanned-image-pdfs shell tesseract

Last synced: 18 Dec 2024

https://github.com/LeoFCardoso/pdf2pdfocr

A free tool to OCR a PDF and add a text "layer" in the original file, making a searchable PDF. Use only open source tools. Please tip!

docker ocr pdf pdftk python tesseract

Last synced: 26 Oct 2024

https://github.com/ropensci/tesseract

Bindings to Tesseract OCR engine for R

ocr r r-package rstats tesseract tesseract-ocr

Last synced: 10 Nov 2024

https://github.com/SwiftyTesseract/SwiftyTesseract

A Swift wrapper around Tesseract for use in iOS, macOS, and Linux applications

ios ocr optical-character-recognition swift tesseract

Last synced: 29 Nov 2024

https://github.com/scott0123/Tesseract-macOS

Objective C wrapper for the open source OCR Engine Tesseract (macOS)

mac macos objective-c ocr screenshot tesseract tesseract-mac tesseract-macos xcode

Last synced: 30 Oct 2024

https://github.com/scherroman/mugen

A command-line music video generator based on rhythm

amv audio command-line librosa montage moviepy mugen music-video python remix rhythm tesseract video

Last synced: 14 Oct 2024

https://github.com/the-black-knight-01/tabulo

Table Detection and Extraction Using Deep Learning ( It is built in Python, using Luminoth, TensorFlow<2.0 and Sonnet.)

deep-learning detection faster-r-cnn luminoth ocr pdf-table-extraction python sonnet ssd table-data-extraction table-detection table-detection-using-deep-learning table-recognition tabulo tensorflow tesseract

Last synced: 19 Dec 2024

https://github.com/the-black-knight-01/Tabulo

Table Detection and Extraction Using Deep Learning ( It is built in Python, using Luminoth, TensorFlow<2.0 and Sonnet.)

deep-learning detection faster-r-cnn luminoth ocr pdf-table-extraction python sonnet ssd table-data-extraction table-detection table-detection-using-deep-learning table-recognition tabulo tensorflow tesseract

Last synced: 17 Nov 2024

https://github.com/trekhleb/links-detector

📖 👆🏻 Links Detector makes printed links clickable via your smartphone camera. No need to type a link in, just scan and click on it.

computer-vision javascript machine-learning object-detection ocr tensorflowjs tesnorflow tesseract typescript

Last synced: 19 Dec 2024

https://github.com/skylander86/lambda-text-extractor

AWS Lambda functions to extract text from various binary formats.

aws-lambda lambda-functions ocr pdf pdf-ocr-extraction searchable-pdfs tesseract text-extraction

Last synced: 22 Nov 2024

https://github.com/alimranahmed/laraocr

Laravel Optical Character Reader(OCR) package using ocr engines(Tesseract)

laravel55 ocr tesseract

Last synced: 21 Dec 2024

https://github.com/writecrow/ocr2text

Convert a PDF via OCR to a TXT file in UTF-8 encoding

batch converter corpus ocr pdf tesseract

Last synced: 20 Dec 2024

https://github.com/dilippuri/aadhaar-card-ocr

Extract text information from Aadhaar Card using tesseract-ocr :sunglasses:

aadhaar ocr tesseract tesseract-ocr

Last synced: 27 Oct 2024

https://github.com/koreader/koreader-base

Base framework offering a Lua scriptable environment for creating document readers

djvu emulator epub ffi koreader leptonica lua luajit mupdf pdf sdl tesseract ubuntu

Last synced: 16 Dec 2024

https://github.com/kaelzhang/penteract-ocr

⭐️ The native node.js bindings to the Tesseract OCR project.

n-api nodejs ocr ocr-engine optical-character-recognition penteract tesseract

Last synced: 01 Nov 2024

https://github.com/ndavd/ncube

Generalized Hypercube Visualizer

bevy hypercube mathematics rust simulation tesseract

Last synced: 16 Dec 2024

https://github.com/bweigel/aws-lambda-tesseract-layer

A layer for AWS Lambda containing the tesseract C libraries and tesseract executable.

amazon-linux aws-lambda lambda lambda-layer serverless serverless-framework tesseract

Last synced: 02 Nov 2024

https://github.com/victorqribeiro/ocr

Simple app to extract text from pictures using Tesseract

image-recognition ocr tesseract text-extraction text-recognition

Last synced: 17 Nov 2024

https://github.com/monogramm/erpnext_ocr

:snake: :alembic: Optical Character Recognition using tesseract within Frappe.

erpnext frappe ocr python tesseract

Last synced: 15 Dec 2024

https://github.com/008karan/PAN_OCR

Building OCR using YOLO and Tesseract

deep-learning ocr tesseract yolo

Last synced: 09 Nov 2024

https://github.com/SkeathyTomas/genshin_artifact_auxiliary

A Genshin Impact artifact rater sticking upon artifacts inside the game window. 刻晴办公桌 | 原神 | 圣遗物评分。集成在游戏窗口之上的原神圣遗物导出、评分工具,无需游戏内外来回切换对比,游戏中快速计算与查阅结果。

genshin-impact ocr paddleocr pyside6 python rapidocr tesseract

Last synced: 10 Nov 2024

https://github.com/testica/text-scanner

OCR Android app using tesseract

android google-play java ocr-android tesseract

Last synced: 24 Nov 2024

https://github.com/hertzg/tesseract-server

A small lightweight HTTP server that converts photos, images and scanned documents to text using optical character recognition by utilizing the power of Google Tesseract.

api container containers docker docker-compose docker-image hacktoberfest http-server image-processing ocr rest-api tesseract tesseract-server typescript

Last synced: 16 Dec 2024

https://github.com/shelfio/aws-lambda-tesseract

6 MB Tesseract (with English training data) to fit inside AWS Lambda

aws-lambda node-module nodejs npm-package ocr optical-character-recognition serverless tesseract

Last synced: 20 Dec 2024

https://github.com/Monogramm/erpnext_ocr

:snake: :alembic: Optical Character Recognition using tesseract within Frappe.

erpnext frappe ocr python tesseract

Last synced: 05 Nov 2024

https://github.com/lexmartinez/ocr-electron-vue

:card_index: A Simple OCR Application built on Electron, Vue.js & Tesseract.js

electron ocr tesseract vuejs

Last synced: 04 Nov 2024

https://github.com/Arthelon/imgclip

Command line utility that extracts text from an image into the system clipboard.

cli command-line javascript ocr tesseract

Last synced: 07 Nov 2024

https://github.com/shreeshrii/tess5train-fonts

Files and Scripts to run Tesseract 5 LSTM Training using fonts

lstm lstm-neural-networks tesseract tesseract-ocr training tutorial

Last synced: 26 Nov 2024

https://github.com/farhanchoudhary/PAN_Card_OCR_Project

To extract details from Indian National Identification Cards such as PAN (completed) & Aadhar, Passport, Driving License (WIP) in a structured format

image-processing image-to-text ocr optical-character-recognition pan pan-card pytesseract tesseract

Last synced: 02 Dec 2024

https://github.com/nmapx/revolut-stocks-list

Extract Revolut stocks list from the list screenshot(s).

extract image list ocr revolut screenshot stocks tesseract

Last synced: 28 Nov 2024

https://github.com/Lynnesbian/OCRbot

An OCR (Optical Character Recognition) bot for Mastodon (and compatible) instances

mastodon ocr python python3 tesseract

Last synced: 06 Nov 2024

https://github.com/marcincichocki/breach-protocol-autosolver

Solve breach protocol minigame in second(s). Windows/Linux/GeForce Now/Google Stadia. Every language.

cyberpunk2077 electron ocr react tesseract tesseract-ocr typescript

Last synced: 06 Nov 2024

https://github.com/dannnylo/tesseract-ocr-elixir

This package is a wrapper of Tesseract OCR. Helping to read characters on an image.

hacktoberfest tesseract tesseract-ocr

Last synced: 19 Dec 2024

https://github.com/aryaminus/saram

Get OCR in txt form from an image or pdf extension supporting multiple files from directory using pytesseract with auto rotation for wrong orientation. PYPI:

character-recognition chmod image ocr orientation-detection pdf pillow pyocr pytesseract python tesseract wand

Last synced: 27 Nov 2024

https://github.com/onmyway133/bigbignumbers

🔢Say the number out loud

ocr tesseract text trump vision

Last synced: 02 Nov 2024

https://github.com/GerHobbelt/qiqqa-open-source

The open-sourced version of the award-winning Qiqqa research management tool for Windows (a bleeding edge dev fork) ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ☞☞☞ File any issues you find in the main repo issue tracker at https://github.com/jimmejardine/qiqqa-open-source/issues

citations document-classification document-management meta-analysis metadata mupdf pdf qiqqa tesseract

Last synced: 06 Nov 2024

https://github.com/leverylteam/leveryl

An Advanced & Feature Rich Server Software for MC:PE 1.1.x

mcpe mcpe-server php php7 pmmp pocketmine-mp spoon tesseract

Last synced: 09 Oct 2024

https://github.com/mauvilsa/tesseract-recognize

Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format

cli docker-image document-recognition ocr optical-character-recognition pagexml tesseract text-detection

Last synced: 02 Nov 2024

https://github.com/bandrel/ocyara

Performs OCR on image files and scans them for matches to YARA rules

ocr optical-character-recognition python python-3 tesseract tesseract-ocr-api yara yara-rules

Last synced: 12 Oct 2024

https://github.com/bandrel/OCyara

Performs OCR on image files and scans them for matches to YARA rules

ocr optical-character-recognition python python-3 tesseract tesseract-ocr-api yara yara-rules

Last synced: 13 Nov 2024

https://github.com/rinormaloku/textocry

Textocry - Copy text from Images (chrome extension)

chrome-extension javascript ocr tesseract

Last synced: 28 Oct 2024

https://github.com/eddyverbruggen/nativescript-ocr

:newspaper: :mag: Tesseract-powered OCR plugin for NativeScript

nativescript nativescript-plugin ocr optical-character-recognition tesseract tesseract-ocr

Last synced: 01 Oct 2024

https://github.com/itext/itext-pdfocr-dotnet

pdfOCR is an iText 7 add-on to recognize and extract text in scanned documents and images. It can also convert them into fully ISO-compliant PDF or PDF/A-3u files that are accessible, searchable, and suitable for archiving

archival character data diacritic extractable glyphs hindi image iso-compliant ligatures mandarin ocr optical pdf portuguese recognition scan searchable spanish tesseract

Last synced: 19 Dec 2024

https://github.com/stafazzoli/FarsiOCR

An OCR application for Farsi/ Persian documents.

farsi ocr python tesseract tesseract-ocr

Last synced: 29 Nov 2024

https://github.com/sivakumar-mahalingam/fastmrz

⚡Extracting the Machine Readable Zone (MRZ) from passport or any document images

identity-document mrz mrz-scanner ocr opencv opencv-python passport passport-mrz python tesseract tesseract-ocr text-recognition

Last synced: 17 Dec 2024

https://github.com/henrivain/tesseractocrmaui

Tesseract wrapper for Windows, Android and iOS for .NET MAUI

android csharp library maui nuget nuget-package tesseract tesseract-ocr windows windows-desktop

Last synced: 12 Oct 2024

https://github.com/googlecloudplatform/dlp-pdf-redaction

This solution provides an automated, serverless way to redact sensitive data from PDF files using Google Cloud Services like Data Loss Prevention (DLP), Cloud Workflows, and Cloud Run.

bigquery cloud cloudfunctions cloudrun cloudstorage cloudworkflows datalossprevention dlp documents gcp mask ocr pdf redaction serverless terraform tesseract workflows

Last synced: 07 Oct 2024

https://github.com/schwarzkopfb/tesseract-ocr

Node.js wrapper for Tesseract OCR CLI.

javascript nodejs ocr tesseract

Last synced: 04 Nov 2024

https://github.com/davidegalilei/nimtesseract

A Tesseract OCR wrapper for Nim

nim ocr ocr-recognition tesseract text wrapper

Last synced: 14 Nov 2024

https://github.com/fourdigits/wagtail_textract

Text extraction for Wagtail document search

django search tesseract text-extraction textract wagtail

Last synced: 10 Dec 2024

https://github.com/maxim2266/ocr

A collection of tools for OCR (optical character recognition).

bash-script c extract-text linux ocr ocr-recognition tesseract

Last synced: 09 Oct 2024

https://github.com/itext/itext-pdfocr-java

pdfOCR is an iText 7 add-on to recognize and extract text in scanned documents and images. It can also convert them into fully ISO-compliant PDF or PDF/A-3u files that are accessible, searchable, and suitable for archiving

archival character data diacritic extractable glyphs hindi image iso-compliant ligatures mandarin ocr optical pdf portuguese recognition scan searchable spanish tesseract

Last synced: 14 Nov 2024

https://github.com/andreluizsecco/recogcaptcha

Técnica de reconhecimento de captcha utilizando o framework AForge.NET e o Tesseract

aforge-net dotnet image-recognition tesseract

Last synced: 22 Oct 2024

https://github.com/SimformSolutionsPvtLtd/tesseract-OCR-iOS-demo

This prototype is to recognize text inside the image and for that it uses Tesseract OCR. The underlying Tesseract engine will process the picture and return anything that it believes is text.

demo example-project ios ocr ocr-library optical-character-recognition sample swift tesseract

Last synced: 30 Oct 2024

https://github.com/luke161/unity-tesseract

Tesseract (4D Hypercube) implementation in Unity

4d-hypercube tesseract unity

Last synced: 06 Nov 2024

https://github.com/incubated-geek-cc/Text-To-Speech-App

A Fusion of OCR Technology (Tesseract.js) & Web Speech API. Standalone, portable and works offline.

data-science javascript machine-learning ocr ocr-recognition tesseract tesseract-ocr tesseract-ocr-api tesseractjs webapp

Last synced: 08 Nov 2024

https://github.com/keeferrourke/imgrep

A grep for images. Submission to Hack the North 2017.

cli golang grep grepping hackathon hackthenorth htn17 image-processing images imgrep tesseract waterloo

Last synced: 14 Oct 2024