An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with ocr-text-reader

A curated list of projects in awesome lists tagged with ocr-text-reader .

https://github.com/ushelp/EasyOCR

Java OCR 识别组件(基于Tesseract OCR 引擎)。能自动完成图片清理、识别 CAPTCHA 验证码图片内容的一体化工作。Java Image cleanup, OCR recognition component (based Tesseract OCR engine, automatically cleanup image and identification CAPTCHA verification code picture content).

captcha image-cleanup java-language java-ocr ocr ocr-engine ocr-java ocr-library ocr-recognition ocr-text-reader tesseract

Last synced: 17 Jan 2026

https://github.com/yardstick17/image_text_reader

The module extracts text from image using the tesseract-OCR engine. Generally, text present in the images are blur or are of uneven sizes. The image is pre-processed for better comprehension by OCR. This module first makes bounding box for text in images and then normalizes it to 300 dpi, suitable for OCR engine to read.

image-reader image-to-text ocr ocr-text-reader read-image tesseract-ocr

Last synced: 21 Jan 2026

https://github.com/YangDai2003/CopilotOCR-Android

Fast OCR, multi-language support, sleek design. Scan, edit, and share text effortlessly. Smart barcode and QR code scanning.

android android-ocr-application java ocr ocr-android ocr-text-reader

Last synced: 20 Apr 2025

https://github.com/frederico23/docling_ocr

A powerful Python package for extracting text from images and documents using the SmolDocling-256M-preview advanced LLM-based models.

llm ocr ocr-python ocr-text-reader

Last synced: 14 Jan 2026

https://github.com/hermann-web/python-ocr

Converting invoice pdf to image, image to text and then get, from the text, invoice informations like invoice number or vendor name

image-to-text invoice-number invoice-pdf ocr ocr-python ocr-recognition ocr-text-reader pdf pdf-to-image python tesseract

Last synced: 21 Jun 2025

https://github.com/moheladwy/ocr4linux

OCR Script Tool for Extracting Text from Screenshots (images) using bash, and python scripts only

bash bash-script ocr ocr-python ocr-text-reader python tesseract tesseract-engine tesseract-ocr tesseract-python

Last synced: 27 Oct 2025

https://github.com/salmik/ocr-ios-app-

Document Scanner with OCR iOS app written in Swift

ios-app ios-swift ocr ocr-recognition ocr-text-reader scanner swift swift5 wescan xcode xcode12

Last synced: 11 Apr 2025

https://github.com/salmik/ocr-scanner-app

Document Scanner with OCR iOS app written in Swift

ocr ocr-recognition ocr-text-reader scanner wescan

Last synced: 14 May 2025

https://github.com/xavimondev/textscan.app

Web app that detects and extracts text from any image using OCR

google-vision-api nextjs ocr ocr-recognition ocr-text-reader react tailwindcss typescript

Last synced: 10 Apr 2025

https://github.com/abdulahad968/file-morph

Convert your PDF, Word, and PPT files effortlessly with File Morph – a simple, fast, and secure file converter.

css fileconverter html5 ilovepdf-api nodejs ocr-text-reader reactjs rest-api tesseract-ocr

Last synced: 04 Aug 2025

https://github.com/lakshya-gg/tesseractjs

A process called Optical Character Recognition converts printed texts into digital image files. It is commonly used to recognize text in scanned documents and images.

javascript ocr-recognition ocr-text-reader typescript

Last synced: 09 Sep 2025

https://github.com/geo-y20/standard-ocr-

Explore the Standard OCR Project: a deep learning-based character recognition system leveraging advanced computer vision techniques. Detect characters in images using ResNet, Xception, Inception, and MobileNet models. Train, evaluate, and contribute to this cutting-edge technology.

css deep-learning deep-neural-networks flask html javascript jupyter-notebook keras ocr-recognition ocr-text-reader python resnet tensorflow xception

Last synced: 30 Dec 2025

https://github.com/arbazkhan4712/image-to-text

It fetch the text from an image and saves it into a text file

ocr ocr-python ocr-recognition ocr-text-reader python tesseract

Last synced: 10 Apr 2025

https://github.com/sushant1827/mistral-ocr-pdf-image

This workflow automates the extraction of structured information from PDFs and image files using the Mistral OCR API.

google-sheets http-requests mistral-ai n8n n8n-nodes n8n-workflow ocr-service ocr-text-reader openai-api text-classification

Last synced: 13 Feb 2026

https://github.com/altayyuzeir/pdf2docx-paddleocr-ui

📚 Pdf to Docx Converter with PaddleX & PaddleOCR UI

ocr ocr-python ocr-text-reader paddleocr paddlepaddle paddlex paddlex-gui streamlit

Last synced: 21 Mar 2025

https://github.com/suryaxanden/ocr_with_tesseract

Text extraction from an image using Tesseract engine for OCR.

ocr ocr-text-reader opencv opencv-python python python3 tesseract tesseract-ocr

Last synced: 18 Mar 2025

https://github.com/khaouitiabdelhakim/arabicocr-python-tutorial

This project uses the ArabicOcr package to convert Arabic text in images to editable text using OCR techniques.

ocr ocr-python ocr-recognition ocr-text-reader pip python script

Last synced: 26 Jan 2026

https://github.com/zreechxnn/servo-controller-ocr

Servo Controller-OCR integrates computer vision, OCR, and Arduino to control a servo motor based on text detection from a webcam. It uses Python for real-time image processing and Tesseract OCR for text recognition, combined with Arduino to handle servo motor operations. Ideal for automation projects requiring text-based triggers.

arduino-uno iot-device ocr-python ocr-text-reader opencv opencv-python servo-controller

Last synced: 14 Oct 2025

https://github.com/kishyassin/ocr-in-go-fiber

Optical Character Recognition (OCR) on uploaded images using the gosseract library.

fiber-framework golang ocr ocr-text-reader tesseract-ocr

Last synced: 20 Sep 2025

https://github.com/natgluons/AI-docs-analyzer-API

A smart document processing system built with an open-source multimodal LLM and OCR (DocTR/TrOCR), using FastAPI, Supabase, PgVector, Azure Functions, and Neo4j to automate invoice analysis and identity document verification.

document-processing document-verification fastapi llm multimodal-large-language-models neo4j-graph ocr ocr-text-reader pgvector supabase

Last synced: 19 Jun 2025

https://github.com/tizianogranzotto/ocr-renamer

IA takes the date, type of doc and number, then renames the file.

csharp dotnet net ocr ocr-recognition ocr-text-reader

Last synced: 22 Mar 2025

https://github.com/spajai/ocr-image-to-text

OCR Project to extract text from image using Tesseract, Perl

image-to-text ocr ocr-text-reader perl perl-ocr perl5 perl5-module poc tesseract tesseract-ocr tesseract-ocr-api

Last synced: 11 Nov 2025

https://github.com/janakhpon/monocr

Optical Character Recognition for Mon (mnw) text.

mon ocr ocr-text-reader python

Last synced: 09 Oct 2025

https://github.com/tsvetang2/advanced-local-ocr

Advanced local OCR is a project, inspired by the text extraction some AIs do. So instead of leaving people paying for such services, why not publish a open-source version, that keeps the privacy of each user. The app allows integration with LLMs via APIs.

desktop-app easyocr gui-application image-processing llms local-first ocr ocr-engine ocr-python ocr-text-reader privacy pyqt5 pyqt5-desktop-application tesseract text-cleaning text-recognition-from-image

Last synced: 03 Jul 2025

https://github.com/riccardototi/llm-pdf-extractor

LLM-PDF-Parser is a FastAPI-based application that extracts text from PDFs and images and uses Ollama's LLM to extract specific fields based on a given JSON template.

docker docker-compose image-recognition kubernetes llm microservices ocr-text-reader open-source pdf-extractor-llm

Last synced: 25 Mar 2025

https://github.com/muthukumar0908/bizcardx-extracting-business-card-data-with-ocr

Extracting the image by using easyOCR and Create a user interface app using Streamlit.

data-mining ocr-text-reader sql-server streamlit-application

Last synced: 30 Mar 2025

https://github.com/milanesaopa/image-text-extractor

Xcrap Image Text Extractor is a package of the Xcrap framework that abstracts the extraction of texts from images using the node-tesseract-ocr library.

apache background-subtraction computer-vision gnome image image-to-text llm machine-learning nuxt ocr-text-reader remove-grid tensorflow text-recognition vue

Last synced: 27 Jul 2025

https://github.com/n-jangra/img-ocr

This repository is a Go wrapper for Tesseract OCR using the gosseract package. It enables developers to easily integrate Optical Character Recognition (OCR) functionality into their Go applications, leveraging the power of Tesseract for text extraction from images.

golang gotemplate html-css img ocr-text-reader

Last synced: 23 Jul 2025

https://github.com/jakiur1234/visualtovoice

Visual-to-Voice is a lightweight, open-source project to convert PDF and image text into speech. Using advanced OCR and text-to-speech technology, it ensures fast, accurate, and natural voice output. Perfect for enhancing accessibility and productivity.

ocr ocr-recognition ocr-text-reader ocr-tools tts tts-tool

Last synced: 02 Mar 2025

https://github.com/vishal-ahirwar/image2text

Text 2 Image Using Qt6 C++ and Tesseract

cpp ocr-recognition ocr-text-reader qt6

Last synced: 14 Sep 2025

https://github.com/haripasapuleti/text-extractor

A Flask-based web app that uses OCR technology to extract text from uploaded images. Users can upload images, preview them, and download the extracted text as a file.

file-upload flask image-preprocessing ocr-text-reader python

Last synced: 08 Oct 2025

https://github.com/aungkaungpyaepaing/kivy-ocr-app

Making Optical character recognition (OCR) app using Python and Kivy. This project is based on only image processing without any machine learning or deep learning methods.

kivy-application ocr-text-reader opencv-python pytesseract-ocr threshold

Last synced: 18 Jan 2026

https://github.com/takk8is/thearchivistlens

This system implements the complete Reinert Method of Descending Hierarchical Classification (DHC), offering all IRaMuTeQ functionalities

analysis davidccavalcante iramuteq llm machine-learning ocr ocr-python ocr-recognition ocr-text-reader python reinert takk-ag takk-design takk8is

Last synced: 10 Oct 2025

https://github.com/raghavendranhp/cardify_extracting_business_card_data_and_management

Cardify is a Streamlit application that simplifies the process of extracting and managing business card data. With Cardify, users can easily upload business card images, extract relevant information using OCR, and manage the extracted data efficiently.

easyocr ocr ocr-text-reader opencv python regular-expression sqlite3 streamlit

Last synced: 16 Oct 2025

https://github.com/livewithcodeankit/ai-ocr

Advanced AI-OCR with FastAPI and OpenAI Integration

ai ml ocr ocr-text-reader openai pydantic pytesseract-ocr python

Last synced: 26 Feb 2025

https://github.com/sairam-s0/local_ai_automation

This project automates question solving using AI and OCR. Instead of scraping text from websites, the system captures screenshots of questions, uses OCR to extract text, and then feeds it to the Phi-4 Mini AI model to generate accurate answers and code solutions automatically.

ai ai-automation aiagent computer-vision llama machine-learning nlp-machine-learning ocr-text-reader phi problem-solving

Last synced: 19 Nov 2025

https://github.com/tjkessler/tesseract-positional

Tool to save positional OCR data to a text file

ocr-python ocr-recognition ocr-text-reader tesseract tesseract-ocr

Last synced: 15 Mar 2025

https://github.com/spajai/ocr-ocrspace

Perl Interface for Optical Character Recognition https://ocr.space/ (https://metacpan.org/pod/OCR::OcrSpace)

api cpan cpan-module ocr ocr-engine ocr-service ocr-text-reader ocrspace perl perl-module perl-sdk

Last synced: 10 Nov 2025

https://github.com/bcastelino/ocr-text-vision-pro

AI-powered OCR application using Llama 3.2 Vision for advanced image understanding and text extraction.

large-language-models llama llama3-2-vision ocr-text-reader openrouter-api python3 streamlit-webapp utility

Last synced: 01 Aug 2025