Projects in Awesome Lists tagged with ocr
A curated list of projects in awesome lists tagged with ocr .
https://github.com/tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository)
hacktoberfest lstm machine-learning ocr ocr-engine tesseract tesseract-ocr
Last synced: 26 Dec 2025
https://github.com/paddlepaddle/paddleocr
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
chineseocr crnn db ocr ocrlite
Last synced: 09 Sep 2025
https://github.com/PaddlePaddle/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
chineseocr crnn db ocr ocrlite
Last synced: 14 Mar 2025
https://github.com/naptha/tesseract.js
Pure Javascript OCR for more than 100 Languages 📖🎉🖥
deep-learning javascript ocr tesseract webassembly
Last synced: 12 May 2025
https://github.com/siyuan-note/siyuan
A privacy-first, self-hosted, fully open source personal knowledge management software, written in typescript and golang.
anki chatgpt deepseek electron evernote knowledge-base local-first markdown note-taking notes-app notion obsidian ocr ollama openai pdf s3 self-hosted webdav
Last synced: 02 Jan 2026
https://github.com/opendatalab/mineru
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
ai4science document-analysis extract-data layout-analysis ocr parser pdf pdf-converter pdf-extractor-llm pdf-extractor-pretrain pdf-extractor-rag pdf-parser python
Last synced: 06 Jan 2026
https://github.com/sharex/sharex
ShareX is a free and open source program that lets you capture or record any area of your screen and share it with a single press of a key. It also allows uploading images, text or other types of files to many supported destinations you can choose from.
capture color-picker csharp dropbox file-sharing file-upload ftp gif gif-recorder image-annotation imgur ocr productivity region-capture screen-capture screen-recorder screenshot share sharex url-shortener
Last synced: 09 Sep 2025
https://github.com/ShareX/ShareX
ShareX is a free and open source program that lets you capture or record any area of your screen and share it with a single press of a key. It also allows uploading images, text or other types of files to many supported destinations you can choose from.
capture color-picker csharp dropbox file-sharing file-upload ftp gif gif-recorder image-annotation imgur ocr productivity region-capture screen-capture screen-recorder screenshot share sharex url-shortener
Last synced: 13 Mar 2025
https://github.com/ocrmypdf/ocrmypdf
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
image-processing ocr pdf python tesseract
Last synced: 11 Dec 2025
https://github.com/paperless-ngx/paperless-ngx
A community-supported supercharged version of paperless: scan, index and archive all your physical documents
angular archiving django dms document-management document-management-system machine-learning ocr optical-character-recognition pdf
Last synced: 12 May 2025
https://github.com/opendatalab/MinerU
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
ai4science document-analysis extract-data layout-analysis ocr parser pdf pdf-converter pdf-extractor-llm pdf-extractor-pretrain pdf-extractor-rag pdf-parser python
Last synced: 24 Mar 2025
https://github.com/jaidedai/easyocr
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
cnn crnn data-mining deep-learning easyocr image-processing information-retrieval lstm machine-learning ocr optical-character-recognition python pytorch scene-text scene-text-recognition
Last synced: 17 Nov 2025
https://github.com/JaidedAI/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
cnn crnn data-mining deep-learning easyocr image-processing information-retrieval lstm machine-learning ocr optical-character-recognition python pytorch scene-text scene-text-recognition
Last synced: 14 Mar 2025
https://github.com/hiroi-sora/Umi-OCR
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。
Last synced: 24 Mar 2025
https://github.com/hiroi-sora/umi-ocr
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。
Last synced: 05 Apr 2025
https://lukas-blecher.github.io/LaTeX-OCR/
pix2tex: Using a ViT to convert images of equations into LaTeX code.
dataset deep-learning im2latex im2markup im2text image-processing image2text latex latex-ocr machine-learning math-ocr ocr python pytorch transformer vision-transformer vit
Last synced: 19 Nov 2025
https://github.com/lukas-blecher/latex-ocr
pix2tex: Using a ViT to convert images of equations into LaTeX code.
dataset deep-learning im2latex im2markup im2text image-processing image2text latex latex-ocr machine-learning math-ocr ocr python pytorch transformer vision-transformer vit
Last synced: 13 May 2025
https://github.com/ocrmypdf/OCRmyPDF
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
image-processing ocr pdf python tesseract
Last synced: 14 Mar 2025
https://github.com/lukas-blecher/LaTeX-OCR
pix2tex: Using a ViT to convert images of equations into LaTeX code.
dataset deep-learning im2latex im2markup im2text image-processing image2text latex latex-ocr machine-learning math-ocr ocr python pytorch transformer vision-transformer vit
Last synced: 26 Mar 2025
https://github.com/daybreak-u/chineseocr_lite
超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M
Last synced: 13 May 2025
https://github.com/DayBreak-u/chineseocr_lite
超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M
Last synced: 18 Mar 2025
https://github.com/unstructured-io/unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
data-pipelines deep-learning document-image-analysis document-image-processing document-parser document-parsing docx donut information-retrieval langchain llm machine-learning ml natural-language-processing nlp ocr pdf pdf-to-json pdf-to-text preprocessing
Last synced: 09 Sep 2025
https://github.com/Unstructured-IO/unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
data-pipelines deep-learning document-image-analysis document-image-processing document-parser document-parsing docx donut information-retrieval langchain llm machine-learning ml natural-language-processing nlp ocr pdf pdf-to-json pdf-to-text preprocessing
Last synced: 26 Mar 2025
https://github.com/tisfeng/easydict
一个简洁优雅的词典翻译 macOS App。开箱即用,支持离线 OCR 识别,支持有道词典,🍎 苹果系统词典,🍎 苹果系统翻译,OpenAI,Gemini,DeepL,Google,Bing,腾讯,百度,阿里,小牛,彩云和火山翻译。A concise and elegant Dictionary and Translator macOS App for looking up words and translating text.
app baidu bing deepl dictionary gemini google macos ocr openai shortcuts tencent translate translator youdao
Last synced: 13 May 2025
https://github.com/dataelement/bisheng
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more.
agent ai chatbot enterprise finetune genai gpt langchian llama llm llmdevops llmops ocr openai orchestration python rag react sft workflow
Last synced: 14 May 2025
https://github.com/pymupdf/pymupdf
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
data-science epub extract-data font mupdf ocr pdf pdf-documents pymupdf python table-extraction tesseract text-processing text-shaping xps
Last synced: 09 Sep 2025
https://github.com/microsoft/ailab
Experience, Learn and Code the latest breakthrough innovations with Microsoft AI
ai algorithms azure-functions bing-search bot computer-vision csharp custom-vision dnn html5 image-classification iot javascript language-learning luis object-detection ocr translation
Last synced: 14 May 2025
https://github.com/zyddnys/manga-image-translator
Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/
anime auto-translation chinese-translation deep-learning image-processing inpainting japanese-translations machine-translation manga neural-network ocr pytorch-implementation text-detection text-detection-recognition transformer
Last synced: 12 May 2025
https://github.com/tisfeng/Easydict
一个简洁优雅的词典翻译 macOS App。开箱即用,支持离线 OCR 识别,支持有道词典,🍎 苹果系统词典,🍎 苹果系统翻译,OpenAI,Gemini,DeepL,Google,Bing,腾讯,百度,阿里,小牛,彩云和火山翻译。A concise and elegant Dictionary and Translator macOS App for looking up words and translating text.
app baidu bing deepl dictionary gemini google macos ocr openai shortcuts tencent translate translator youdao
Last synced: 24 Mar 2025
https://github.com/tesseract-ocr/tessdata
Trained models with fast variant of the "best" LSTM models + legacy models
Last synced: 24 Mar 2025
https://github.com/HIllya51/LunaTranslator
Galgame翻译器,支持HOOK、OCR、剪贴板等。Visual Novel Translator , support HOOK / OCR / clipboard
galgame ocr reverse-engineering translator visual-novel win32
Last synced: 14 May 2025
https://github.com/adithya-s-k/omniparse
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
ingestion-api ocr omniparser parse-server parser-library vision-transformer web-crawler whisper-api
Last synced: 13 May 2025
https://github.com/pymupdf/PyMuPDF
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
data-science epub extract-data font mupdf ocr pdf pdf-documents pymupdf python table-extraction tesseract text-processing text-shaping xps
Last synced: 08 Apr 2025
https://github.com/clovaai/donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
computer-vision document-ai eccv-2022 multimodal-pre-trained-model nlp ocr
Last synced: 13 May 2025
https://github.com/Swift-AI/Swift-AI
The Swift machine learning library.
artificial-intelligence deep-learning ios machine-learning macos ocr swift
Last synced: 02 Aug 2025
https://github.com/swift-ai/swift-ai
The Swift machine learning library.
artificial-intelligence deep-learning ios machine-learning macos ocr swift
Last synced: 12 Apr 2025
https://github.com/chineseocr/chineseocr
yolo3+ocr
chinese-ocr chinese-text-detect darknet-text-detect idcard ocr opencv-dnn trainticket yolo3
Last synced: 14 May 2025
https://github.com/yaofanguk/video-subtitle-extractor
视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.
deep-learning extract hardsub ocr ripper srt subrip subtitles
Last synced: 13 May 2025
https://github.com/axa-group/parsr
Transforms PDF, Documents and Images into Enriched Structured Data
data document extraction hacktoberfest images nlp ocr parsr pdf python typescript
Last synced: 13 May 2025
https://github.com/axa-group/Parsr
Transforms PDF, Documents and Images into Enriched Structured Data
data document extraction hacktoberfest images nlp ocr parsr pdf python typescript
Last synced: 13 Mar 2025
https://github.com/paddlepaddle/paddlex
All-in-One Development Tool based on PaddlePaddle
ai-pipelines classification deployment formula-recognition layout-detection object-detection ocr pdf2markdown pp-chatocr segmentation speech-recognition time-series
Last synced: 14 May 2025
https://github.com/jonaswinkler/paperless-ng
A supercharged version of paperless: scan, index and archive all your physical documents
angular archiving django dms document-management-system full-text-search machine-learning ocr search
Last synced: 27 Sep 2025
https://github.com/layout-parser/layout-parser
A Unified Toolkit for Deep Learning Based Document Image Analysis
computer-vision deep-learning detectron2 document-image-processing document-layout-analysis layout-analysis layout-detection layout-parser object-detection ocr
Last synced: 13 May 2025
https://github.com/Layout-Parser/layout-parser
A Unified Toolkit for Deep Learning Based Document Image Analysis
computer-vision deep-learning detectron2 document-image-processing document-layout-analysis layout-analysis layout-detection layout-parser object-detection ocr
Last synced: 15 Mar 2025
https://github.com/YaoFANGUK/video-subtitle-extractor
视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.
deep-learning extract hardsub ocr ripper srt subrip subtitles
Last synced: 24 Mar 2025
https://github.com/PaddlePaddle/PaddleX
All-in-One Development Tool based on PaddlePaddle(飞桨低代码开发工具)
ai-pipelines classification deployment object-detection ocr pp-chatocr segmentation time-series
Last synced: 28 Mar 2025
https://github.com/xushengfeng/eSearch
截屏 离线OCR 搜索翻译 以图搜图 贴图 录屏 万向滚动截屏 屏幕翻译 Screenshot Offline OCR Search Translate Search for picture Paste the picture on the screen Screen recorder Omnidirectional scrolling screenshot Screen translator
clipboard color-picker cross-platform electron image-editing image-editor live-text ocr paddleocr screen-capture screen-recorder screenshot search search-photos
Last synced: 08 Apr 2025
https://github.com/mindee/doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
deep-learning document-recognition ocr optical-character-recognition pytorch tensorflow2 text-detection text-detection-recognition text-recognition
Last synced: 14 May 2025
https://github.com/nmac427/swiftocr
Fast and simple OCR library written in Swift
deprecated ios macos ocr ocr-engine ocr-library optical-character-recognition swift swiftocr
Last synced: 14 May 2025
https://github.com/NMAC427/SwiftOCR
Fast and simple OCR library written in Swift
deprecated ios macos ocr ocr-engine ocr-library optical-character-recognition swift swiftocr
Last synced: 06 Aug 2025
https://github.com/open-mmlab/mmocr
OpenMMLab Text Detection, Recognition and Understanding Toolbox
abcnet abinet crnn dbnet deep-learning fcenet key-information-extraction maskrcnn ocr pan panet psenet pytorch sar sdmg-r segmentation-based-text-recognition spts svtr text-detection text-recognition
Last synced: 14 May 2025
https://github.com/tencent/tnn
TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile、desktop and server. TNN is distinguished by several outstanding features, including its cross-platform capability, high performance, model compression and code pruning. Based on ncnn and Rapidnet, TNN further strengthens the support and performance optimization for mobile devices, and also draws on the advantages of good extensibility and high performance from existed open source efforts. TNN has been deployed in multiple Apps from Tencent, such as Mobile QQ, Weishi, Pitu, etc. Contributions are welcome to work in collaborative with us and make TNN a better framework.
coreml deep-learning face-detection hairsegmentaion inference mnn ncnn ocr openvino pytorch tengine tensorflow tensorrt
Last synced: 13 May 2025
https://github.com/Tencent/TNN
TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile、desktop and server. TNN is distinguished by several outstanding features, including its cross-platform capability, high performance, model compression and code pruning. Based on ncnn and Rapidnet, TNN further strengthens the support and performance optimization for mobile devices, and also draws on the advantages of good extensibility and high performance from existed open source efforts. TNN has been deployed in multiple Apps from Tencent, such as Mobile QQ, Weishi, Pitu, etc. Contributions are welcome to work in collaborative with us and make TNN a better framework.
coreml deep-learning face-detection hairsegmentaion inference mnn ncnn ocr openvino pytorch tengine tensorflow tensorrt
Last synced: 20 Mar 2025
https://github.com/rapidai/rapidocr
📄 Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO, PaddlePaddle and PyTorch.
chineseocr crnn dbnet easyocr ocr onnxocr onnxruntime openvino paddleocr rapidocr rapidocronnxruntime
Last synced: 14 May 2025
https://github.com/clovaai/deep-text-recognition-benchmark
Text recognition (optical character recognition) with deep learning methods, ICCV 2019
crnn deep-learning grcnn iccv2019 ocr ocr-recognition r2am rare recognition rosetta scene-text scene-text-recognition star-net text-recognition
Last synced: 14 May 2025
https://github.com/danily07/translumo
Advanced real-time screen translator for games, hardcoded subtitles in videos, static text and etc.
autotranslate easyocr game-translation mlnet ocr translation
Last synced: 14 May 2025
https://github.com/dmmaze/ballonstranslator
深度学习辅助漫画翻译工具, 支持一键机翻和简单的图像/文本编辑 | Yet another computer-aided comic/manga translation tool powered by deeplearning
auto-translation chinese-translation comics computer-aided-translation computer-vision deep-learning inpainting manga ocr pyqt pyqt6 pytorch qt qt6 scene-text-detection
Last synced: 13 May 2025
https://github.com/thejoefin/text-grab
Use OCR in Windows quickly and easily with Text Grab. With optional background process and notifications.
dotnet msix ocr window-10 windows windows-11 wpf
Last synced: 13 May 2025
https://github.com/dmMaze/BallonsTranslator
深度学习辅助漫画翻译工具, 支持一键机翻和简单的图像/文本编辑 | Yet another computer-aided comic/manga translation tool powered by deeplearning
auto-translation chinese-translation comics computer-aided-translation computer-vision deep-learning inpainting manga ocr pyqt pyqt6 pytorch qt qt6 scene-text-detection
Last synced: 11 Apr 2025
https://github.com/belval/textrecognitiondatagenerator
A synthetic data generator for text recognition
data dataset fake ocr synthetic text text-recognition training-set-generator
Last synced: 14 May 2025
https://github.com/breezedeus/CnOCR
CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scenarios and can be used directly after installation. 【基于 PyTorch/MXNet 的中文/英文 OCR Python 包。】
chinese-character-recognition english-character-recognition ocr ocr-python pytorch
Last synced: 04 Apr 2025
https://github.com/zggsong/stranslate
A ready-to-go translation ocr tool developed with WPF/WPF 开发的一款即用即走的翻译、OCR工具
ocr openai paddleocr stranslate wecharocr wpf writing zggsong
Last synced: 11 Apr 2025
https://github.com/eragonruan/text-detection-ctpn
text detection mainly based on ctpn model in tensorflow, id card detect, connectionist text proposal network
ctpn id-card ocr robust-reading tensorflow text-detection
Last synced: 14 May 2025
https://github.com/aim-uofa/adelaidet
AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
abcnet adelaidet blendmask boxinst condinst densecl fcos instance-segmentation meinst object-detection ocr solo solov2 text-detection text-recognition
Last synced: 14 May 2025
https://github.com/aim-uofa/AdelaiDet
AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
abcnet adelaidet blendmask boxinst condinst densecl fcos instance-segmentation meinst object-detection ocr solo solov2 text-detection text-recognition
Last synced: 02 Apr 2025
https://github.com/breezedeus/cnocr
CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scenarios and can be used directly after installation. 【基于 PyTorch/MXNet 的中文/英文 OCR Python 包。】
chinese-character-recognition english-character-recognition ocr ocr-python pytorch
Last synced: 06 Oct 2025
https://github.com/Belval/TextRecognitionDataGenerator
A synthetic data generator for text recognition
data dataset fake ocr synthetic text text-recognition training-set-generator
Last synced: 28 Mar 2025
https://github.com/TheJoeFin/Text-Grab
Use OCR in Windows quickly and easily with Text Grab. With optional background process and notifications.
dotnet msix ocr window-10 windows windows-11 wpf
Last synced: 01 Apr 2025
https://github.com/clovaai/craft-pytorch
Official implementation of Character Region Awareness for Text Detection (CRAFT)
craft curved-text cvpr2019 detection ocr ocr-detection pytorch text-detection
Last synced: 14 May 2025
https://github.com/clovaai/CRAFT-pytorch
Official implementation of Character Region Awareness for Text Detection (CRAFT)
craft curved-text cvpr2019 detection ocr ocr-detection pytorch text-detection
Last synced: 02 Apr 2025
https://github.com/ub-mannheim/tesseract
Tesseract Open Source OCR Engine (main repository)
lstm ocr ocr-d ocr-d-mp tesseract-ocr windows-build
Last synced: 06 Oct 2025
https://github.com/kerlomz/captcha_trainer
[验证码识别-训练] This project is based on CNN/ResNet/DenseNet+GRU/LSTM+CTC/CrossEntropy to realize verification code identification. This project is only for training the model.
captcha-recognition ocr tensorflow tensorflow-tutorials
Last synced: 14 May 2025
https://github.com/argman/east
A tensorflow implementation of EAST text detector
deep-learning ocr tensorflow text-detection
Last synced: 14 May 2025
https://github.com/argman/EAST
A tensorflow implementation of EAST text detector
deep-learning ocr tensorflow text-detection
Last synced: 02 Apr 2025
https://github.com/anylisten/tools-ocr
树洞 OCR 文字识别(一款跨平台的 OCR 小工具)
cross-platform javafx mac ocr screenshot windows
Last synced: 15 May 2025
https://github.com/AnyListen/tools-ocr
树洞 OCR 文字识别(一款跨平台的 OCR 小工具)
cross-platform javafx mac ocr screenshot windows
Last synced: 15 May 2025
https://github.com/thiagoalessio/tesseract-ocr-for-php
A wrapper to work with Tesseract OCR inside PHP.
image-to-text ocr php tesseract text-recognition
Last synced: 14 May 2025
https://github.com/xiaofengShi/CHINESE-OCR
[python3.6] 运用tf实现自然场景文字检测,keras/pytorch实现ctpn+crnn+ctc实现不定长场景文字OCR识别
ctpn keras-crnn lstm-ctc ocr pytorch-crnn tensorflow
Last synced: 02 Apr 2025
https://github.com/RapidAI/RapidOCR
Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVION and PaddlePaddle. (将PaddleOCR模型做了转换,采用ONNXRuntime推理,速度很快)
chineseocr crnn dbnet easyocr ocr onnxocr onnxruntime openvino paddleocr rapidocr
Last synced: 20 Mar 2025
https://github.com/otiai10/gosseract
Go package for OCR (Optical Character Recognition), by using Tesseract C++ library
go ocr ocr-server tesseract tesseract-ocr
Last synced: 23 Apr 2025
https://github.com/deepdoctection/deepdoctection
A Repo For Document AI
document-ai document-image-analysis document-layout-analysis document-parser document-understanding layoutlm nlp ocr publaynet pubtabnet python pytorch table-detection table-recognition tensorflow
Last synced: 04 Jan 2026
https://github.com/ypwhs/captcha_break
验证码识别
captcha captcha-breaking crnn ctc-loss deep-learning jupyter-notebook keras keras-tutorials ocr pytorch-tutorial
Last synced: 15 May 2025
https://github.com/alisen39/trwebocr
开源易用的中文离线OCR,识别率媲美大厂,并且提供了易用的web页面及web的接口,方便人类日常工作使用或者其他程序来调用~
docker ocr ocr-recognition python python3 web
Last synced: 14 Apr 2025
https://github.com/ciur/papermerge
Open Source Document Management System for Digital Archives (Scanned Documents)
archives django dms document-management ocr paperless pdf scan scanned-documents
Last synced: 13 May 2025
https://github.com/dicklesworthstone/llm_aided_ocr
Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.
ai-assist llama2 llm ocr ocr-correction tesseract
Last synced: 14 May 2025
https://github.com/alisen39/TrWebOCR
开源易用的中文离线OCR,识别率媲美大厂,并且提供了易用的web页面及web的接口,方便人类日常工作使用或者其他程序来调用~
docker ocr ocr-recognition python python3 web
Last synced: 22 Jul 2025
https://github.com/InkTimeRecord/TTime
🚀 Screenshots, word marking, OCR, AI, translation software || 截图、划词、文字识别、AI、翻译软件
ai exe macos ocr pc screenshots software translation ttime windows
Last synced: 24 Mar 2025
https://github.com/catchthetornado/text-extract-api
Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown
anonymization api extract json llm ocr ocr-python pdf pii
Last synced: 14 May 2025
https://github.com/Goldziher/kreuzberg
Document intelligence framework for Python - Extract text, metadata, and structured data from PDFs, images, Office documents, and more. Built on Pandoc, PDFium, and Tesseract.
async document-intelligence mcp metadata-extraction ocr pandoc pdf-extraction pdfium python rag table-extraction tesseract text-extraction
Last synced: 21 Oct 2025
https://github.com/openpaperwork/paperwork
Personal document manager (Linux/Windows) -- Moved to Gnome's Gitlab
dms document-management edms gtk gtk3 indexing ocr paperwork pdf personal-document-system python python3 sane scanner
Last synced: 18 Dec 2025
https://github.com/breezedeus/pix2text
An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported.
image-to-markdown latex latex-pdf layout-analysis math-formula math-formula-recognition math-ocr mathpix ocr python pytorch table-ocr
Last synced: 14 May 2025
https://github.com/codexu/note-gen
A cross-platform Markdown note-taking application dedicated to using AI to bridge recording and writing, organizing fragmented knowledge into a readable note.
ai app chatgpt deepseek gemini markdown nextjs notes ocr openai rust shadcn-ui tailwindcss tauri
Last synced: 14 May 2025
https://github.com/breezedeus/Pix2Text
An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported.
image-to-markdown latex latex-pdf layout-analysis math-formula math-formula-recognition math-ocr mathpix ocr python pytorch table-ocr
Last synced: 22 Apr 2025