Projects in Awesome Lists tagged with layout-analysis
A curated list of projects in awesome lists tagged with layout-analysis .
https://github.com/opendatalab/mineru
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
ai4science document-analysis extract-data layout-analysis ocr parser pdf pdf-converter pdf-extractor-llm pdf-extractor-pretrain pdf-extractor-rag pdf-parser python
Last synced: 06 Jan 2026
https://github.com/opendatalab/MinerU
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
ai4science document-analysis extract-data layout-analysis ocr parser pdf pdf-converter pdf-extractor-llm pdf-extractor-pretrain pdf-extractor-rag pdf-parser python
Last synced: 24 Mar 2025
https://github.com/layout-parser/layout-parser
A Unified Toolkit for Deep Learning Based Document Image Analysis
computer-vision deep-learning detectron2 document-image-processing document-layout-analysis layout-analysis layout-detection layout-parser object-detection ocr
Last synced: 13 May 2025
https://github.com/Layout-Parser/layout-parser
A Unified Toolkit for Deep Learning Based Document Image Analysis
computer-vision deep-learning detectron2 document-image-processing document-layout-analysis layout-analysis layout-detection layout-parser object-detection ocr
Last synced: 15 Mar 2025
https://github.com/breezedeus/pix2text
An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported.
image-to-markdown latex latex-pdf layout-analysis math-formula math-formula-recognition math-ocr mathpix ocr python pytorch table-ocr
Last synced: 07 Feb 2026
https://github.com/breezedeus/Pix2Text
An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported.
image-to-markdown latex latex-pdf layout-analysis math-formula math-formula-recognition math-ocr mathpix ocr python pytorch table-ocr
Last synced: 22 Apr 2025
https://github.com/uglytoad/pdfpig
Read and extract text and other content from PDFs in C# (port of PDFBox)
alto-xml csharp document-analysis hocr layout-analysis netstandard page-xml pdf pdf-document pdf-document-processor pdf-extractor pdf-files pdf-generation pdfbox
Last synced: 10 May 2025
https://github.com/UglyToad/PdfPig
Read and extract text and other content from PDFs in C# (port of PDFBox)
alto-xml csharp document-analysis hocr layout-analysis netstandard page-xml pdf pdf-document pdf-document-processor pdf-extractor pdf-files pdf-generation pdfbox
Last synced: 24 Mar 2025
https://github.com/kotaro-kinoshita/yomitoku
YomiTokuはAIを活用した日本語文書解析エンジンを提供するPythonパッケージです。 Yomitoku is an AI-powered document image analysis package designed specifically for the Japanese language.
deep-learning layout-analysis ocr python pytorch
Last synced: 14 Jan 2026
https://github.com/mittagessen/kraken
OCR engine for all the languages
alto-xml handwritten-text-recognition hocr htr layout-analysis neural-networks ocr optical-character-recognition page-xml
Last synced: 14 Mar 2025
https://github.com/BobLd/DocumentLayoutAnalysis
Document Layout Analysis resources repos for development with PdfPig.
alto alto-xml csharp docstrum document-layout-analysis hocr hocr-documents layout-analysis page-segmentation page-xml pdf pdfpig recursive-xy-cut table-extraction tei xy-cut xycut
Last synced: 10 May 2025
https://github.com/bobld/documentlayoutanalysis
Document Layout Analysis resources repos for development with PdfPig.
alto alto-xml csharp docstrum document-layout-analysis hocr hocr-documents layout-analysis page-segmentation page-xml pdf pdfpig recursive-xy-cut table-extraction tei xy-cut xycut
Last synced: 04 Apr 2025
https://github.com/mindspore-lab/mindocr
A toolbox of ocr models and algorithms based on MindSpore
crnn dbnet deep-learning key-information-extraction layout-analysis layoutxlm mindspore ocr ocr-large-model table-recognition tablemaster text-detection text-recognition vary-toy
Last synced: 11 Mar 2026
https://github.com/rapidai/rapiddoc
📝 针对文档类图像做内容提取,将文档类图像一比一输出到Word或者Txt中,便于进一步使用或处理。后续计划支持输入PDF/图像,输出对应json格式、Txt格式、Word格式和Markdown格式。
layout-analysis layout-recover
Last synced: 24 May 2026
https://github.com/rapidai/rapidlayout
Analysis of Chinese and English layouts 中英文版面分析
cdla doclayout-yolo layout layout-analysis pp-structure
Last synced: 02 Apr 2026
https://github.com/ppaanngggg/yolo-doclaynet
YOLO models trained by DocLayNet - power your Document Intelligent by Layout Analysis
doclaynet document-analysis layout-analysis ultralytics yolo yolov8
Last synced: 28 Jan 2026
https://github.com/jpleorx/detectron2-publaynet
Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset
artificial-intelligence computer-vision deep-learning detectron2 document-analysis document-classification document-layout document-layout-analysis faster-rcnn instance-segmentation layout-analysis machine-learning neural-network neural-networks object-detection publaynet python python3 pytorch
Last synced: 10 May 2025
https://github.com/bobld/pdfpigmlnetblockclassifier
Proof of concept of training a simple Region Classifier using PdfPig and ML.NET (LightGBM). The objective is to classify each text block in a pdf document page as either title, text, list, table and image.
classifier csharp document-layout document-layout-analysis layout-analysis lightgbm machine-learning ml-net pdf pdf-document pdf-document-processor pdfpig publaynet
Last synced: 14 Apr 2025
https://github.com/hadro/directory-pipeline
A pipeline for turning digital collections into structured data -- an LLM assisted, IIIF-native tool to jump into working with sources like digitized print directories.
annotations city-directories cultural-heritage digitization directories iiif layout-analysis ner ocr surya-ocr
Last synced: 20 Jun 2026
https://github.com/aidayang/mineru-oneclick
MinerU免安装部署一键启动整合包
ai4science document-analysis extract-data layout-analysis markdown mineru ocr parser pdf pdf-converter pdf-extractor-llm pdf-extractor-pretrain pdf-extractor-rag pdf-parser pdftojson pdftomarkdown python
Last synced: 12 Jul 2025
https://github.com/rithulkamesh/docproc
Document Intelligence Platform — Extract, refine, and query documents with vision LLMs and config-driven RAG.
content-extraction data-extraction document-analysis document-parsing equation-detection layout-analysis machine-learning mathematical-symbols ocr pdf-processing pdf-text-extraction python region-detection text-classification text-extraction
Last synced: 02 Apr 2026
https://github.com/os-climate/crrf-det
A web application for PDF content and table extraction, featuring image-based visual layout analysis, indexed document search, batch processing and extraction result annotation.
annotation data-extraction layout-analysis pdf table-extraction
Last synced: 12 Apr 2025
https://github.com/rushi-balapure/pdf_2_json_extractor
A high-performance Python library for extracting structured content from PDF documents with layout-aware text extraction. pdf_to_json preserves document structure including headings (H1-H6) and body text, outputting clean JSON format.
cli-tool cpu-only cross-platform data-extraction document-parsing document-processing json layout-analysis nlp offline pdf pdf-extraction pdf-parser pdf-processing pdf-to-json python python-library structure-extraction text-extraction
Last synced: 21 Apr 2026
https://github.com/ixalodecte/filestruct
A python package to structure files using visual and style informations
Last synced: 14 Jan 2026
https://github.com/yuvaraj3855/preocr
Fast document classification and OCR detection. Analyzes any file type to determine if OCR is needed, saving time and money on unnecessary processing.
computer-vision document-analysis document-classification document-intelligence document-processing document-understanding file-analysis image-processing layout-analysis ocr ocr-detection opencv pdf pdf-analysis pdf-parsing preprocessing python python-library text-detection text-extraction
Last synced: 16 Feb 2026
https://github.com/u9401066/asset-aware-mcp
Asset-Aware MCP Server — AI Agent precisely accesses tables, figures, sections from PDFs + .docx round-trip editing (DFM) with 46 tools / 13 resources, segmentation export, layout overlay, OCR preprocessing, knowledge graph (LightRAG)
ai document-processing docx etl fastmcp knowledge-graph layout-analysis lightrag llm mcp mcp-server medical ocr pdf python rag segmentation
Last synced: 13 May 2026
https://github.com/colintr/livedesktoptranslator
Live capture your screen and replace textual elements with their translations
electron layout-analysis ocr python translation
Last synced: 30 Apr 2026