An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with layout-analysis

A curated list of projects in awesome lists tagged with layout-analysis.

https://github.com/opendatalab/mineru

A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF into Markdown and JSON formats.

ai4science document-analysis extract-data layout-analysis ocr parser pdf pdf-converter pdf-extractor-llm pdf-extractor-pretrain pdf-extractor-rag pdf-parser python

Last synced: 06 Jan 2026

https://github.com/opendatalab/MinerU

A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF into Markdown and JSON formats.

ai4science document-analysis extract-data layout-analysis ocr parser pdf pdf-converter pdf-extractor-llm pdf-extractor-pretrain pdf-extractor-rag pdf-parser python

Last synced: 24 Mar 2025

https://github.com/breezedeus/pix2text

An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported.

image-to-markdown latex latex-pdf layout-analysis math-formula math-formula-recognition math-ocr mathpix ocr python pytorch table-ocr

Last synced: 07 Feb 2026

https://github.com/breezedeus/Pix2Text

An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported.

image-to-markdown latex latex-pdf layout-analysis math-formula math-formula-recognition math-ocr mathpix ocr python pytorch table-ocr

Last synced: 22 Apr 2025

https://github.com/kotaro-kinoshita/yomitoku

YomiToku is a Python package that provides an AI-powered Japanese document analysis engine. It is a document image analysis package designed specifically for the Japanese language.

deep-learning layout-analysis ocr python pytorch

Last synced: 14 Jan 2026

https://github.com/rapidai/rapiddoc

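📝 Extracts content from document images and reproduces it one-to-one in Word or Txt output for further use or processing. Planned future support: PDF/image input, with output in JSON, Txt, Word, and Markdown formats.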

layout-analysis layout-recover

Last synced: 24 May 2026

https://github.com/rapidai/rapidlayout

Layout analysis for Chinese and English documents.

cdla doclayout-yolo layout layout-analysis pp-structure

Last synced: 02 Apr 2026

https://github.com/ppaanngggg/yolo-doclaynet

YOLO models trained on DocLayNet: power your document intelligence with layout analysis.

doclaynet document-analysis layout-analysis ultralytics yolo yolov8

Last synced: 28 Jan 2026

https://github.com/bobld/pdfpigmlnetblockclassifier

Proof of concept of training a simple region classifier using PdfPig and ML.NET (LightGBM). The objective is to classify each text block in a PDF document page as one of title, text, list, table, or image.

classifier csharp document-layout document-layout-analysis layout-analysis lightgbm machine-learning ml-net pdf pdf-document pdf-document-processor pdfpig publaynet

Last synced: 14 Apr 2025

https://github.com/hadro/directory-pipeline

A pipeline for turning digital collections into structured data: an LLM-assisted, IIIF-native tool for getting started with sources like digitized print directories.

annotations city-directories cultural-heritage digitization directories iiif layout-analysis ner ocr surya-ocr

Last synced: 20 Jun 2026

https://github.com/os-climate/crrf-det

A web application for PDF content and table extraction, featuring image-based visual layout analysis, indexed document search, batch processing and extraction result annotation.

annotation data-extraction layout-analysis pdf table-extraction

Last synced: 12 Apr 2025

https://github.com/rushi-balapure/pdf_2_json_extractor

A high-performance Python library for extracting structured content from PDF documents with layout-aware text extraction. pdf_to_json preserves document structure, including headings (H1-H6) and body text, and outputs clean JSON.

cli-tool cpu-only cross-platform data-extraction document-parsing document-processing json layout-analysis nlp offline pdf pdf-extraction pdf-parser pdf-processing pdf-to-json python python-library structure-extraction text-extraction

Last synced: 21 Apr 2026

https://github.com/ixalodecte/filestruct

A Python package for structuring files using visual and style information.

layout-analysis parser pdf

Last synced: 14 Jan 2026

https://github.com/u9401066/asset-aware-mcp

Asset-Aware MCP Server: lets an AI agent precisely access tables, figures, and sections from PDFs, plus .docx round-trip editing (DFM), with 46 tools / 13 resources, segmentation export, layout overlay, OCR preprocessing, and a knowledge graph (LightRAG).

ai document-processing docx etl fastmcp knowledge-graph layout-analysis lightrag llm mcp mcp-server medical ocr pdf python rag segmentation

Last synced: 13 May 2026

https://github.com/colintr/livedesktoptranslator

Captures your screen live and replaces text elements with their translations.

electron layout-analysis ocr python translation

Last synced: 30 Apr 2026