An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with document-layout-analysis

A curated list of projects in awesome lists tagged with document-layout-analysis .

https://github.com/phamquiluan/publaynet

ICDAR 2019: MaskRCNN on PubLayNet datasets. Paragraph detection, table detection, figure detection,...

document-layout-analysis figure-detection mask-rcnn object-detection paragraph-detection pretrained-models publaynet pytorch table-detection

Last synced: 17 Aug 2025

https://github.com/phamquiluan/PubLayNet

ICDAR 2019: MaskRCNN on PubLayNet datasets. Paragraph detection, table detection, figure detection,...

document-layout-analysis figure-detection mask-rcnn object-detection paragraph-detection pretrained-models publaynet pytorch table-detection

Last synced: 20 Jul 2025

https://github.com/bobld/pdfpigmlnetblockclassifier

Proof of concept of training a simple Region Classifier using PdfPig and ML.NET (LightGBM). The objective is to classify each text block in a pdf document page as either title, text, list, table and image.

classifier csharp document-layout document-layout-analysis layout-analysis lightgbm machine-learning ml-net pdf pdf-document pdf-document-processor pdfpig publaynet

Last synced: 14 Apr 2025

https://github.com/bobld/simple-docstrum

A step-by-step C# implementation of the Docstrum algorithm

csharp docstrum document-layout-analysis dotnet pdf pdfpig

Last synced: 23 Jun 2025

https://github.com/hpanwar08/document-layout-analysis-app

Simple docker deployment of document layout analysis using detectron2

docker document-layout-analysis python reactjs

Last synced: 10 May 2025

https://github.com/bobld/publaynet-maskrcnn-mlnet

Using a MaskRCNN model trained on the PublayNet dataset with ML.Net in C# / .Net for Document layout analysis and page segmmentation task.

csharp document-layout-analysis dotnet figure-detection mask-detection mask-rcnn mlnet ocr onnx page-segmentation paragraph-detection pretrained-models publaynet table-detection

Last synced: 31 Jul 2025

https://github.com/bobld/pdfpigsvmregionclassifier

Proof of concept of a simple SVM Region Classifier using PdfPig and Accord.Net. The objective is to classify each text block in a pdf document page as either title, text, list, table and image.

accord-net csharp document-layout-analysis machine-learning pdf pdf-document pdfpig publaynet support-vector-machine svm svm-classifier svm-training

Last synced: 14 Apr 2025

https://github.com/huythai855/quizvista

Hệ thống sinh bài thi trắc nghiệm sử dụng trí tuệ nhân tạo - QuizVista

document-layout-analysis function-calling quiz-generator retrieval-augmented-generation

Last synced: 27 Aug 2025

https://github.com/mansurpro/docuparse

DocuParse is a high-performance tool for converting PDF documents into clean, structured Markdown files. Designed for speed and accuracy, it extracts and formats content while minimizing errors like hallucinations and repetitions.

digital-archive document-layout-analysis google-colab huggingface-transformers markdown-conversion pdf-parsing pdf-to-markdown tesseract-ocr text-extraction

Last synced: 19 Jan 2026