Projects in Awesome Lists tagged with document-understanding

https://github.com/infiniflow/ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

agent agents ai-search chatbot chatgpt data-pipelines deep-learning document-parser document-understanding genai graph graphrag llm nlp pdf-to-text preprocessing rag retrieval-augmented-generation table-structure-recognition text2sql

Last synced: 16 Dec 2024

https://github.com/deepdoctection/deepdoctection

A Repo For Document AI

document-ai document-image-analysis document-layout-analysis document-parser document-understanding layoutlm nlp ocr publaynet pubtabnet python pytorch table-detection table-recognition tensorflow

Last synced: 17 Dec 2024

https://github.com/x-plug/mplug-docowl

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

chart-understanding document-understanding mllm multimodal multimodal-large-language-models table-understanding

Last synced: 19 Dec 2024

https://github.com/X-PLUG/mPLUG-DocOwl

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

chart-understanding document-understanding mllm multimodal multimodal-large-language-models table-understanding

Last synced: 17 Nov 2024

https://github.com/AlibabaResearch/AdvancedLiterateMachinery

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

artificial-intelligence computer-vision document document-analysis document-intelligence document-recognition document-understanding documentai end-to-end-ocr multimodal multimodal-deep-learning ocr scene-text-detection scene-text-detection-recognition scene-text-recognition text-detection text-recognition vision-language vision-language-model vision-language-transformer

Last synced: 07 Nov 2024

https://github.com/alibabaresearch/advancedliteratemachinery

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

artificial-intelligence computer-vision document document-analysis document-intelligence document-recognition document-understanding documentai end-to-end-ocr multimodal multimodal-deep-learning ocr scene-text-detection scene-text-detection-recognition scene-text-recognition text-detection text-recognition vision-language vision-language-model vision-language-transformer

Last synced: 31 Oct 2024

https://github.com/wenwenyu/PICK-pytorch

Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICPR 2020)

document-analysis document-understanding graph-convolutional-network graph-learning graph-neural-networks key-information-extraction

Last synced: 11 Nov 2024

https://github.com/googlecloudplatform/document-ai-samples

Sample applications and demos for Document AI, the end-to-end document processing platform on Google Cloud

document-understanding machine-learning ocr pdf python samples

Last synced: 15 Dec 2024

https://github.com/scut-dlvclab/document-ai-recommendations

Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.

document-ai document-understanding key-information-extraction table-structure-recognition visual-information-extraction

Last synced: 19 Nov 2024

https://github.com/doc-analysis/readingbank

ReadingBank: A Benchmark Dataset for Reading Order Detection

document-ai document-intelligence document-understanding natural-language-processing nlp ocr

Last synced: 01 Dec 2024

https://nextplusplus.github.io/TAT-DQA/

TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning

document-understanding question-answering vqa

Last synced: 11 Oct 2024

https://github.com/scut-dlvclab/rfund

Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction"

document-ai document-understanding key-information-extraction ocr visual-information-extraction

Last synced: 19 Nov 2024