Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with document-understanding
A curated list of projects in awesome lists tagged with document-understanding .
https://github.com/infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
agent agents ai-search chatbot chatgpt data-pipelines deep-learning document-parser document-understanding genai graph graphrag llm nlp pdf-to-text preprocessing rag retrieval-augmented-generation table-structure-recognition text2sql
Last synced: 16 Dec 2024
https://github.com/deepdoctection/deepdoctection
A Repo For Document AI
document-ai document-image-analysis document-layout-analysis document-parser document-understanding layoutlm nlp ocr publaynet pubtabnet python pytorch table-detection table-recognition tensorflow
Last synced: 17 Dec 2024
https://github.com/x-plug/mplug-docowl
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
chart-understanding document-understanding mllm multimodal multimodal-large-language-models table-understanding
Last synced: 19 Dec 2024
https://github.com/X-PLUG/mPLUG-DocOwl
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
chart-understanding document-understanding mllm multimodal multimodal-large-language-models table-understanding
Last synced: 17 Nov 2024
https://github.com/AlibabaResearch/AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
artificial-intelligence computer-vision document document-analysis document-intelligence document-recognition document-understanding documentai end-to-end-ocr multimodal multimodal-deep-learning ocr scene-text-detection scene-text-detection-recognition scene-text-recognition text-detection text-recognition vision-language vision-language-model vision-language-transformer
Last synced: 07 Nov 2024
https://github.com/alibabaresearch/advancedliteratemachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
artificial-intelligence computer-vision document document-analysis document-intelligence document-recognition document-understanding documentai end-to-end-ocr multimodal multimodal-deep-learning ocr scene-text-detection scene-text-detection-recognition scene-text-recognition text-detection text-recognition vision-language vision-language-model vision-language-transformer
Last synced: 31 Oct 2024
https://github.com/wenwenyu/PICK-pytorch
Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICPR 2020)
document-analysis document-understanding graph-convolutional-network graph-learning graph-neural-networks key-information-extraction
Last synced: 11 Nov 2024
https://github.com/googlecloudplatform/document-ai-samples
Sample applications and demos for Document AI, the end-to-end document processing platform on Google Cloud
document-understanding machine-learning ocr pdf python samples
Last synced: 15 Dec 2024
https://github.com/scut-dlvclab/document-ai-recommendations
Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.
document-ai document-understanding key-information-extraction table-structure-recognition visual-information-extraction
Last synced: 19 Nov 2024
https://github.com/doc-analysis/readingbank
ReadingBank: A Benchmark Dataset for Reading Order Detection
document-ai document-intelligence document-understanding natural-language-processing nlp ocr
Last synced: 01 Dec 2024
https://nextplusplus.github.io/TAT-DQA/
TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning
document-understanding question-answering vqa
Last synced: 11 Oct 2024
https://github.com/scut-dlvclab/rfund
Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction"
document-ai document-understanding key-information-extraction ocr visual-information-extraction
Last synced: 19 Nov 2024
https://github.com/zeninglin/peneo
[MM'2024] Official implementation of "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction."
document-ai document-understanding key-information-extraction ocr visual-information-extraction
Last synced: 30 Oct 2024
https://github.com/extrievetechnologies/quickcapture_ios
QuickCapture Mobile Scanning SDK Specially designed for native IOS
document-classification document-scanner-app document-scanning-sdk document-understanding ios objective-c swift
Last synced: 14 Dec 2024
https://github.com/extrievetechnologies/quickcapture_android
QuickCapture Mobile Scanning SDK Specially designed for native ANDROID from Extrieve
android document-scanner document-scanner-app document-scanning-sdk document-understanding java kotllin
Last synced: 14 Dec 2024
https://github.com/mycielski/textract_study
Analysing expense reports/invoices with AWS Textract and boto3.
aws aws-cli boto3 document-understanding expenses invoices script shell textract
Last synced: 09 Nov 2024