https://github.com/X-PLUG/mPLUG-DocOwl
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
https://github.com/X-PLUG/mPLUG-DocOwl
chart-understanding document-understanding mllm multimodal multimodal-large-language-models table-understanding
Last synced: about 1 year ago
JSON representation
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
- Host: GitHub
- URL: https://github.com/X-PLUG/mPLUG-DocOwl
- Owner: X-PLUG
- License: apache-2.0
- Created: 2023-07-04T01:18:19.000Z (almost 3 years ago)
- Default Branch: main
- Last Pushed: 2024-09-28T15:18:33.000Z (over 1 year ago)
- Last Synced: 2024-11-04T21:11:32.609Z (over 1 year ago)
- Topics: chart-understanding, document-understanding, mllm, multimodal, multimodal-large-language-models, table-understanding
- Language: Python
- Homepage:
- Size: 105 MB
- Stars: 1,511
- Watchers: 30
- Forks: 99
- Open Issues: 57
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-ai-for-science - mPLUG-PaperOwl - Multimodal LLM for scientific charts and diagrams understanding/generation (📄 Paper→Poster / Slides / Graphical Abstract / Poster Generation)
- awesome-llm-projects - mPLUG-DocOwl