Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
awesome-ai-papers
https://github.com/lyuwenyu/awesome-ai-papers
Last synced: 6 days ago
Multimodal Large Language Model
- MM-LLMs: Recent Advances in MultiModal Large Language Models - llm, survey
- Visual Instruction Tuning towards General-Purpose Multimodal Model: A Survey - llm, survey
- Visual Instruction Tuning
- OneLLM: One Framework to Align All Modalities with Language
- Generative Multimodal Models are In-Context Learners
- Data-Efficient Instruction Tuning for Alignment
- A Survey on Multimodal Large Language Models - llm, survey
- Improved Baselines with Visual Instruction Tuning - 1.5
- 4M: Massively Multimodal Masked Modeling
- Pixel Aligned Language Models - llm, google
- CAPSFUSION: Rethinking Image-Text Data at Scale
Computer Vision
Programming Languages
Keywords
- instruction-tuning: 2
- multi-modality: 2
- chain-of-thought: 1
- in-context-learning: 1
- instruction-following: 1
- large-language-models: 1
- large-vision-language-model: 1
- large-vision-language-models: 1
- multimodal-chain-of-thought: 1
- multimodal-in-context-learning: 1
- multimodal-instruction-tuning: 1
- multimodal-large-language-models: 1
- visual-chain-of-thought: 1
- visual-in-context-learning: 1
- visual-instruction-tuning: 1
- chatbot: 1
- chatgpt: 1
- foundation-models: 1
- gpt-4: 1
- llama: 1
- llama-2: 1
- llama2: 1
- llava: 1
- multimodal: 1
- vision-language-model: 1
- visual-language-learning: 1