"multimodal" Awesome Lists
Awesome-AIGC-Tutorials
Curated tutorials and resources for Large Language Models, AI Painting, and more.
ai aigc awesome chatgpt courses-resource deep-learning llm midjourney multimodal nlp
4,298 stars
290 forks
140 projects
Last updated: 05 Sep 2025
Awesome-LLM-Reasoning
From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓
awesome chain-of-thought chatgpt cot deepseek deepseek-r1 gpt gpt-4o in-context-learning language-models
3,388 stars
200 forks
161 projects
Last updated: 20 Oct 2025
Awesome-Text-to-Image
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
awseome-list generative-adversarial-network image-generation image-manipulation image-synthesis multimodal multimodal-deep-learning survey text-to-face text-to-image
2,385 stars
203 forks
433 projects
Last updated: 16 Sep 2025
Awesome-Multimodal-Research
A curated list of multimodal-related research.
awesome multimodal multimodal-learning multimodal-research
1,367 stars
150 forks
38 projects
Last updated: 18 Sep 2025
awesome-japanese-llm
Overview of Japanese LLMs
foundation-models generative-ai generative-model generative-models japanese japanese-language japanese-language-model japanese-llm language-model language-models
1,232 stars
38 forks
354 projects
Last updated: 02 Oct 2025
awesome-vlm-architectures
Famous Vision Language Models and Their Architectures
awesome awesome-list blip clip cogvlm image-encoder internlm kosmos llava multimodal
966 stars
46 forks
146 projects
Last updated: 06 Aug 2025
Awesome-MCoT
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey
chain-of-thought cot deepseek-r1 instruction-tuning large-vision-language-model mcts mllm-reasoning multimodal multimodal-chain-of-thought multimodal-large-language-models
833 stars
25 forks
70 projects
Last updated: 26 Sep 2025
Awesome-Reasoning-Foundation-Models
✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models
foundation-models llm llm-reasoning multimodal reasoning reasoning-agent reasoning-language-models
625 stars
56 forks
991 projects
Last updated: 13 Oct 2025
Awesome-Multimodal-LLM
Research Trends in LLM-guided Multimodal Learning.
in-context-learning instruction-tuning large-language-models llm multimodal multimodal-large-language-models multimodal-learning parameter-efficient-learning parameter-efficient-tuning
355 stars
16 forks
38 projects
Last updated: 08 Sep 2025
Awesome-Multimodal-LLM-Autonomous-Driving
[WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving
autonomous-car autonomous-driving autonomous-vehicles foundation-models large-language-models multimodal multimodal-large-language-models self-driving vision-language-model vision-transformer
293 stars
14 forks
53 projects
Last updated: 19 Aug 2025
World-Simulator
Simulating the Real World: Survey & Resources, which contains our survey "Simulating the Real World: A Unified Survey of Multimodal Generative Models" and Awesome-Text2X-Resources. Watch this repository for the latest updates! 🔥
3d-generation 4d-generation aigc awesome-list generative-models image-generation multimodal survey text2x video-generation
291 stars
15 forks
344 projects
Last updated: 12 Sep 2025
Awesome-Biomolecule-Language-Cross-Modeling
Awesome-Biomolecule-Language-Cross-Modeling: a curated list of resources for the paper "Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A Survey"
bioinformatics biomolecule multimodal natural-language-processing
226 stars
14 forks
478 projects
Last updated: 28 Sep 2025
GPA-LM
This repo is a live list of papers on game playing and large multimodal models - "A Survey on Game Playing Agents and Large Models: Methods, Applications, and Challenges".
agent-framework agents ai awesome-list gameai gameplay games gcc general-computer-control generative-ai
153 stars
7 forks
126 projects
Last updated: 22 Aug 2025
awesome-cvpr-2024
🤩 An AWESOME Curated List of Papers, Workshops, Datasets, and Challenges from CVPR 2024
awesome awesome-list awesome-lists awesome-readme computervision cvpr cvpr2024 diffusion multimodal
143 stars
9 forks
65 projects
Last updated: 27 Sep 2025
awesome-salient-object-detection
A curated list of awesome resources for salient object detection (SOD), focusing more on multi-modal SOD, such as RGB-D SOD.
multimodal object-segmentation rgbd-sod salient-object-detection sod
130 stars
8 forks
117 projects
Last updated: 09 Sep 2025
awesome-mmps
Corpus of resources for multimodal machine learning with physiological signals (mmps).
biosignals deep-learning machine-learning multimodal multimodal-data multimodal-deep-learning multimodal-learning physiological-signals signal-processing wearable
115 stars
5 forks
433 projects
Last updated: 15 Sep 2025
awesome-LLMs-finetuning
Collection of resources for finetuning Large Language Models (LLMs).
llm llm-fine-tuning llm-finetuning mmllm multimodal
101 stars
10 forks
78 projects
Last updated: 23 Sep 2025
awesome-code-benchmark
A comprehensive review of code-domain benchmarks in LLM research.
awesome benchmarks bug-fixing code-completion code-efficiency code-generation codellm codellms data-science llm-benchmarking
79 stars
7 forks
313 projects
Last updated: 25 Aug 2025
Awesome-Multimodal-Chatbot
Awesome Multimodal Assistant is a curated list of multimodal chatbots/conversational assistants that utilize various modes of interaction, such as text, speech, images, and videos, to provide a seamless and versatile user experience.
awesome chat-application chatbot general-ai instruction-following instruction-tuning multimodal multimodal-assistant multimodal-dialogue papers
78 stars
7 forks
61 projects
Last updated: 27 Jun 2025
awesome-ai-papers
This repository is used to collect papers and code in the field of AI.
ai ai-agents algorithms artificial-intelligence awesome-lists computer-vision cv deep-learning diffusion-models gnn
75 stars
6 forks
1,443 projects
Last updated: 15 Oct 2025
Awesome-Multi-Modal-Dialog
[Paperlist] Awesome paper list of multimodal dialog, including methods, datasets and metrics
awesome-list dialogue dialogue-system multimodal multimodal-datasets multimodal-deep-learning multimodal-dialogue multimodal-learning paperlist
37 stars
4 forks
67 projects
Last updated: 29 Sep 2025
awesome-data-quality
A comprehensive collection of data quality resources, tools, papers, and projects across various data types including traditional data, LLM pretraining/fine-tuning data, multimodal data, and more. Essential reference for researchers and practitioners in data-centric AI.
awesome awesome-list data-centric-ai data-cleaning data-curation data-preprocessing data-quality data-validation graph-data llm
15 stars
3 forks
86 projects
Last updated: 29 Aug 2025
awesome-clip-papers
The most impactful papers related to contrastive pretraining for multimodal models!
awesome awesome-list awesome-readme clip clip-model contrastive-learning multimodal pretraining
10 stars
0 forks
35 projects
Last updated: 29 Mar 2024
awesome-multimodal-convai
Paper reading list for Multimodal Conversational AI
computer-vision conversational-ai deep-learning dialogue-systems language machine-learning multimodal natural-language-processing nlp papers
4 stars
0 forks
78 projects
Last updated: 18 Feb 2025
awesome-turkish-vlm
A curated list of models, datasets and other useful resources for Turkish Vision-Language Models (VLM).
awesome awesome-list computer-vision datasets deep-learning fine-tuning multimodal nlp pretrained-models turkish
3 stars
0 forks
37 projects
Last updated: 29 Jul 2025
awesome-deepseek
A curated list of DeepSeek resources, including cutting-edge AI models, developer tools, research papers, and community projects.
ai awesom awesome-list deepseek llm multimodal
3 stars
0 forks
42 projects
Last updated: 25 Aug 2025
awesome-grok
A curated list of resources for Grok, the AI developed by xAI
artificial-intelligence awesome-list developer-tools grok grok-api grok-chatbot grok-cli grok-sdk grok4 language-model
0 stars
0 forks
31 projects
Last updated: 22 Jul 2025