0 "multimodal-deep-learning" Awesome Lists
Awesome-Text-to-Image
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
awseome-list generative-adversarial-network image-generation image-manipulation image-synthesis multimodal multimodal-deep-learning survey text-to-face text-to-image
2,394 stars
205 forks
433 projects
Last updated: 16 Oct 2025
awesome-vision-language-pretraining-papers
Recent Advances in Vision and Language PreTrained Models (VL-PTMs)
bert multimodal-deep-learning pretraining vision-and-language vl-ptms
1,156 stars
104 forks
82 projects
Last updated: 03 Sep 2025
awesome-grounding
awesome grounding: A curated list of research papers in visual grounding
arxiv awesome-list captioning-images captioning-videos computer-vision embodied-agent grounding image-grounding language-grounding multimodal-deep-learning
1,118 stars
105 forks
157 projects
Last updated: 31 Oct 2025
awesome-multimodal-in-medical-imaging
A collection of resources on applications of multi-modal learning in medical imaging.
large-language-models large-multimodal-models medical-imaging medical-report-generation multimodal-deep-learning multimodal-large-language-models multimodal-learning visual-question-answering
823 stars
75 forks
298 projects
Last updated: 12 Sep 2025
Awesome-Multimodality
A Survey on multimodal learning research.
awesome-list multimodal-deep-learning multimodality
333 stars
22 forks
87 projects
Last updated: 13 Oct 2025
multimodal-ml-music
List of academic resources on Multimodal ML for Music
academic-publications awesome-list multimodal-data multimodal-deep-learning multimodal-learning music-ai music-information-retrieval music-research resources
296 stars
11 forks
59 projects
Last updated: 30 Sep 2025
awesome-Vision-and-Language-Pre-training
Recent Advances in Vision and Language Pre-training (VLP)
multimodal-deep-learning pretraining vision-and-language vision-and-language-pre-training vlp
294 stars
16 forks
107 projects
Last updated: 04 Sep 2025
awesome-emotion-recognition-in-conversations
A comprehensive reading list for Emotion Recognition in Conversations
conversational-ai dialogue-systems emotion-recognition emotion-recognition-in-conversation multimodal-deep-learning multimodal-interactions natural-language-processing
275 stars
45 forks
95 projects
Last updated: 17 Oct 2025
awesome-vision-language-models-for-earth-observation
A curated list of awesome vision and language resources for earth observation.
awesome awesome-list earth-observation multimodal-deep-learning remote-sensing vision-and-language
250 stars
18 forks
121 projects
Last updated: 20 Oct 2025
awesome-multimodal-time-series
A curated list of paper, code, data, and other resources focus on multimodal time series analysis.
awesome awesome-list awesome-resources multimodal-deep-learning multimodal-time-series multimodality time-series time-series-analysis time-series-forecasting
150 stars
10 forks
40 projects
Last updated: 24 Sep 2025
awesome-mmps
Corpus of resources for multimodal machine learning with physiological signals (mmps).
biosignals deep-learning machine-learning multimodal multimodal-data multimodal-deep-learning multimodal-learning physiological-signals signal-processing wearable
126 stars
5 forks
433 projects
Last updated: 24 Oct 2025
awesome-rvos
Referring Video Object Segmentation / Multi-Object Tracking Repo
image linguistic multi-modal multimodal-deep-learning refer-segmentation refer-vos refering-seg rvos segmentation text
89 stars
4 forks
44 projects
Last updated: 12 Nov 2025
Awesome-Multi-Modal-Dialog
[Paperlist] Awesome paper list of multimodal dialog, including methods, datasets and metrics
awesome-list dialogue dialogue-system multimodal multimodal-datasets multimodal-deep-learning multimodal-dialogue multimodal-learning paperlist
37 stars
4 forks
67 projects
Last updated: 29 Sep 2025
awesome-visual-dialog
Recent Advances in Visual Dialog
guesswhat multimodal-deep-learning multimodal-dialogue visual-dialog
30 stars
1 forks
33 projects
Last updated: 20 Sep 2024