0 "multimodal-deep-learning" Awesome Lists
Awesome-Text-to-Image
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
awseome-list generative-adversarial-network image-generation image-manipulation image-synthesis multimodal multimodal-deep-learning survey text-to-face text-to-image
2,435 stars
205 forks
436 projects
Last updated: 21 May 2026
awesome-vision-language-pretraining-papers
Recent Advances in Vision and Language PreTrained Models (VL-PTMs)
bert multimodal-deep-learning pretraining vision-and-language vl-ptms
1,157 stars
104 forks
82 projects
Last updated: 12 Apr 2026
awesome-grounding
awesome grounding: A curated list of research papers in visual grounding
arxiv awesome-list captioning-images captioning-videos computer-vision embodied-agent grounding image-grounding language-grounding multimodal-deep-learning
1,125 stars
104 forks
157 projects
Last updated: 30 Apr 2026
awesome-multimodal-in-medical-imaging
A collection of resources on applications of multi-modal learning in medical imaging.
large-language-models large-multimodal-models medical-imaging medical-report-generation multimodal-deep-learning multimodal-large-language-models multimodal-learning visual-question-answering
963 stars
81 forks
305 projects
Last updated: 07 Jun 2026
Awesome-Multimodality
A Survey on multimodal learning research.
awesome-list multimodal-deep-learning multimodality
333 stars
22 forks
87 projects
Last updated: 25 Apr 2026
multimodal-ml-music
List of academic resources on Multimodal ML for Music
academic-publications awesome-list multimodal-data multimodal-deep-learning multimodal-learning music-ai music-information-retrieval music-research resources
300 stars
12 forks
59 projects
Last updated: 13 May 2026
awesome-Vision-and-Language-Pre-training
Recent Advances in Vision and Language Pre-training (VLP)
multimodal-deep-learning pretraining vision-and-language vision-and-language-pre-training vlp
297 stars
14 forks
107 projects
Last updated: 01 Apr 2026
awesome-emotion-recognition-in-conversations
A comprehensive reading list for Emotion Recognition in Conversations
conversational-ai dialogue-systems emotion-recognition emotion-recognition-in-conversation multimodal-deep-learning multimodal-interactions natural-language-processing
279 stars
44 forks
95 projects
Last updated: 05 Jun 2026
awesome-vision-language-models-for-earth-observation
A curated list of awesome vision and language resources for earth observation.
awesome awesome-list earth-observation multimodal-deep-learning remote-sensing vision-and-language
254 stars
21 forks
121 projects
Last updated: 24 May 2026
awesome-multimodal-time-series
A curated list of paper, code, data, and other resources focus on multimodal time series analysis.
awesome awesome-list awesome-resources multimodal-deep-learning multimodal-time-series multimodality time-series time-series-analysis time-series-forecasting
223 stars
15 forks
40 projects
Last updated: 22 May 2026
awesome-mmps
Corpus of resources for multimodal machine learning with physiological signals (mmps).
biosignals deep-learning machine-learning multimodal multimodal-data multimodal-deep-learning multimodal-learning physiological-signals signal-processing wearable
161 stars
8 forks
445 projects
Last updated: 03 Jun 2026
awesome-rvos
Referring Video Object Segmentation / Multi-Object Tracking Repo
image linguistic multi-modal multimodal-deep-learning refer-segmentation refer-vos refering-seg rvos segmentation text
91 stars
4 forks
44 projects
Last updated: 28 May 2026
Awesome-Multi-Modal-Dialog
[Paperlist] Awesome paper list of multimodal dialog, including methods, datasets and metrics
awesome-list dialogue dialogue-system multimodal multimodal-datasets multimodal-deep-learning multimodal-dialogue multimodal-learning paperlist
37 stars
4 forks
67 projects
Last updated: 29 Sep 2025
awesome-visual-dialog
Recent Advances in Visual Dialog
guesswhat multimodal-deep-learning multimodal-dialogue visual-dialog
28 stars
1 forks
33 projects
Last updated: 28 Apr 2026
awesome-open-papernotes
Yet another Ph.D. adventure.
multimodal multimodal-association multimodal-correlation multimodal-deep-learning multimodal-representation mutil-modal paper paper-arxiv sensor-fusion
22 stars
4 forks
503 projects
Last updated: 22 Jan 2026