"multimodal-deep-learning" Awesome Lists
Awesome-Text-to-Image
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
awseome-list generative-adversarial-network image-generation image-manipulation image-synthesis multimodal multimodal-deep-learning survey text-to-face text-to-image
2,325 stars
200 forks
429 projects
Last updated: 23 Apr 2025
awesome-vision-language-pretraining-papers
Recent Advances in Vision and Language PreTrained Models (VL-PTMs)
bert multimodal-deep-learning pretraining vision-and-language vl-ptms
1,152 stars
104 forks
82 projects
Last updated: 02 Apr 2025
awesome-grounding
awesome grounding: A curated list of research papers in visual grounding
arxiv awesome-list captioning-images captioning-videos computer-vision embodied-agent grounding image-grounding language-grounding multimodal-deep-learning
1,070 stars
100 forks
154 projects
Last updated: 28 Apr 2025
awesome-multimodal-in-medical-imaging
A collection of resources on applications of multi-modal learning in medical imaging.
large-language-models large-multimodal-models medical-imaging medical-report-generation multimodal-deep-learning multimodal-large-language-models multimodal-learning visual-question-answering
730 stars
65 forks
296 projects
Last updated: 04 May 2025
Awesome-Multimodality
A Survey on multimodal learning research.
awesome-list multimodal-deep-learning multimodality
324 stars
22 forks
87 projects
Last updated: 03 Apr 2025
multimodal-ml-music
List of academic resources on Multimodal ML for Music
academic-publications awesome-list multimodal-data multimodal-deep-learning multimodal-learning music-ai music-information-retrieval music-research resources
295 stars
11 forks
59 projects
Last updated: 22 Apr 2025
awesome-Vision-and-Language-Pre-training
Recent Advances in Vision and Language Pre-training (VLP)
multimodal-deep-learning pretraining vision-and-language vision-and-language-pre-training vlp
293 stars
16 forks
107 projects
Last updated: 25 May 2025
awesome-emotion-recognition-in-conversations
A comprehensive reading list for Emotion Recognition in Conversations
conversational-ai dialogue-systems emotion-recognition emotion-recognition-in-conversation multimodal-deep-learning multimodal-interactions natural-language-processing
269 stars
45 forks
95 projects
Last updated: 21 Apr 2025
awesome-vision-language-models-for-earth-observation
A curated list of awesome vision and language resources for earth observation.
awesome awesome-list earth-observation multimodal-deep-learning remote-sensing vision-and-language
219 stars
17 forks
121 projects
Last updated: 11 Apr 2025
awesome-mmps
Corpus of resources for multimodal machine learning with physiological signals (mmps).
biosignals deep-learning machine-learning multimodal multimodal-data multimodal-deep-learning multimodal-learning physiological-signals signal-processing wearable
88 stars
2 forks
405 projects
Last updated: 16 Jun 2025
awesome-rvos
Referring Video Object Segmentation / Multi-Object Tracking Repo
image linguistic multi-modal multimodal-deep-learning refer-segmentation refer-vos refering-seg rvos segmentation text
87 stars
4 forks
44 projects
Last updated: 26 Sep 2024
awesome-multimodal-time-series
A curated list of paper, code, data, and other resources focus on multimodal time series analysis.
awesome awesome-list awesome-resources multimodal-deep-learning multimodal-time-series multimodality time-series time-series-analysis time-series-forecasting
71 stars
4 forks
36 projects
Last updated: 06 May 2025
Awesome-Multi-Modal-Dialog
[Paperlist] Awesome paper list of multimodal dialog, including methods, datasets and metrics
awesome-list dialogue dialogue-system multimodal multimodal-datasets multimodal-deep-learning multimodal-dialogue multimodal-learning paperlist
39 stars
4 forks
67 projects
Last updated: 07 Feb 2025
awesome-visual-dialog
Recent Advances in Visual Dialog
guesswhat multimodal-deep-learning multimodal-dialogue visual-dialog
30 stars
1 forks
33 projects
Last updated: 20 Sep 2024