Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
awesome-self-supervised-multimodal-learning
A curated list of self-supervised multimodal learning resources.
https://github.com/ys-zong/awesome-self-supervised-multimodal-learning
Last synced: 3 days ago
JSON representation
-
Applications
-
Healthcare
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper - group/CoMIR)
- [paper - pytorch)
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
-
State Representation Learning
-
Remote Sensing
-
Machine Translation
-
Auto-driving
-
Robotics
-
-
Summary of Common Multimodal Datasets
-
Image-Text Datasets
- Link - xirong/flickr8kcn)|
- Github
- Link - vcr)|
- Details - ml/SNLI-VE)|[Github](https://github.com/necla-ml/SNLI-VE)|
- Link
- Link
- Link - caption)|
- Link - |
- Link
- Link
- Link - |
- Link
- Link - research-datasets/conceptual-captions)|
- Link
- Link - |
- Link - |
- Link - |
- Link - Med-2019)|
- Link
- Link - lab/nlvr)|
- Link - |
- Link
- Link
-
Image-Text-Audio Datasets
-
Video-Text Datasets
-
Video-Audio Datasets
-
Point Cloud Datasets
-
Image-Ridar Datasets
-
-
Objectives
-
Masked Prediction
-
Instance Discrimination
- [paper
- [paper - research/tree/master/mmv)
- [paper - NCE_HowTo100M)
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper
- [paper - research/big_vision)
- [paper
- [paper
- [paper
- [paper - science/crossmodal-contrastive-learning)
- [paper
- [paper
- [paper - research/google-research/tree/master/vatt)
- [paper
- [paper - morgado/AVSpatialAlignment)
- [paper
- [paper
- [paper - Part-of-Speech-Embeddings)
- [paper
- [paper
- [paper
- [paper - CMA)
- [paper
- [paper
- [paper
- [paper - of-Pixels)
- [paper
- [paper
- [paper
-
Clustering
-
Hybrid
-
-
Related Survey Papers
-
Challenges
Programming Languages
Categories
Sub Categories
Healthcare
128
Instance Discrimination
33
Image-Text Datasets
23
Hybrid
22
Robustness/Fairness
14
Video-Text Datasets
14
Masked Prediction
13
Video-Audio Datasets
10
State Representation Learning
7
Remote Sensing
6
Clustering
6
Machine Translation
5
Auto-driving
5
Robotics
5
Image-Ridar Datasets
3
Point Cloud Datasets
3
Resources
3
Image-Text-Audio Datasets
2