Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
multimodal-ml-music
List of academic resources on Multimodal ML for Music
https://github.com/ilaria-manco/multimodal-ml-music
Papers
Journal and Conference Papers
- Interpreting Song Lyrics with an Audio-Informed Pre-trained Language Model
- Conversational Music Retrieval with Synthetic Data
- Contrastive audio-language learning for music
- Learning music audio representations via weak language supervision
- MuLan: A joint embedding of music audio and natural language
- RECAP: Retrieval Augmented Music Captioner
- Data-Efficient Playlist Captioning With Musical and Linguistic Knowledge
- CLAP: Learning audio concepts from natural language supervision
- Toward Universal Text-to-Music Retrieval
- MusCaps: Generating Captions for Music Audio
- Music Playlist Title Generation: A Machine-Translation Approach
- MusicBERT - learning multi-modal representations for music and text
- Music autotagging as captioning
- Deep cross-modal correlation learning for audio and lyrics in music retrieval
- Music mood detection based on audio and lyrics with deep neural net
- Exploring customer reviews for music genre classification and evolutionary studies
- Towards Music Captioning: Generating Music Playlist Descriptions
- Multimodal Music Mood Classification using Audio and Lyrics
- Cross-Modal Music Retrieval and Applications: An Overview of Key Methodologies
- Multimodal music information processing and retrieval: Survey and future challenges
- Träumerai: Dreaming music with StyleGAN
- Learning Affective Correspondence between Music and Image
- The Sound of Pixels
- Image generation associated with music data
- It's Time for Artistic Correspondence in Music and Video
- Audio-visual embedding for cross-modal music video retrieval through supervised deep CCA
- Learning Contextual Tag Embeddings for Cross-Modal Alignment of Audio and Tags
- Multimodal metric learning for tag-based music retrieval
- Enriched music representations with multiple cross-modal contrastive learning
- Large-Scale Weakly-Supervised Content Embeddings for Music Recommendation and Tagging
- Music gesture for visual sound separation
- Foley music: Learning to generate music from videos
- Musical word embedding: Bridging the gap between listening contexts and music
- Query-by-Blending: a Music Exploration System Blending Latent Vector Representations of Lyric Word, Song Audio, and Artist
- Creating a Multitrack Classical Music Performance Dataset for Multimodal Music Analysis: Challenges, Insights, and Applications
- Multimodal Deep Learning for Music Genre Classification
- JTAV: Jointly Learning Social Media Content Representation by Fusing Textual, Acoustic, and Visual Features
- Learning neural audio embeddings for grounding semantics in auditory perception
- Cross-modal Sound Mapping Using Deep Learning
- Music emotion recognition: From content- to context-based models
- MusiCLEF: A benchmark activity in multimodal music information retrieval
- Combining audio content and social context for semantic music discovery
- CBVMR: Content-based video-music retrieval using soft intra-modal structure constraint
Datasets
Workshops, Tutorials & Talks
Statistics & Visualisations
- Yann Bayle - deep-learning-music/blob/master/reproducibility.md).