https://github.com/deezer/real-cloned-singer-id
Repository for the ISMIR 2024 Paper "From Real to Cloned Singer Identification".
https://github.com/deezer/real-cloned-singer-id
deepfake-detection music-information-retrieval singer-id source-separation voice-cloning
Last synced: 3 months ago
JSON representation
Repository for the ISMIR 2024 Paper "From Real to Cloned Singer Identification".
- Host: GitHub
- URL: https://github.com/deezer/real-cloned-singer-id
- Owner: deezer
- Created: 2024-07-11T14:15:42.000Z (10 months ago)
- Default Branch: main
- Last Pushed: 2024-08-26T11:54:38.000Z (8 months ago)
- Last Synced: 2024-08-26T14:10:30.504Z (8 months ago)
- Topics: deepfake-detection, music-information-retrieval, singer-id, source-separation, voice-cloning
- Language: Python
- Homepage:
- Size: 354 KB
- Stars: 4
- Watchers: 5
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# From Real to Cloned Singer Identification
by [Dorian Desblancs](https://www.linkedin.com/in/dorian-desblancs), [Gabriel Meseguer-Brocal](https://www.linkedin.com/in/gabriel-meseguer-brocal-1032a42b), [Romain Hennequin](http://romain-hennequin.fr/En/index.html), and [Manuel Moussallam](https://mmoussallam.github.io/).
## About
This repository contains the code we used to train and evaluate the models from the paper. The cloned and closed datasets will unfortunately never be made public due to copyright concerns. However, the [FMA](https://github.com/mdeff/fma) and [MTG](https://mtg.github.io/mtg-jamendo-dataset/) splits used to evaluate our models can be found in the `splits.zip` file from the [Releases](https://github.com/deezer/real-cloned-singer-id/releases) section! We highly recommend you use them to evaluate your singer identification models!
## Getting Started
In order to explore our repository, one can start with the following:
```bash
# Clone and enter repository
git clone https://github.com/deezer/real-cloned-singer-id
cd real-cloned-singer-id# Build and run docker container with dependencies installed
make build
make run
```From there, you can expand upon or use the parts of this repo you need. The `foundation/` directory contains the base Transformer and Audio model, that are then used to create our embedding models. The code for training these backbone models can then be found in the `training/` directory. Finally, these embeddings are evaluated in the `evaluation/` section. Note that these are pickled for each track to accelerate singer identification result computation (see `evaluation/pickle_embeddings.py`).
## Disclaimer
This repository is not meant to run as is. It has been trimmed down a lot since most of the experimental setup cannot be made public. However, some interesting bits, such as the data pipelines for large-scale training or artist-level constrastive learning, are left here to serve as inspiration for future research in singer identification, music information retrieval, or even audio in general. Happy hacking!
## Reference
If you use this repository, please consider citing:
```
@article{desblancs2024real,
title={From Real to Cloned Singer Identification},
author={Desblancs, Dorian and Meseguer-Brocal, Gabriel and Hennequin, Romain and Moussallam, Manuel},
journal={arXiv preprint arXiv:2407.08647},
year={2024}
}
```Our paper can be found on [arXiv](https://arxiv.org/abs/2407.08647) 🌟 It will be presented at ISMIR 2024 🌉