https://github.com/pxaris/lyra-dataset


Host: GitHub
URL: https://github.com/pxaris/lyra-dataset
Owner: pxaris
Created: 2022-08-25T17:49:56.000Z (over 2 years ago)
Default Branch: main
Last Pushed: 2023-06-27T07:55:38.000Z (over 1 year ago)
Last Synced: 2023-07-19T09:42:09.786Z (over 1 year ago)
Language: Python
Size: 5.52 MB
Stars: 11
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md


# The Lyra Dataset

**Lyra** is a dataset of Greek Traditional and Folk music that includes 1570 pieces, totaling around 80 hours of data. The dataset provides timestamped YouTube links for retrieving audio and video, along with rich metadata on instrumentation, geography, and genre, among others.

## Mel-spectrograms

The mel-spectrograms of the 1570 pieces, generated with the parameters:
```
audio sampling rate (sr): 16000
length of the FFT window (n_fft): 512
number of samples between successive frames (hop_length): 256
number of mel filterbanks (n_mels): 128
minimum frequency (f_min): 0.0
maximum frequency (f_max): 8000
```
can be **downloaded** at: [`mel-spectrograms.zip`](https://drive.google.com/file/d/1xMQIbi7Rkugr0gwtitIDSf2yQHbb-L4v/view?usp=sharing) (7.8 GB)
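
As a rough illustration of how these parameters map onto code, the sketch below computes a mel-spectrogram with librosa. The README does not state which library or amplitude scaling produced the released files (the `f_min`/`f_max` names resemble torchaudio's), so the use of librosa, the helper name `compute_mel`, and the lack of any log scaling are all assumptions. Arrays produced this way may not match the downloaded ones exactly.

```python
# Parameters copied from the list above, using librosa's keyword names
# (fmin/fmax instead of f_min/f_max).
MEL_PARAMS = {
    "sr": 16000,
    "n_fft": 512,
    "hop_length": 256,
    "n_mels": 128,
    "fmin": 0.0,
    "fmax": 8000,
}


def compute_mel(audio_path):
    """Return a (n_mels, n_frames) power mel-spectrogram for one audio file.

    Hypothetical helper: it is not part of the repository.
    """
    # Imported lazily so the parameter table is usable without librosa installed.
    import librosa

    # librosa.load resamples to the requested rate and mixes down to mono.
    y, sr = librosa.load(audio_path, sr=MEL_PARAMS["sr"])
    return librosa.feature.melspectrogram(
        y=y,
        sr=sr,
        n_fft=MEL_PARAMS["n_fft"],
        hop_length=MEL_PARAMS["hop_length"],
        n_mels=MEL_PARAMS["n_mels"],
        fmin=MEL_PARAMS["fmin"],
        fmax=MEL_PARAMS["fmax"],
    )
```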

---

The mel-spectrograms used in the paper that introduced the dataset, generated with the parameters:
```
audio sampling rate (sr): 8000
length of the FFT window (n_fft): 400
number of samples between successive frames (hop_length): 400
number of mel filterbanks (n_mels): 128
```
can be **downloaded** at: [`mel-spectrograms_initial.zip`](https://drive.google.com/file/d/10SH2gfYSf_qUVPUHA3O1lgq4C24k38I0/view?usp=sharing) (2.1 GB)
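
To make the difference between the two parameter sets concrete, the following sketch works out the time resolution each one implies. It is plain arithmetic on the values listed above. The function name `frame_stats` is illustrative and not part of the repository.

```python
def frame_stats(sr, n_fft, hop_length):
    """Derive time-resolution figures from STFT parameters."""
    return {
        "window_ms": 1000 * n_fft / sr,       # duration of one FFT window
        "hop_ms": 1000 * hop_length / sr,     # step between successive frames
        "frames_per_second": sr / hop_length,
        "overlap": 1 - hop_length / n_fft,    # fraction shared by adjacent windows
        "nyquist_hz": sr / 2,                 # highest representable frequency
    }


current = frame_stats(sr=16000, n_fft=512, hop_length=256)
initial = frame_stats(sr=8000, n_fft=400, hop_length=400)

print(current)
# {'window_ms': 32.0, 'hop_ms': 16.0, 'frames_per_second': 62.5, 'overlap': 0.5, 'nyquist_hz': 8000.0}
print(initial)
# {'window_ms': 50.0, 'hop_ms': 50.0, 'frames_per_second': 20.0, 'overlap': 0.0, 'nyquist_hz': 4000.0}
```

So the newer spectrograms have roughly three times as many frames per second (62.5 versus 20) and cover frequencies up to 8 kHz rather than 4 kHz, which is consistent with the larger download size.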

## Structure

### Data files in [`data`](data)

- `raw.tsv` - raw file with all metadata

- `split/` - training and test set split
  - `training.tsv` - raw file with all metadata of the samples used for training
  - `test.tsv` - raw file with all metadata of the test set samples

- `metadata-information/` - information about the metadata
  - `genres_hierarchy.json` - hierarchical relationships between all genres
  - `places_coordinates.json` - coordinates of each place
  - `places_hierarchy.json` - hierarchical relationships between places
  - `vocabulary.json` - vocabulary with definitions of the terms that appear in the dataset

- `mel-spectrograms/` - the mel-spectrograms of all music pieces following the naming convention `{id}.npy`
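
The layout above can be navigated with the standard library alone. The sketch below reads one of the TSV files and builds the expected spectrogram path for a piece. It assumes the files are UTF-8 encoded with a header row. The helper names and the default `data/mel-spectrograms` root are illustrative, and the name of the column that holds the piece `id` should be checked against the actual header.

```python
import csv
from pathlib import Path


def read_metadata(tsv_path):
    """Read a tab-separated metadata file into a list of dicts keyed by the header row."""
    with open(tsv_path, newline="", encoding="utf-8") as handle:
        return list(csv.DictReader(handle, delimiter="\t"))


def spectrogram_path(piece_id, root="data/mel-spectrograms"):
    """Build the path of a piece's mel-spectrogram from the `{id}.npy` naming convention."""
    return Path(root) / f"{piece_id}.npy"
```

For example, `read_metadata("data/split/training.tsv")` returns one dictionary per training sample, and the keys of its first element show the available metadata columns.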

## Using the trained models for [`inference`](inference)

### Requirements

* [FFmpeg](https://ffmpeg.org/download.html)
* Python 3.8 or later
* Create a virtual environment and install the requirements:
```bash
python -m venv venv
source venv/bin/activate
pip install -r requirements.txt
```
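
Before running inference, it can help to confirm that the first two requirements are met. The check below is a minimal sketch using only the standard library. The function name `check_requirements` is hypothetical and not part of the repository.

```python
import shutil
import sys


def check_requirements(min_python=(3, 8)):
    """Report whether the interpreter is recent enough and where ffmpeg is found."""
    return {
        # version_info compares element-wise against a (major, minor) tuple
        "python_ok": sys.version_info >= min_python,
        # Absolute path of the ffmpeg executable on PATH, or None if it is missing
        "ffmpeg_path": shutil.which("ffmpeg"),
    }


status = check_requirements()
if not status["python_ok"]:
    print("Python 3.8 or later is required")
if status["ffmpeg_path"] is None:
    print("FFmpeg was not found on PATH")
```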

### Get inference results

1. Download the trained models from [here](https://drive.google.com/file/d/1nFu3G4WX8OAtiSX4UlkqI0KvfW2FIn0u/view?usp=sharing) and put them under the `models/` directory.
2. Place an `input.wav` file under `inference/`, or use a different name and adjust `INPUT_FILE` in `run_inference.py` accordingly.
3. Run: `python inference/run_inference.py`
4. The inference results will be printed in the terminal.
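
If the audio to classify is not already a WAV file, FFmpeg can convert it for step 2. The sketch below builds such a command. The mono, 16 kHz output is an assumption based on the mel-spectrogram parameters listed earlier, and `run_inference.py` may well handle resampling on its own. Both helper names are hypothetical.

```python
import subprocess


def build_ffmpeg_command(source, target="inference/input.wav", sample_rate=16000):
    """Return an ffmpeg argument list that converts `source` to a mono WAV file."""
    return [
        "ffmpeg",
        "-y",                     # overwrite the target if it already exists
        "-i", str(source),        # input file in any format ffmpeg can decode
        "-ac", "1",               # mix down to a single channel (assumed)
        "-ar", str(sample_rate),  # resample to the given rate (assumed 16 kHz)
        str(target),
    ]


def convert_to_wav(source, **kwargs):
    """Run the conversion, raising CalledProcessError if ffmpeg fails."""
    subprocess.run(build_ffmpeg_command(source, **kwargs), check=True)
```

Calling `convert_to_wav("song.mp3")` from the repository root would then write `inference/input.wav`, ready for `python inference/run_inference.py`.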

## Citing the dataset

Please consider citing [the following publication](https://arxiv.org/abs/2211.11479v1) when using the dataset:

> C. Papaioannou, I. Valiantzas, T. Giannakopoulos, M. Kaliakatsos-Papakostas and A. Potamianos, "A Dataset for Greek Traditional and Folk Music: Lyra", in Proc. of the 23rd Int. Society for Music Information Retrieval Conf., Bengaluru, India, 2022.

## License

- The metadata is licensed under [CC BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/).