Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/salu133445/lakh-pianoroll-dataset
A collection of 174,154 multi-track piano-rolls
https://github.com/salu133445/lakh-pianoroll-dataset
multitrack music music-generation music-information-retrieval pianoroll
Last synced: 3 days ago
JSON representation
A collection of 174,154 multi-track piano-rolls
- Host: GitHub
- URL: https://github.com/salu133445/lakh-pianoroll-dataset
- Owner: salu133445
- License: mit
- Created: 2018-04-10T06:53:11.000Z (over 6 years ago)
- Default Branch: main
- Last Pushed: 2024-03-16T06:35:27.000Z (7 months ago)
- Last Synced: 2024-05-02T03:16:36.942Z (5 months ago)
- Topics: multitrack, music, music-generation, music-information-retrieval, pianoroll
- Language: Python
- Homepage: https://salu133445.github.io/lakh-pianoroll-dataset/
- Size: 1.52 MB
- Stars: 78
- Watchers: 3
- Forks: 9
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- Funding: .github/FUNDING.yml
- License: LICENSE
Awesome Lists containing this project
README
# Source Code for Deriving Lakh Pianoroll Dataset (LPD)
> The derived dataset using the default settings is available
[here](https://salu133445.github.io/lakh-pianoroll-dataset/dataset).1. Download Lakh MIDI Dataset (LMD) with the following script.
```sh
./scripts/download_lmd.sh
```(Or, download it manually [here](http://colinraffel.com/projects/lmd/).)
2. Set the variables `LMD_ROOT` and `LPD_ROOT` in `run.sh` and variables in
`config.py` to proper values.
3. Derive all subsets and versions of LPD, `matched_ids.txt` and
`cleansed_ids.txt` with the following script.```sh
./scripts/derive_lpd.sh
```## Derive the labels for the LPD
> The derived labels can be found at `data/labels.tar.gz`.
1. Download the labels with the following script.
```sh
./scripts/download_labels.sh
```2. Derive the labels with the following script.
```sh
./scripts/derive_labels.sh
```## Synthesize audio files for the LPD
1. Install [GNU Parallel](https://www.gnu.org/software/parallel/) to run the
synthesizer in parallel mode.
2. Synthesize audio files from multitrack pianorolls with the following script.```sh
./scripts/batch_synthesize.sh ./data/lpd/lpd/lpd_cleansed/ \
./data/synthesized/lpd_cleansed 20
```(The above command will synthesize all the multitrack pianorolls in
the _LPD-cleansed_ subset with 20 parallel jobs.)