https://github.com/wayne391/symbolic-music-datasets

:musical_keyboard: symbolic musical datasets

dataset music-information-retrieval


# List of Symbolic Musical Datasets

This repository collects publicly accessible symbolic music datasets from the web.
In general, each dataset is organized as follows:
* archive: samples from the dataset
* utils: codes for crawling or processing

## Contents
* Piano Roll
* Lead Sheets
* MIDI
* MISC

---

## Piano Roll
### 5-track piano-roll dataset
![image](https://github.com/wayne391/List-of-Symbolic-Musical-Datasets/blob/master/docs/5-track_pianoroll.PNG)

This dataset is derived from [LPD](https://github.com/salu133445/lakh-pianoroll-dataset) with a new preprocessing policy.
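
For readers new to the format, a piano-roll can be thought of as one time-by-pitch matrix per track. The sketch below is only an illustration of that idea, not the preprocessing used for this dataset: the `Note` tuple layout, the function names, the track names, and the step resolution are all assumptions made for the example.

```python
from typing import Dict, List, Tuple

# Assumed note layout: (start_step, end_step, pitch), end_step exclusive,
# pitch given as a MIDI note number in the range 0-127.
Note = Tuple[int, int, int]

NUM_PITCHES = 128


def notes_to_pianoroll(notes: List[Note], num_steps: int) -> List[List[int]]:
    """Return a num_steps x 128 binary matrix.

    roll[t][p] is 1 while pitch p sounds at time step t, else 0.
    """
    roll = [[0] * NUM_PITCHES for _ in range(num_steps)]
    for start, end, pitch in notes:
        if not 0 <= pitch < NUM_PITCHES:
            raise ValueError(f"pitch out of MIDI range: {pitch}")
        # Clip the note to the available time span.
        for t in range(max(start, 0), min(end, num_steps)):
            roll[t][pitch] = 1
    return roll


def tracks_to_pianorolls(
    tracks: Dict[str, List[Note]], num_steps: int
) -> Dict[str, List[List[int]]]:
    """Build one piano-roll per named track."""
    return {name: notes_to_pianoroll(notes, num_steps) for name, notes in tracks.items()}


# Hypothetical two-track excerpt (track names are illustrative only).
tracks = {
    "piano": [(0, 4, 60), (0, 4, 64), (4, 8, 67)],
    "bass": [(0, 8, 36)],
}
rolls = tracks_to_pianorolls(tracks, num_steps=8)
```

A multi-track dataset would hold one such matrix per track, so the same code extends to five tracks by adding entries to the `tracks` dictionary.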

### lead sheet dataset
![image](https://github.com/wayne391/List-of-Symbolic-Musical-Datasets/blob/master/docs/hey_jude_chorus.PNG)

This dataset is derived from [Theorytab]. It could also be combined with other lead sheet datasets. For details, please refer to this [repo](https://github.com/wayne391/Lead-Sheet-Analysis/tree/master/lead_sheet_dataset).

---

## Lead Sheets
Each lead sheet consists of one melody track accompanied by one chord track.
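
To make the melody-plus-chords structure concrete, here is a minimal in-memory sketch of a lead sheet. It does not reflect the XML or MIDI schemas of the datasets listed below: the class names, the field names, and the choice of beats as the time unit are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class MelodyNote:
    onset: float     # position in beats (assumed time unit)
    duration: float  # length in beats
    pitch: int       # MIDI note number


@dataclass
class ChordEvent:
    onset: float     # position in beats
    duration: float  # length in beats
    symbol: str      # chord label such as "C" or "G7"


@dataclass
class LeadSheet:
    melody: List[MelodyNote]
    chords: List[ChordEvent]

    def chord_at(self, beat: float) -> Optional[str]:
        """Return the chord symbol sounding at the given beat, if any."""
        for chord in self.chords:
            if chord.onset <= beat < chord.onset + chord.duration:
                return chord.symbol
        return None

    def harmonized_melody(self) -> List[Tuple[int, Optional[str]]]:
        """Pair each melody pitch with the chord active at its onset."""
        return [(note.pitch, self.chord_at(note.onset)) for note in self.melody]


# Hypothetical four-note phrase over three chords.
sheet = LeadSheet(
    melody=[
        MelodyNote(0, 1, 64),
        MelodyNote(1, 1, 62),
        MelodyNote(2, 2, 60),
        MelodyNote(4, 2, 67),
    ],
    chords=[ChordEvent(0, 2, "C"), ChordEvent(2, 2, "F"), ChordEvent(4, 4, "G")],
)
pairs = sheet.harmonized_melody()
```

Keeping the two tracks as separate event lists and aligning them on demand mirrors the "one melody track, one chord track" description. A real loader would fill these lists from the dataset files.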

### Crawled Datasets
| Source | Genre | Format | Chord | Melody | Songs | Src |
|-----------------------|:----------:|:------:|:-----:|:------:|:------:|:----:|
| [Theorytab] | pop | XML | V | V | 10148 | [O](https://drive.google.com/file/d/13AEVD9xaZIaicEgd8tF1l6aOiRTymJxL/view?usp=sharing) |
| [Wikifonia] | pop | XML | V | V | 6675 | [O](https://drive.google.com/file/d/155FZ9Uq7QLySv9y2bAtk5LD37XZDo0DF/view?usp=sharing) |
| [Hymnal] | hymn | MIDI | Δ | V | 3358 | [O](https://drive.google.com/drive/folders/1fP9OmQa9amz-nwaaaITggCEWs3ewz1_8?usp=sharing) |

#### Links

* WJazzD: http://jazzomat.hfm-weimar.de/dbformat/dboverview.html
* A MIDI version of the Theorytab dataset is now available: [Link](https://drive.google.com/file/d/1K1t8L9IRTHnQ1ozRIMRGEyxk_yhN6kLr/view?usp=sharing).

---

## MIDI
### Crawled Datasets
| Source | Genre | Multi-track | Format | Songs | Src |
|-----------------------|:----------:|:-----------:|:------:|:------:|:---:|
| [VGMusic](https://www.vgmusic.com) | game | V | MIDI | 28419 | [O](https://drive.google.com/drive/folders/1IW83MmH-RJ81yog6sbOUOTHimobE4FuK?usp=sharing) |
| [Doug McKenzie Jazz] | jazz | V | MIDI | 297 | [O](https://drive.google.com/drive/folders/1wVVDpcov5VV6Govhn1-CT0BOifqoF-Od?usp=sharing) |
| [Piano-e-Competition] | classical | | MIDI | 1573 | [O](https://drive.google.com/drive/folders/17yAGt3AR6txSZv8DBcbAbT3luTMkrkIb?usp=sharing) |
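
When browsing downloaded files, a quick way to see whether a MIDI file contains several tracks is to read its header chunk. The sketch below uses only the Python standard library and follows the Standard MIDI File header layout (`MThd`, a 4-byte length, then three big-endian 16-bit fields). The function and class names are illustrative and not part of this repository. Note also that the track count includes any tempo or meta track, so it is only a rough proxy for the "Multi-track" column above.

```python
import struct
from typing import NamedTuple


class MidiHeader(NamedTuple):
    midi_format: int  # 0 = single track, 1 = simultaneous tracks, 2 = independent patterns
    num_tracks: int   # number of track chunks declared in the header
    division: int     # raw timing field (ticks per quarter note when the top bit is 0)


def parse_midi_header(data: bytes) -> MidiHeader:
    """Parse the 14-byte header chunk at the start of a Standard MIDI File."""
    if len(data) < 14:
        raise ValueError("data too short to contain a MIDI header")
    chunk_type, length = struct.unpack(">4sI", data[:8])
    if chunk_type != b"MThd" or length < 6:
        raise ValueError("not a Standard MIDI File")
    midi_format, num_tracks, division = struct.unpack(">HHH", data[8:14])
    return MidiHeader(midi_format, num_tracks, division)


def read_midi_header(path: str) -> MidiHeader:
    """Read only the header bytes of the file at the given path."""
    with open(path, "rb") as handle:
        return parse_midi_header(handle.read(14))


# In-memory example: a format-1 header declaring 3 tracks at 480 ticks per quarter note.
example = b"MThd" + struct.pack(">IHHH", 6, 1, 3, 480)
header = parse_midi_header(example)
```

For real files, `read_midi_header("some_file.mid")` (a placeholder path) returns the same structure without loading the whole file.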

### Online Resources
#### Jazz
* [profesordepiano](http://www.profesordepiano.com/Real%20Book/Realbook.htm?fbclid=IwAR09XcuMD6PMEyUFq0gXAIVFsJVPw8uQSXq5s-o46JFv7OlYVQnwArFOmSk)
* [minor9](http://bhs.minor9.com)

#### Drum
* [Groove MIDI Dataset (Magenta)](https://magenta.tensorflow.org/datasets/groove)

#### MIDI Man (on Reddit)
* [Midi Man](https://www.reddit.com/r/WeAreTheMusicMakers/comments/3anwu8/the_drum_percussion_midi_archive_800k/)
* https://www.reddit.com/r/WeAreTheMusicMakers/comments/3ajwe4/the_largest_midi_collection_on_the_internet/

#### Full-scale
* [midiworld](http://www.midiworld.com)
* [Lakh MIDI dataset](http://colinraffel.com/projects/lmd/)

---

## MISC
### Unchecked
* http://www.musicstudents.com/jam.html (backing tracks and chord charts)
* https://www.cs.hmc.edu/~keller/jazz/
* http://www.ralphpatt.com/Song.html
* http://www.saxuet.qc.ca/TheSaxyPage/midi.htm
* http://www.thejazzpage.de/index1.html
* http://cjam.lassecollin.se/
* http://www.jazzpla.net/jazznote3000.htm

[Theorytab]: https://www.hooktheory.com/theorytab
[Hymnal]: https://www.hymnal.net/en/home
[Wikifonia]: http://www.wikifonia.org/
[Piano-e-Competition]: http://www.piano-e-competition.com
[VGMdb]: https://www.vgmusic.com
[Doug McKenzie Jazz]: http://bushgrafts.com/wp/