https://github.com/kinwaicheuk/audioloader
PyTorch Dataset for Speech and Music audio
https://github.com/kinwaicheuk/audioloader
audio dataset pytorch
Last synced: about 2 months ago
JSON representation
PyTorch Dataset for Speech and Music audio
- Host: GitHub
- URL: https://github.com/kinwaicheuk/audioloader
- Owner: KinWaiCheuk
- Created: 2021-06-19T00:36:38.000Z (almost 4 years ago)
- Default Branch: master
- Last Pushed: 2024-07-12T02:16:44.000Z (10 months ago)
- Last Synced: 2025-03-02T13:10:06.021Z (2 months ago)
- Topics: audio, dataset, pytorch
- Language: Python
- Homepage:
- Size: 202 KB
- Stars: 73
- Watchers: 3
- Forks: 13
- Open Issues: 4
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# AudioLoader
AudioLoader is a PyTorch dataset based on [torchaudio](https://pytorch.org/audio/stable/datasets.html). It contains a collection of datasets that are not available in [torchaudio](https://pytorch.org/audio/stable/datasets.html) yet.**Currently supported datasets:**
1. [Speech](./AudioLoader/speech/speech_README.md#Speech)
1. [Multilingual LibriSpeech (MLS)](./AudioLoader/speech/speech_README.md#Multilingual-LibriSpeech)
1. [TIMIT](./AudioLoader/speech/speech_README.md#TIMIT)
1. [SpeechCommands v2 (12 classes)](./AudioLoader/speech/speech_README.md#SpeechCommandsv2)
1. [Automatic Music Transcription (AMT)](./AudioLoader/music/amt_README.md#Automatic-Music-Transcription)
1. [MAPS](./AudioLoader/music/amt_README.md#maps)
1. [MusicNet](./AudioLoader/music/amt_README.md#musicnet)
1. [MAESTRO](./AudioLoader/music/amt_README.md#maestro)
1. [Music Source Separation (MSS)](./AudioLoader/music/mss/mss_README.md#Music-Source-Separation)
1. [FastMUSDB](./AudioLoader/music/mss/mss_README.md#FastMUSDB)
1. [MusdbHQ](./AudioLoader/music/mss/mss_README.md#MusdbHQ)
## Example code
A complete example code is available in this [repository](https://github.com/KinWaiCheuk/pytorch_template). The following pseudo code shows the general idea of how to apply AudioLoader to your existing code.```python
from AudioLoader.speech import TIMIT
from torch.utils.data import DataLoader# AudioLoader helps you to set up supported datasets
dataset = TIMIT('./YourFolder',
split='train',
groups='all',
download=True)
train_loader = DataLoader(dataset,
batch_size=4)# Pass the dataset to you
model = MyModel()
trainer = pl.Trainer()
trainer.fit(model, train_loader)```
## Installation
`pip install git+https://github.com/KinWaiCheuk/AudioLoader.git`## News & Changelog
**version 0.0.3** (10 Sep 2021):
1. Replace broken links with a working links for `MAPS` and `TIMIT`
1. Remove the slience indicators in the phonemic labels for TIMIT