https://github.com/yjlolo/vae-audio

Variational auto-encoders for audio
https://github.com/yjlolo/vae-audio

Last synced: 2 months ago
JSON representation

Variational auto-encoders for audio

Host: GitHub
URL: https://github.com/yjlolo/vae-audio
Owner: yjlolo
License: mit
Created: 2019-05-22T04:16:20.000Z (about 6 years ago)
Default Branch: master
Last Pushed: 2020-05-20T14:31:27.000Z (about 5 years ago)
Last Synced: 2025-04-14T04:09:50.068Z (2 months ago)
Language: Python
Homepage:
Size: 57.4 MB
Stars: 120
Watchers: 5
Forks: 20
Open Issues: 2
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

        UPDATE (20.5.20): I decided to isolate the code for reproducing the paper [Learning Disentangled Representations of Timbre and Pitch for Musical Instrument Sounds Using Gaussian Mixture Variational Autoencoders](https://arxiv.org/pdf/1906.08152.pdf) (up from [here](https://github.com/yjlolo/gmvae-synth)) from this repo.

# vae-audio

For variational auto-encoders (VAEs) and audio/music lovers, based on PyTorch.

## Overview

The repo is under construction.

The project is built to facillitate research on using VAEs to model audio. It provides 

 - [x] [vanilla VAE](https://arxiv.org/abs/1312.6114)

 - [x] [Gaussian mixture VAE](https://arxiv.org/abs/1611.05148)

 - [ ] [vector-quantized VAE](https://arxiv.org/abs/1711.00937)

 - [ ] customizable model options

 - [x] audio feature extracton

 - [ ] model testing and latent space visualization

 - [ ] end-to-end audio feature extraction and model training

 - [ ] higher-level wrappers for easier use

 - [ ] easier installation

 - [ ] documentation

The project structure is based on [PyTorch Template](https://github.com/victoresque/pytorch-template).

## Requirements

* torch 1.1.0

* librosa 0.6.3

## Usage

### Audio Feature Extraction 

1. Define customized `Dataset` classes in `dataset/datasets.py`

2. Run `python dataset/audio_transform.py -c your_config_of_audio_transform.json` to compute audio features (e.g., spectrograms)

3. Define customized `DataLoader` classes in `data_loader/data_loaders.py`

### Model Training

Run `python train.py -c your_config_of_model_train.json`

## To Be Continued

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/yjlolo/vae-audio

Awesome Lists containing this project

README