https://github.com/sotelo/parrot

RNN-based generative models for speech.
https://github.com/sotelo/parrot

blocks deep-learning recurrent-neural-networks speech-synthesis theano

Last synced: 9 months ago
JSON representation

RNN-based generative models for speech.

Host: GitHub
URL: https://github.com/sotelo/parrot
Owner: sotelo
Created: 2015-06-17T16:08:58.000Z (about 11 years ago)
Default Branch: master
Last Pushed: 2017-06-23T12:41:46.000Z (about 9 years ago)
Last Synced: 2025-04-11T14:48:22.973Z (about 1 year ago)
Topics: blocks, deep-learning, recurrent-neural-networks, speech-synthesis, theano
Language: Python
Size: 23.2 MB
Stars: 611
Watchers: 68
Forks: 108
Open Issues: 7
Metadata Files:
- Readme: README.MD

Awesome Lists containing this project

README

# Speech synthesis using recurrent neural networks.

This repo has the code for our ICLR submission:

Jose Sotelo, Soroush Mehri, Kundan Kumar, João Felipe Santos, Kyle Kastner, Aaron Courville, Yoshua Bengio. Char2Wav: End-to-End Speech Synthesis.

The website is [here](http://www.josesotelo.com/speechsynthesis).

NOTE: The code is currently being refactored/cleaned/documented. We wanted to make it available as soon as possible but we know well that the current version is not ready for replication. Therefore, if you're interested in this, please come back later. Alternatively, you can send me an email and I will let you know when it's ready.

NOTE(2): The code for the neural vocoder is based on [sampleRNN](https://github.com/soroushmehr/sampleRNN_ICLR2017).

Updates:
* SampleRNN modules added
* End-to-end model can generate more than 4s of audio in 1s on p6000 gpu. (This is achieved by generating 200 samples of length 10s in a single batch. Total time taken for this is 448 seconds.)

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/sotelo/parrot

Awesome Lists containing this project

README