https://github.com/hadware/logorrhea

Using RNNs to generate speech from phonemized text
https://github.com/hadware/logorrhea

Last synced: about 1 year ago
JSON representation

Using RNNs to generate speech from phonemized text

Host: GitHub
URL: https://github.com/hadware/logorrhea
Owner: hadware
Created: 2017-06-18T15:57:51.000Z (about 9 years ago)
Default Branch: master
Last Pushed: 2018-01-22T02:17:57.000Z (over 8 years ago)
Last Synced: 2025-03-25T17:51:31.428Z (over 1 year ago)
Language: Jupyter Notebook
Size: 2.91 MB
Stars: 3
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# Logorrhea

Using the latest advance in Deep Learning and two old TTS softwares to generate what should look like speech.

Mostly inspired by [Andrew Karpathy's article](http://karpathy.github.io/2015/05/21/rnn-effectiveness/), and [this tutorial](https://github.com/spro/practical-pytorch/blob/master/char-rnn-generation/char-rnn-generation.ipynb) from the PyTorch library.
Basically, instead of doing the generation at character-level, I'm doing it at the phoneme level.
Espeak takes care of phonemizing the input text with some very efficient rules, and mbrola takes the generated phonems to make actual sound.

Also, uses my own [Voxpopuli](https://github.com/hadware/voxpopuli) library to do the phonemization and sound rendering stuff.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/hadware/logorrhea

Awesome Lists containing this project

README