Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/R2D2FISH/glados-tts
A GLaDOS TTS, using Forward Tacotron and HiFiGAN. Inference is fast and stable, even on the CPU. A low quality vocoder model is included for mobile use. Rudimentary TTS script included. Works perfectly on Linux, partially on Maybe someone smarter than me can make a GUI.
https://github.com/R2D2FISH/glados-tts
Last synced: 15 days ago
JSON representation
A GLaDOS TTS, using Forward Tacotron and HiFiGAN. Inference is fast and stable, even on the CPU. A low quality vocoder model is included for mobile use. Rudimentary TTS script included. Works perfectly on Linux, partially on Maybe someone smarter than me can make a GUI.
- Host: GitHub
- URL: https://github.com/R2D2FISH/glados-tts
- Owner: R2D2FISH
- License: mit
- Created: 2022-02-16T19:43:27.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2024-03-03T21:56:08.000Z (9 months ago)
- Last Synced: 2024-08-01T16:25:10.462Z (3 months ago)
- Language: Python
- Size: 34.2 KB
- Stars: 156
- Watchers: 7
- Forks: 84
- Open Issues: 10
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# GLaDOS Text-to-speech (TTS) Voice Generator
Neural network based TTS Engine.If you want to just play around with the TTS, this works as stand-alone.
```console
python3 glados-tts/glados.py
```the TTS Engine can also be used remotely on a machine more powerful then the Pi to process in house TTS: (executed from glados-tts directory
```console
python3 engine-remote.py
```Default port is 8124
Be sure to update settings.env variable in your main Glados-voice-assistant directory:
```
TTS_ENGINE_API = http://192.168.1.3:8124/synthesize/
```## Training (New Model)
The Tacotron and ForwardTacotron models were trained as multispeaker models on two datasets separated into three speakers. LJSpeech (13,100 lines), and then on the heavily modified version of the Ellen McClain dataset, separated into Portal 1 and 2 voices (with punctuation and corrections added manually). The lines from the end of Portal 1 after the cores get knocked off were counted as Portal 2 lines.## Training (Old Model)
The initial, regular Tacotron model was trained first on LJSpeech, and then on a heavily modified version of the Ellen McClain dataset (all non-Portal 2 voice lines removed, punctuation added).* The Forward Tacotron model was only trained on about 600 voice lines.
* The HiFiGAN model was generated through transfer learning from the sample.
* All models have been optimized and quantized.## Installation Instruction
If you want to install the TTS Engine on your machine, please follow the steps
below.1. Download the model files from [`Google Drive`](https://drive.google.com/file/d/1TRJtctjETgVVD5p7frSVPmgw8z8FFtjD/view?usp=sharing) and unzip into the repo folder
2. Install the required Python packages, e.g., by running `pip install -r
requirements.txt`