https://github.com/KittenML/KittenTTS
State-of-the-art TTS model under 25MB 😻
Last synced: 3 months ago
- Host: GitHub
- URL: https://github.com/KittenML/KittenTTS
- Owner: KittenML
- License: apache-2.0
- Created: 2025-08-05T03:30:24.000Z (3 months ago)
- Default Branch: main
- Last Pushed: 2025-08-05T03:34:42.000Z (3 months ago)
- Last Synced: 2025-08-05T05:24:51.651Z (3 months ago)
- Language: Python
- Homepage:
- Size: 11.7 KB
- Stars: 26
- Watchers: 1
- Forks: 3
- Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- StarryDivineSky - KittenML/KittenTTS
- awesome-tts-colab - GitHub Link
- awesome-starred - KittenML/KittenTTS - State-of-the-art TTS model under 25MB 😻 (Python)
- my-awesome-github-stars - KittenML/KittenTTS - State-of-the-art TTS model under 25MB 😻 (Python)
- ai-game-devtools - Kitten TTS - Open-source realistic text-to-speech model with just 15 million parameters, designed for lightweight deployment and high-quality voice synthesis. (Speech)
README
# Kitten TTS 😻
Kitten TTS is an open-source realistic text-to-speech model with just 15 million parameters, designed for lightweight deployment and high-quality voice synthesis.
*Currently in developer preview*
[Join our Discord](https://discord.gg/upcyF5s6)
## ✨ Features
- **Ultra-lightweight**: Model size less than 25MB
- **CPU-optimized**: Runs on CPU, with no GPU required
- **High-quality voices**: Several premium voice options available
- **Fast inference**: Optimized for real-time speech synthesis
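One way to check the real-time claim on your own hardware is to measure the real-time factor (RTF): synthesis time divided by the duration of the audio produced. A value below 1.0 means speech is generated faster than it plays back. The sketch below is not part of Kitten TTS. `real_time_factor` is a hypothetical helper that accepts any callable returning a 1-D sequence of samples, and it assumes the 24000 Hz sample rate used in the Basic Usage example.

```python
import time

SAMPLE_RATE = 24000  # Hz, taken from the Basic Usage example


def real_time_factor(synthesize, text, sample_rate=SAMPLE_RATE):
    """Return (synthesis seconds) / (audio seconds) for one call.

    `synthesize` is any callable mapping text to a 1-D sequence of samples.
    """
    start = time.perf_counter()
    audio = synthesize(text)
    elapsed = time.perf_counter() - start
    duration = len(audio) / sample_rate
    if duration == 0:
        raise ValueError("synthesize() returned no audio")
    return elapsed / duration


if __name__ == "__main__":
    # Stand-in synthesizer so the sketch runs without the model installed:
    # it returns one second of silence.
    def fake_synthesize(text):
        return [0.0] * SAMPLE_RATE

    print(f"RTF: {real_time_factor(fake_synthesize, 'hello'):.4f}")
```

With a loaded model, the callable could be something like `lambda t: m.generate(t, voice='expr-voice-2-f')`. Averaging over several sentences gives a steadier number than a single call.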
## 🚀 Quick Start
### Installation
```
pip install https://github.com/KittenML/KittenTTS/releases/download/0.1/kittentts-0.1.0-py3-none-any.whl
```
### Basic Usage
```
import soundfile as sf
from kittentts import KittenTTS

# Load the model
m = KittenTTS("KittenML/kitten-tts-nano-0.1")

# Available voices: 'expr-voice-2-m', 'expr-voice-2-f', 'expr-voice-3-m', 'expr-voice-3-f',
# 'expr-voice-4-m', 'expr-voice-4-f', 'expr-voice-5-m', 'expr-voice-5-f'
audio = m.generate("This high quality TTS model works without a GPU", voice='expr-voice-2-f')

# Save the audio as a WAV file at a 24 kHz sample rate
sf.write('output.wav', audio, 24000)
```
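To compare voices, the usage above can be wrapped in a loop that renders the same sentence once per voice. The following is a minimal sketch, not part of the package: `write_wav` and `render_all_voices` are hypothetical helpers introduced here. It writes files with Python's standard `wave` module rather than soundfile, and it assumes the generated audio is a 1-D sequence of float samples in [-1.0, 1.0], which the README does not state.

```python
import struct
import wave
from pathlib import Path

SAMPLE_RATE = 24000  # Hz, as in the example above
VOICES = [
    "expr-voice-2-m", "expr-voice-2-f", "expr-voice-3-m", "expr-voice-3-f",
    "expr-voice-4-m", "expr-voice-4-f", "expr-voice-5-m", "expr-voice-5-f",
]


def write_wav(path, samples, sample_rate=SAMPLE_RATE):
    """Write float samples (assumed to lie in [-1.0, 1.0]) as 16-bit mono PCM."""
    # Clip each sample to the assumed range, then scale to signed 16-bit.
    pcm = [int(max(-1.0, min(1.0, float(s))) * 32767) for s in samples]
    with wave.open(str(path), "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)  # 2 bytes per sample = 16-bit
        wav.setframerate(sample_rate)
        wav.writeframes(struct.pack(f"<{len(pcm)}h", *pcm))


def render_all_voices(generate, text, out_dir, voices=VOICES):
    """Call generate(text, voice=...) per voice and save one WAV file each."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    paths = []
    for voice in voices:
        path = out_dir / f"{voice}.wav"
        write_wav(path, generate(text, voice=voice))
        paths.append(path)
    return paths
```

With the model loaded as `m`, a call such as `render_all_voices(m.generate, "Hello from Kitten TTS", "samples")` would produce one file per voice in a `samples` directory, provided the output matches the assumptions above.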
## 💻 System Requirements
Works on essentially any device: the model runs on CPU and does not need a GPU.
## Checklist
- [x] Release a preview model
- [ ] Release the fully trained model weights
- [ ] Release mobile SDK
- [ ] Release web version