https://github.com/intelligentsandbox/stt
custom stt system
https://github.com/intelligentsandbox/stt
fastapi speech-to-text vad whisper
Last synced: about 1 month ago
JSON representation
custom stt system
- Host: GitHub
- URL: https://github.com/intelligentsandbox/stt
- Owner: IntelligentSandbox
- License: gpl-2.0
- Created: 2026-05-11T17:05:33.000Z (about 1 month ago)
- Default Branch: main
- Last Pushed: 2026-05-11T19:37:23.000Z (about 1 month ago)
- Last Synced: 2026-05-11T20:36:47.833Z (about 1 month ago)
- Topics: fastapi, speech-to-text, vad, whisper
- Language: Python
- Homepage: http://localhost:8002/
- Size: 22.5 KB
- Stars: 0
- Watchers: 0
- Forks: 1
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# STT
A FastAPI-based service for real-time speech-to-text using [faster-whisper](https://github.com/SYSTRAN/faster-whisper) and WebRTC VAD.
https://github.com/user-attachments/assets/83c49dcb-e263-43f4-96fc-1430650a5689
## Installation
```bash
# Install system requirements
sudo apt install portaudio19-dev
# Install python dependencies
python3 src/setup.py
source src/stt-venv/bin/activate
```
## Usage
**Start the service:**
```bash
cd src/
python app.py
```
**Python example:**
```python
import requests
with open("audio.wav", "rb") as f:
response = requests.post(
"http://localhost:8002/transcribe",
files={"file": f}
)
print(response.json()["text"])
```