https://github.com/intelligentsandbox/stt

custom stt system
https://github.com/intelligentsandbox/stt

fastapi speech-to-text vad whisper

Last synced: about 2 months ago
JSON representation

custom stt system

Host: GitHub
URL: https://github.com/intelligentsandbox/stt
Owner: IntelligentSandbox
License: gpl-2.0
Created: 2026-05-11T17:05:33.000Z (about 2 months ago)
Default Branch: main
Last Pushed: 2026-05-11T19:37:23.000Z (about 2 months ago)
Last Synced: 2026-05-11T20:36:47.833Z (about 2 months ago)
Topics: fastapi, speech-to-text, vad, whisper
Language: Python
Homepage: http://localhost:8002/
Size: 22.5 KB
Stars: 0
Watchers: 0
Forks: 1
Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

          # STT

A FastAPI-based service for real-time speech-to-text using [faster-whisper](https://github.com/SYSTRAN/faster-whisper) and WebRTC VAD.

https://github.com/user-attachments/assets/83c49dcb-e263-43f4-96fc-1430650a5689

## Installation

```bash

# Install system requirements

sudo apt install portaudio19-dev

# Install python dependencies

python3 src/setup.py

source src/stt-venv/bin/activate

```

## Usage

**Start the service:**

```bash

cd src/

python app.py

```

**Python example:**

```python

import requests

with open("audio.wav", "rb") as f:

    response = requests.post(

        "http://localhost:8002/transcribe",

        files={"file": f}

    )

    print(response.json()["text"])

```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/intelligentsandbox/stt

Awesome Lists containing this project

README