Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/nitely/speech-to-text

Speech-To-Text experiments
https://github.com/nitely/speech-to-text

Last synced: 28 days ago
JSON representation

Speech-To-Text experiments

Awesome Lists containing this project

README

        

Speech-To-Text experiments; nothing to see here; shhhh

## What?

- Get audio from device (mic or desktop) or audio file
- Normalize the audio to 16kHz mono
- Get the speech segments with VAD with a min/max duration
- Get the transcript of the segment with Whisper
- Ideally with real-time stream support. Albeit whisper.cpp works
best on 30s segments

## What for?

- Voice comms
- Transcriptions
- Closed Caption
- Real-time translation
- Capture system output speech to text

## Why?

Learnings.

## License

Unlicense. You cannot use this or even look at it. Move along.