Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/ajxv/rtstt

Real time speech to text transcription using OpenAi whisper
https://github.com/ajxv/rtstt

live-transcription openai openai-whisper python3 transcription whisper

Last synced: about 1 month ago
JSON representation

Real time speech to text transcription using OpenAi whisper

Host: GitHub
URL: https://github.com/ajxv/rtstt
Owner: ajxv
Created: 2024-11-04T10:47:34.000Z (3 months ago)
Default Branch: main
Last Pushed: 2024-11-06T17:59:01.000Z (3 months ago)
Last Synced: 2024-11-12T01:17:10.618Z (2 months ago)
Topics: live-transcription, openai, openai-whisper, python3, transcription, whisper
Language: HTML
Homepage:
Size: 15.6 KB
Stars: 0
Watchers: 2
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# Real Time Speech To Text (Using OpenAi Whisper)

## Requirements
- Python 3.1x

## Setting Up
- Install requirements using `pip install -r requirements.txt`

## Running the Application
- Run the flask app using `python3 app.py`

## Selecting the Appropriate Model
Whisper offers several models that balance speed and accuracy:

- `tiny`: Fastest but least accurate
- `base`: A balance between speed and accuracy
- `small`: More accurate, slower than base
- `medium`: Even more accurate, slower than small
- `large`: Most accurate but slowest

You can select a model by specifying it when loading the Whisper model. For example:
```python
self.model = whisper.load_model("medium")
```

## Demo
![sample](https://github.com/user-attachments/assets/90d45012-5f1d-4fc1-b72d-5a47c3eb4c63)

## To-Dos
- [ ] Improve accuracy of transcription
- [ ] Add support for multiple languages
- [ ] Optimize performance for low-latency environments
- [ ] Implement speaker recognition
- [ ] Webohook - Create separate sessions(?) for each connected client

## Contribution Guidelines
Contributions are welcome! Please follow these steps to contribute:

1. Fork the repository.
2. Create a new branch (`git checkout -b feature-branch`).
3. Make your changes.
4. Commit your changes (`git commit -m 'Add new feature'`).
5. Push to the branch (`git push origin feature-branch`).
6. Create a pull request.

## License
This project is licensed under the MIT License.