An open API service indexing awesome lists of open source software.

https://github.com/noxs1d/speech-to-text

🤖ML project which record audio and converts it to text
https://github.com/noxs1d/speech-to-text

ai fastapi ml torch whisper

Last synced: 3 months ago
JSON representation

🤖ML project which record audio and converts it to text

Awesome Lists containing this project

README

          

# Speech-to-Text System
___
This project is a Speech-to-Text application built with Python, designed to process audio files, perform transcription, and support functionalities such as recording, uploading, playing audio, and converting it to text.

![architecture](SPEECH-TO-TEXT.png)

## Features
1. Record Audio: Users can record audio directly through the interface.
2. Upload Audio File: Users can upload an existing audio file for processing.
3. Play Uploaded Audio: The system allows users to listen to uploaded audio files.
4. Convert Audio to Text: Audio files are transcribed into text using state-of-the-art machine learning models.
## Technologies Used
This project leverages the following frameworks and libraries:

- FastAPI: For building a fast, modern, and asynchronous web API.
- Whisper: OpenAI's speech recognition model for transcription.
- Torch (PyTorch): For loading and running the Whisper model efficiently.
- Wave: For audio file manipulation and playback.