An open API service indexing awesome lists of open source software.

https://github.com/kartikmehta8/ai-voice-to-text-transcriber-real-time

This project is a simple GUI-based application that records audio, transcribes speech using AI models from Hugging Face, and summarizes the transcribed text. The app is built using Python, PyQt6, sounddevice, and transformers.
https://github.com/kartikmehta8/ai-voice-to-text-transcriber-real-time

Last synced: 3 months ago
JSON representation

This project is a simple GUI-based application that records audio, transcribes speech using AI models from Hugging Face, and summarizes the transcribed text. The app is built using Python, PyQt6, sounddevice, and transformers.

Awesome Lists containing this project

README

        

# Voice-to-Text Transcription (Local)

![Banner](./banner.png)

This project is a simple GUI-based application that records audio, transcribes speech using AI models from Hugging Face, and summarizes the transcribed text. The app is built using **Python**, **PyQt6**, **sounddevice**, and **transformers**.

## Features
**Real-time voice recording**
**AI-based transcription** using `facebook/wav2vec2-large-960h`
**AI-powered text summarization** using `facebook/bart-large-cnn`
**Simple GUI** with Start/Stop buttons using `PyQt6`

## How It Works
1. Click **Start Recording** to record your voice.
2. Click **Stop Recording** to stop and save the audio.
3. The AI will transcribe and summarize the text automatically.
4. The summarized text appears in the GUI.

## Future Enhancements
- ✅ Add support for multiple languages.
- ✅ Save transcripts to a file.
- ✅ Live streaming transcription.