Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/shaadclt/groq-whisper-transcription-app

A Streamlit-based web application that transcribes audio files using OpenAI's Whisper API. You can either upload an MP3 file or input a YouTube URL to convert video audio into text within seconds.
https://github.com/shaadclt/groq-whisper-transcription-app

groq streamlit transcription whisper-api

Last synced: 3 months ago
JSON representation

A Streamlit-based web application that transcribes audio files using OpenAI's Whisper API. You can either upload an MP3 file or input a YouTube URL to convert video audio into text within seconds.

Host: GitHub
URL: https://github.com/shaadclt/groq-whisper-transcription-app
Owner: shaadclt
License: mit
Created: 2024-10-21T06:30:52.000Z (3 months ago)
Default Branch: main
Last Pushed: 2024-10-21T06:41:29.000Z (3 months ago)
Last Synced: 2024-10-21T09:26:26.078Z (3 months ago)
Topics: groq, streamlit, transcription, whisper-api
Language: Python
Homepage:
Size: 14.6 KB
Stars: 3
Watchers: 2
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE.txt

Awesome Lists containing this project

README

# 🎤 Groq Whisper Transcription App
A Streamlit-based web application that transcribes audio files using OpenAI's Whisper API. You can either upload an MP3 file or input a YouTube URL to convert video audio into text within seconds.

## Features
- **Upload MP3 Audio:** Upload an MP3 file, re-encode it to an efficient OGG (Opus) format, and get a transcription.
- **YouTube Audio Transcription:** Enter a YouTube video URL, download the audio, convert it to OGG, and transcribe it.
- **Streamlined Audio Processing:** The app optimizes file size by converting audio to Opus format and embedding it for playback.
- **Whisper API**: Leverages OpenAI's fast and accurate whisper-large-v3 model for speech-to-text transcription.

## Installation
1. **Clone the repository:**

```bash
git clone https://github.com/your-username/Groq-Whisper-Transcription-App.git
cd Groq-Whisper-Transcription-App
```

2. **Set up a virtual environment** (optional):

```bash
python -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate
```

3. **Install dependencies:**

```bash
pip install -r requirements.txt
```

4. **Set up your environment variables:** Create a `.env` file in the root directory and add your Groq API key:

```bash
GROQ_API_KEY=your_groq_api_key
```

5. Run the app:

```bash
streamlit run app.py
```

## How to Use
### MP3 Upload
1. In the "📂 Upload Audio" tab, upload an MP3 file.
2. The file will be processed, re-encoded, and ready for transcription.
3. Click "📝 Transcribe" to generate a text transcript.

### YouTube to Audio Transcription
1. Go to the "🎥 YouTube to Audio" tab.
2. Enter a valid YouTube URL and click "⬇️ Download and Transcribe".
3. The audio will be downloaded, re-encoded, and transcribed.

## License
This project is licensed under the [MIT License](LICENSE.txt).