https://github.com/waghib/speech-to-text-using-whisper

A web-based audio transcription tool that converts speech to text using OpenAI's Whisper model. Built with Python and Streamlit for easy audio file uploads and accurate transcriptions.
https://github.com/waghib/speech-to-text-using-whisper

openai speech-to-text streamlit whisper-ai

Last synced: about 2 months ago
JSON representation

A web-based audio transcription tool that converts speech to text using OpenAI's Whisper model. Built with Python and Streamlit for easy audio file uploads and accurate transcriptions.

Host: GitHub
URL: https://github.com/waghib/speech-to-text-using-whisper
Owner: Waghib
Created: 2025-02-03T18:08:13.000Z (over 1 year ago)
Default Branch: main
Last Pushed: 2025-02-04T18:52:58.000Z (over 1 year ago)
Last Synced: 2025-02-12T12:17:14.885Z (over 1 year ago)
Topics: openai, speech-to-text, streamlit, whisper-ai
Language: Python
Homepage: https://speech2texts.streamlit.app/
Size: 1 MB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 1
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# 🎙️ AudioScribe

**Transform your audio into text with AI-powered transcription**

[![Python](https://img.shields.io/badge/Python-3.8%2B-blue.svg)](https://www.python.org/downloads/)
[![Streamlit](https://img.shields.io/badge/Streamlit-1.28%2B-FF4B4B.svg)](https://streamlit.io)
[![OpenAI Whisper](https://img.shields.io/badge/OpenAI-Whisper-00A67E.svg)](https://github.com/openai/whisper)
[![License](https://img.shields.io/badge/License-MIT-yellow.svg)](LICENSE)

## 📸 Screenshots

### Upload Screen
AudioScribe Upload Screen

*Clean and intuitive interface for audio file upload*

### Transcription Result
AudioScribe Transcription

*Real-time transcription with audio preview and download options*

## ✨ Features

- 🎯 **High-Accuracy Transcription** using OpenAI's Whisper model
- 🎵 **Multiple Audio Formats** support (MP3, WAV, M4A, OGG)
- 🎧 **Built-in Audio Player** for preview
- 💾 **Instant Download** of transcription results
- 🎨 **Modern, Clean UI** for the best user experience

## 🚀 Quick Start

### Prerequisites

- Python 3.8 or higher
- FFmpeg (required for audio processing)

### Installation

1. Clone the repository:
```bash
git clone https://github.com/yourusername/AudioScribe.git
cd AudioScribe
```

2. Install required packages:
```bash
pip install -r requirements.txt
```

3. Run the application:
```bash
streamlit run main.py
```

The app will be available at `http://localhost:8501`

## 💡 Usage

1. **Upload Audio**
- Click the upload area or drag and drop your audio file
- Supported formats: MP3, WAV, M4A, OGG
- Maximum file size: 200MB

2. **Preview Audio** (Optional)
- Use the built-in audio player to verify your upload
- Check the audio quality before transcription

3. **Get Transcription**
- Wait for the AI model to process your audio
- View the transcribed text in real-time
- Download the results as a text file

## 🛠️ Tech Stack

- **Frontend**: [Streamlit](https://streamlit.io)
- **AI Model**: [OpenAI Whisper](https://github.com/openai/whisper)
- **Audio Processing**: [FFmpeg](https://ffmpeg.org)
- **Python Libraries**:
- `streamlit`: Web application framework
- `whisper`: AI transcription model
- `tempfile`: Temporary file handling
- `datetime`: Timestamp generation

## 📝 License

This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.

## 🤝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

1. Fork the repository
2. Create your feature branch (`git checkout -b feature/AmazingFeature`)
3. Commit your changes (`git commit -m 'Add some AmazingFeature'`)
4. Push to the branch (`git push origin feature/AmazingFeature`)
5. Open a Pull Request

## 📬 Contact

Your Name - [@website](https://waghib.github.io) - waghibahmad30@gmail.com

Project Link: [https://github.com/Waghib/WhisperAI-Speech-Recognition](https://github.com/Waghib/WhisperAI-Speech-Recognition)

---

Made with ❤️ using OpenAI Whisper and Streamlit

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/waghib/speech-to-text-using-whisper

Awesome Lists containing this project

README