An open API service indexing awesome lists of open source software.

https://github.com/waghib/speech-to-text-using-whisper

A web-based audio transcription tool that converts speech to text using OpenAI's Whisper model. Built with Python and Streamlit for easy audio file uploads and accurate transcriptions.
https://github.com/waghib/speech-to-text-using-whisper

openai speech-to-text streamlit whisper-ai

Last synced: about 2 months ago
JSON representation

A web-based audio transcription tool that converts speech to text using OpenAI's Whisper model. Built with Python and Streamlit for easy audio file uploads and accurate transcriptions.

Awesome Lists containing this project

README

          

# 🎙️ AudioScribe

**Transform your audio into text with AI-powered transcription**

[![Python](https://img.shields.io/badge/Python-3.8%2B-blue.svg)](https://www.python.org/downloads/)
[![Streamlit](https://img.shields.io/badge/Streamlit-1.28%2B-FF4B4B.svg)](https://streamlit.io)
[![OpenAI Whisper](https://img.shields.io/badge/OpenAI-Whisper-00A67E.svg)](https://github.com/openai/whisper)
[![License](https://img.shields.io/badge/License-MIT-yellow.svg)](LICENSE)

## 📸 Screenshots

### Upload Screen
AudioScribe Upload Screen

*Clean and intuitive interface for audio file upload*

### Transcription Result
AudioScribe Transcription

*Real-time transcription with audio preview and download options*

## ✨ Features

- 🎯 **High-Accuracy Transcription** using OpenAI's Whisper model
- 🎵 **Multiple Audio Formats** support (MP3, WAV, M4A, OGG)
- 🎧 **Built-in Audio Player** for preview
- 💾 **Instant Download** of transcription results
- 🎨 **Modern, Clean UI** for the best user experience

## 🚀 Quick Start

### Prerequisites

- Python 3.8 or higher
- FFmpeg (required for audio processing)

### Installation

1. Clone the repository:
```bash
git clone https://github.com/yourusername/AudioScribe.git
cd AudioScribe
```

2. Install required packages:
```bash
pip install -r requirements.txt
```

3. Run the application:
```bash
streamlit run main.py
```

The app will be available at `http://localhost:8501`

## 💡 Usage

1. **Upload Audio**
- Click the upload area or drag and drop your audio file
- Supported formats: MP3, WAV, M4A, OGG
- Maximum file size: 200MB

2. **Preview Audio** (Optional)
- Use the built-in audio player to verify your upload
- Check the audio quality before transcription

3. **Get Transcription**
- Wait for the AI model to process your audio
- View the transcribed text in real-time
- Download the results as a text file

## 🛠️ Tech Stack

- **Frontend**: [Streamlit](https://streamlit.io)
- **AI Model**: [OpenAI Whisper](https://github.com/openai/whisper)
- **Audio Processing**: [FFmpeg](https://ffmpeg.org)
- **Python Libraries**:
- `streamlit`: Web application framework
- `whisper`: AI transcription model
- `tempfile`: Temporary file handling
- `datetime`: Timestamp generation

## 📝 License

This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.

## 🤝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

1. Fork the repository
2. Create your feature branch (`git checkout -b feature/AmazingFeature`)
3. Commit your changes (`git commit -m 'Add some AmazingFeature'`)
4. Push to the branch (`git push origin feature/AmazingFeature`)
5. Open a Pull Request

## 📬 Contact

Your Name - [@website](https://waghib.github.io) - waghibahmad30@gmail.com

Project Link: [https://github.com/Waghib/WhisperAI-Speech-Recognition](https://github.com/Waghib/WhisperAI-Speech-Recognition)

---


Made with ❤️ using OpenAI Whisper and Streamlit