Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/ognisty321/whisper-transcription-ui
Whisper Transcription UI is a user-friendly graphical interface for whisper-standalone-win. Transcribe and translate audio/video files effortlessly with customizable settings and saved preferences.
https://github.com/ognisty321/whisper-transcription-ui
gui python transcription ui whisper whisper-standalone-win
Last synced: about 1 month ago
JSON representation
Whisper Transcription UI is a user-friendly graphical interface for whisper-standalone-win. Transcribe and translate audio/video files effortlessly with customizable settings and saved preferences.
- Host: GitHub
- URL: https://github.com/ognisty321/whisper-transcription-ui
- Owner: Ognisty321
- License: other
- Created: 2024-06-01T00:37:14.000Z (6 months ago)
- Default Branch: main
- Last Pushed: 2024-06-15T12:02:38.000Z (5 months ago)
- Last Synced: 2024-10-09T19:05:52.332Z (about 1 month ago)
- Topics: gui, python, transcription, ui, whisper, whisper-standalone-win
- Language: Python
- Homepage:
- Size: 212 KB
- Stars: 4
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Whisper Transcription UI
## Overview
**Whisper Transcription UI** is a user-friendly graphical user interface (GUI) for the [whisper-standalone-win](https://github.com/Purfview/whisper-standalone-win) tool.
This intuitive application simplifies audio and video transcription and translation using various Whisper models. Customize settings to your liking and save them for future use.
## ✨ Features
- **Effortless File Handling:** Browse, select, paste, or drag and drop multiple audio and video files.
- **Direct URL Input:** Transcribe audio from online sources by providing the URL.
- **Flexible Transcription Options:**
- Select the target language.
- Choose the Whisper model that best suits your needs.
- Transcribe or translate with ease.
- Define your preferred output format.
- **Advanced Customization:** Fine-tune transcription parameters like FF MDX Kim2, VAD filter, word timestamps, temperature, and beam size.
- **Progress Monitoring:** Keep track of the transcription process.
- **Persistent Settings:** Save your preferred transcription and advanced settings.
- **Detailed Logging:** Enable logging to monitor the transcription process and troubleshoot any issues.## 🚀 Getting Started
### Prerequisites
- **[Whisper Standalone](https://github.com/Purfview/whisper-standalone-win/releases):** Download and install the latest release.
- **Python 3.x**## Installation
1. Clone the repository:
```bash
git clone https://github.com/Ognisty321/whisper-transcription-ui.git
cd whisper-transcription-ui
```2. Install required packages:
```bash
pip install PyQt6 yt-dlp
```3. Ensure `faster-whisper-xxl.exe` is available:
- **Option 1**: Place `faster-whisper-xxl.exe` in the same directory as `main.py`.
- **Option 2**: Specify the path to `faster-whisper-xxl.exe` in the `config.ini` file:```ini
[Settings]
exe_path = path/to/faster-whisper-xxl.exe
```4. Launch the application:
```bash
python main.py
```## 🎬 Usage
1. **Select Files:** Click `Browse` to choose files, drag and drop them into the interface, or paste file paths/URLs.
2. **Set Output Directory:** Specify where transcribed files should be saved.
3. **Choose Options:** Configure transcription language, model, task (transcribe/translate), output format, and other options.
4. **Advanced Options:** Fine-tune your transcription using advanced features and parameters.
5. **Transcribe:** Initiate the transcription process by clicking the `Transcribe` button.
6. **Save Settings:** Preserve your settings for future sessions using the `Save Settings` button.## ⚙️ Configuration
The application uses a `config.ini` file to store your settings. This file is automatically created in the application directory when you save your settings for the first time.
## 🙏 Acknowledgments
This project wouldn't be possible without [whisper-standalone-win](https://github.com/Purfview/whisper-standalone-win). A big thank you to its developers for their exceptional work!
## 📄 License
This project is licensed under the MIT License - see the `LICENSE` file for details.
## 📞 Contact
Have questions or suggestions? Don't hesitate to reach out to [Ognisty321](https://github.com/Ognisty321).