https://github.com/atomicerror/whisper-desktop
A gorgeous, premium Rust & Tauri GUI to compile, manage, and sequentially batch-transcribe media files utilizing whisper.cpp on Linux.
https://github.com/atomicerror/whisper-desktop
gui html javascript linux rust tauri tauri2 transcription whisper whisper-cpp
Last synced: 11 days ago
JSON representation
A gorgeous, premium Rust & Tauri GUI to compile, manage, and sequentially batch-transcribe media files utilizing whisper.cpp on Linux.
- Host: GitHub
- URL: https://github.com/atomicerror/whisper-desktop
- Owner: AtomicError
- License: mit
- Created: 2026-06-01T23:50:18.000Z (12 days ago)
- Default Branch: main
- Last Pushed: 2026-06-02T01:47:36.000Z (12 days ago)
- Last Synced: 2026-06-02T02:23:04.205Z (12 days ago)
- Topics: gui, html, javascript, linux, rust, tauri, tauri2, transcription, whisper, whisper-cpp
- Language: HTML
- Homepage:
- Size: 4.33 MB
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
Whisper Desktop
A premium, state-of-the-art, and gorgeous native Rust & Tauri GUI designed for whisper.cpp.
Run high-performance local speech-to-text models with ease, beauty, and complete privacy.
---
## πΈ Screenshots & Visual Walkthrough
Here is a preview of the premium cyber-neon glassmorphic interface:
| ποΈ Dashboard Home | ποΈ Build Screen |
| :---: | :---: |
|  |  |
| π οΈ Configuration Step | ποΈ Transcription Screen |
| :---: | :---: |
|  |  |
---
## β¨ Features at a Glance
Whisper Desktop is designed to feel like a next-generation utility, combining the blistering performance of Rust/C++ with a highly aesthetic, responsive, and modern glassmorphic web dashboard.
* **β‘ Multiple Acceleration Backends:** Choose between **CPU**, **Vulkan**, **OpenVINO**, or **CUDA** directly from the UI to match your hardware capabilities.
* **π Batch Processing Queue:** Import multiple files, view their duration, remove individual files, clear the queue, and transcribe them sequentially.
* **π€ Live Audio Recording:** Record audio directly from your microphone with real-time waveform animation and transcribe it instantly.
* **ποΈ Integrated Media Converter:** Automatically extract audio from video files using integrated utilities.
* **π¨ Cyber-Neon Glassmorphic Design:** A premium dark-mode interface with elegant micro-animations, harmonized gradient glowing borders, and intuitive layout transitions.
* **π¦ Easy Drag & Drop HUD:** Drag audio or video files anywhere into the application to instantly load them into your queue.
* **βοΈ Full Transcription Settings:** Adjust transcription parameters such as GGML model selection, thread count, target language, translation to English, and output formats (TXT, SRT, VTT).
---
## π οΈ Interactive Architecture
Whisper Desktop orchestrates native `whisper.cpp` binaries using Tauriβs lightning-fast Rust IPC bridge.
```mermaid
graph TD
A[Glassmorphic UI - HTML/CSS/JS] -->|Tauri IPC Command| B(Tauri Core - Rust)
B -->|Launch Process| C[whisper.cpp Core C++ Engine]
C -->|Thread Allocation| D[CPU Execution]
C -->|GPU Acceleration| E[Vulkan / CUDA / OpenVINO]
F[(Local GGML Models)] -.->|Loads Model| C
G[ffmpeg Engine] -.->|Extracts Audio| B
```
---
## π Installation & Packaging
### π¦ Arch Linux (AUR)
Whisper Desktop is available in the Arch User Repository (AUR) as a precompiled binary package (recommended for Arch users):
```bash
paru -S whisper-desktop-bin
```
### π§ Debian / Ubuntu (`.deb`)
Download the latest `.deb` file from the [GitHub Releases](https://github.com/AtomicError/whisper-desktop/releases) page and install it:
```bash
sudo dpkg -i Whisper.Desktop_*_amd64.deb
sudo apt-get install -f # Install dependencies if missing
```
### π© RedHat / Fedora (`.rpm`)
Download the `.rpm` package and install via `dnf`:
```bash
sudo dnf install Whisper.Desktop-*.rpm
```
### π³ AppImage
For any other Linux distribution, simply download the portable `AppImage`, make it executable, and run it:
```bash
chmod +x Whisper.Desktop_*.AppImage
./Whisper.Desktop_*.AppImage
```
---
## π» Development & Building from Source
To build Whisper Desktop locally, ensure you have the following prerequisites installed on your system:
* **Node.js** (v18 or higher) & **npm**
* **Rust** toolchain (Cargo, rustc)
* **System Libraries:** `gtk3`, `webkit2gtk-4.1`, `ffmpeg`
### 1. Clone the Repository
```bash
git clone https://github.com/ggml-org/whisper.cpp.git
cd whisper.cpp/manager/desktop
```
### 2. Install Frontend Dependencies
```bash
npm install
```
### 3. Run in Development Mode
Start the live-reloading hot development server:
```bash
npm run tauri dev
```
### 4. Build Production Packages
Compile and bundle the production release for your system:
```bash
npm run tauri build
```
Production packages will be generated inside `src-tauri/target/release/bundle/`.
---
## π¦ Project Structure
```
whisper-desktop/
βββ src/ # Glassmorphic Frontend Core
β βββ assets/ # Neon SVGs, custom icons, and visual elements
β βββ index.html # Main dashboard layout (6 premium feature panels)
β βββ index.css # Glassmorphism, animations, and color design system
β βββ main.js # IPC binding, queue state, and audio recorders
βββ src-tauri/ # Tauri backend (Rust)
β βββ src/ # Tauri Rust entry point and command routers
β βββ icons/ # Beautiful high-resolution custom cyber-neon app icons
β βββ permissions/ # Tauri v2 security policies and capability schemas
β βββ Cargo.toml # Rust manifest
β βββ tauri.conf.json # Build target configs (deb, rpm, appimage)
βββ PKGBUILD # Arch Linux packaging script
βββ README.md # Project documentation
```
---
## π Privacy & Local Processing
All audio transcription, processing, and recording are executed **100% locally** on your computer. Your audio files, recordings, and transcriptions are never sent to external servers or cloud APIs, ensuring absolute confidentiality and privacy.
---
## π License
This project is licensed under the [MIT License](LICENSE).