https://github.com/inledgroup/vozes
Talk to write in Linux. Free, private and local.
https://github.com/inledgroup/vozes
Last synced: about 1 month ago
JSON representation
Talk to write in Linux. Free, private and local.
- Host: GitHub
- URL: https://github.com/inledgroup/vozes
- Owner: InledGroup
- License: other
- Created: 2026-03-14T13:37:38.000Z (3 months ago)
- Default Branch: main
- Last Pushed: 2026-04-04T14:27:00.000Z (about 2 months ago)
- Last Synced: 2026-04-04T16:53:09.113Z (about 2 months ago)
- Language: Python
- Size: 2.27 MB
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# 🎙️ Vozes: Professional Voice Dictation for Linux
[](https://github.com/InledGroup/vozes/actions)



**Vozes** is a high-performance, privacy-focused voice dictation system for Linux. Powered by a native C++ implementation of OpenAI's Whisper, it allows you to type with your voice anywhere—from professional IDEs to simple text editors—with zero latency and 100% offline processing.
---
## ✨ Key Features
- **🚀 Blazing Fast:** Powered by `whisper.cpp` for native performance.
- **🔒 100% Private:** Everything stays on your machine. No cloud, no APIs, no tracking.
- **⌨️ Global Typing:** Works like a virtual keyboard. Dictate directly into any active window.
- **🐕 Wake-Word Support:** Start dictating hands-free with "Hey Jarvis" (OpenWakeWord integration).
- **🛠️ Optimized for Linux:** Native GTK4/Adwaita interface, Udev rules for hotkeys, and seamless system integration.
- **📦 Multi-Arch:** Native support for both Intel/AMD (x64) and ARM (Raspberry Pi, Apple Silicon/Asahi).
---
## 🛠️ Installation
### 1. Download the latest release
Grab the `.deb` package for your architecture from the releases section.
> Note that the .deb contained in every release are built in different devices such as Proxmox or Ubuntu and the experience may be different in some architectures.
### 2. Install using APT
```bash
sudo apt install ./vozes_1.5.0_amd64.deb
```
*Note: This will automatically set up a dedicated Python virtual environment and system dependencies to keep your OS clean.*
### Known errors:
#### PyAudio:
Run this
```bash
sudo apt-get install portaudio19-dev python3-dev
```
### 3. Permissions (First time only)
To allow the app to listen to global hotkeys and type on your behalf, ensure your user is in the `input` group:
```bash
sudo usermod -aG input $USER
# Log out and log back in for changes to take effect
```
## Requirements:
```bash
PyAudio==0.2.14
numpy>=2.1.0
webrtcvad==2.0.10
onnxruntime>=1.17.0
scipy>=1.13.0
scikit-learn>=1.4.0
tqdm>=4.66.0
requests==2.31.0
evdev==1.7.0
PyGObject==3.48.2
```
---
## 🚀 How to Use
1. **Launch:** Open "Vozes" from your applications menu.
2. **Select Model:** Choose between `tiny`, `base`, or `small` depending on your CPU power.
3. **Dictate:**
- **Push-to-Talk:** Set a global hotkey in settings.
- **Wake-Word:** Just say *"Hey Jarvis"* and start speaking.
4. **Automatic Typing:** Your speech will be converted to text and typed instantly at your cursor location.
---
## 🏗️ Building from Source
If you want to build the package yourself:
```bash
# Clone the repo with submodules
git clone --recursive https://github.com/InledGroup/vozes.git
cd vozes
# Build the whisper-cli binary
cd bin/whisper.cpp
mkdir build && cd build
cmake .. -DWHISPER_SDL2=OFF -DWHISPER_ALL_EXTRAS=OFF -DWHISPER_BUILD_EXAMPLES=ON
make -j$(nproc) whisper-cli
cd ../../../
# Create the .deb package
./build_deb.sh
```
---
## 🔧 Requirements
- **OS:** Ubuntu 22.04+, Debian 12+, or any Debian-based distro.
- **Python:** 3.10 or higher.
- **Libraries:** `libgirepository1.0-dev`, `libportaudio2`, `libevdev2`.
---
## 🤝 Contributing
Contributions are welcome! Whether it's a bug report, a new feature, or a translation, feel free to open an Issue or a Pull Request.
---
## 📄 License
Vozes is released under the **GNU GPLv3**. See [LICENSE](LICENSE) for more details.
---
Built with ❤️ by JaimeGH.