https://github.com/macmixing/keyvox

KeyVox is a free local-first macOS and iOS dictation app. No accounts. No subscriptions. Big Dictation doesn’t stand a chance. Free Your Voice.

accessibility dictation macos mit-license parakeet speech-to-text swiftui whisper-ai

# KeyVox



KeyVox is a local-first macOS and iOS AI-powered dictation app with on-device Whisper and Parakeet transcription models.

KeyVox for Mac is simple. Hold your trigger key to record, release to transcribe on-device with Whisper or Parakeet, and insert text into the app you are currently using. Your custom dictionary, key dictation style settings, and weekly word total can also stay in sync across your devices with iCloud.

KeyVox for iOS brings the same speech-to-text workflow from the Mac app into a mobile experience, with on-device transcription, post-processing, a shared dictionary via iCloud, and synced preferences.

## Download for iOS

🎉 **KeyVox Keyboard** is available: [**Free on the App Store**](https://apps.apple.com/us/app/keyvox-ai-voice-keyboard/id6760396964?ct=github-readme&mt=8)

## Why KeyVox

- 🚀 Fast local transcription (no cloud transcription path)
- 🌍 Includes on-device Whisper and Parakeet transcription models
- 🖥️ Parakeet requires macOS Sonoma or later; Whisper requires macOS Ventura or later. Both run on iOS 18.6+.
- ♥️ On-device, reversible writing styles on macOS and iOS
- 🔒 Privacy-first workflow with on-device inference
- ⌨️ Global trigger-key dictation from anywhere on macOS
- 🧠 Smart post-processing for custom words, lists, and time formatting
- ☁️ iCloud sync for your custom dictionary and core dictation preferences
- 📊 See your weekly spoken-word total across devices
- 🪄 Reliable insertion flow with Accessibility-first + fallback paths on macOS
- 💪 Native and reliable iOS implementation with keyboard extension

## Core Features

- πŸŽ™οΈ Hold-to-talk dictation with optional hands-free mode on macOS
- πŸŽ™οΈ Tap-to-talk dictation on iOS
- 🧾 Custom dictionary with phonetic-aware matching and iCloud sync
- β™₯️ KeyVox Vibes (macOS/iOS) - Pick a style, change your mind later
- πŸ”ˆ KeyVox Speak (iOS) - On-device text-to-speech with 8 AI voices to choose from
- βš™οΈ Configurable trigger binding (Option, Command, Control, or Fn), synced across devices
- πŸ““ Optional auto-paragraph splitting with Lists preferences with sync
- 🧱 Deterministic list formatting and safe text post-processing
- πŸ“ˆ Weekly word count that reflects how much you talk across all devices
- πŸ“ Draggable recording overlay with persisted position
- πŸ”Š Optional system cue sounds with adjustable volume
- ⚠️ Recovery and warning overlays for insertion/audio edge cases

https://github.com/user-attachments/assets/891f6354-55c2-4f7f-9ebc-2fa6bbfe7b0b

## What is KeyVox Vibes?

**Choose a style. Change your mind.** KeyVox Vibes lets you apply on-device, reversible writing styles to dictated text on Mac and iOS. Pick None, Casual, Polished, or Chill before dictation ends, and KeyVox applies that Vibe before inserting your text.

### Long Press or Tap to Vibe

Vibes are reversible when the latest inserted dictation is untouched:

- 📱 On iOS, **long press the Vibes key** to undo the last Vibe change.
- 📱 On iOS, **tap to choose another Vibe, then long press** to restyle the same untouched text.
- 💻 On Mac, **tap the trigger key** to apply or undo the current Vibe, and **double-tap** to cycle Vibes.

This means you can dictate first, decide later, and switch between clean dictation and styled text without re-recording.
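
The "untouched" rule above can be sketched roughly as follows. This is a minimal illustration, not KeyVox's actual implementation: `LastInsertion` and `restyle` are hypothetical names, and "untouched" is approximated as "the document still ends with exactly the text that was inserted".

```swift
/// Hypothetical record of the most recent dictation (not a KeyVox type).
struct LastInsertion {
    let plainText: String      // post-processed dictation with no Vibe applied
    var insertedText: String   // what was actually placed in the document
}

/// Restyles the last insertion in place.
/// Returns the updated document, or nil if the inserted text has been edited since.
func restyle(document: String,
             last: inout LastInsertion,
             using style: (String) -> String) -> String? {
    // Assumption: "untouched" means the document still ends with the inserted text.
    guard document.hasSuffix(last.insertedText) else { return nil }
    let restyled = style(last.plainText)   // always restyle from the plain dictation
    let prefix = document.dropLast(last.insertedText.count)
    last.insertedText = restyled
    return String(prefix) + restyled
}
```

Because every restyle starts from the stored plain dictation, passing the identity closure `{ $0 }` acts as an undo, which is one way to get the "dictate first, decide later" behavior without re-recording.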

### Local and Private

Vibes run on-device using KeyVox Vibes AI, a local rewrite model with bundled KeyVox style adapters, plus KeyVox's deterministic formatting pipeline. None keeps normal post-processed dictation, Casual performs light cleanup, Polished rewrites toward a professional tone, and Chill performs cleanup followed by lowercase formatting with limited punctuation.
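
As a rough sketch of how the four styles might be dispatched: the rewrite model is represented by a placeholder closure, and the Chill formatting rule (which punctuation counts as "limited") is an assumption made for illustration only. None of these names come from the KeyVox source.

```swift
/// Style names mirror the README; `plain` stands in for "None".
enum Vibe { case plain, casual, polished, chill }

/// Hypothetical deterministic Chill step: lowercase the text, then keep only
/// a small allowed set of punctuation marks (the set here is an assumption).
func chillFormat(_ text: String,
                 keeping allowed: Set<Character> = [",", "?", "'"]) -> String {
    return text.lowercased().filter { !$0.isPunctuation || allowed.contains($0) }
}

/// `rewrite` is a placeholder for the local rewrite model and its style adapters.
func apply(_ vibe: Vibe, to dictation: String,
           rewrite: (Vibe, String) -> String) -> String {
    switch vibe {
    case .plain:
        return dictation                                 // normal post-processed dictation
    case .casual, .polished:
        return rewrite(vibe, dictation)                  // model-driven cleanup or rewrite
    case .chill:
        return chillFormat(rewrite(.chill, dictation))   // cleanup, then lowercase formatting
    }
}
```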

On Mac, KeyVox Vibes is free and requires installing KeyVox Vibes AI (~491 MB). Mac Vibes can run on macOS Ventura and later: Sequoia and newer may use Metal/GPU acceleration, while Ventura and Sonoma run Vibes on CPU only.
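
The macOS version rule above could be expressed as a small lookup. This is a sketch under stated assumptions: `VibesBackend` and `vibesBackend(for:)` are hypothetical names, and the 13.5 floor is taken from the requirements listed elsewhere in this README.

```swift
import Foundation

/// Hypothetical backend choice for Mac Vibes (not a KeyVox type).
enum VibesBackend { case metalGPU, cpuOnly, unsupported }

func vibesBackend(for os: OperatingSystemVersion) -> VibesBackend {
    switch os.majorVersion {
    case 15...:
        return .metalGPU   // Sequoia and newer may use Metal/GPU acceleration
    case 14:
        return .cpuOnly    // Sonoma: CPU only
    case 13:
        // Ventura: CPU only, assuming the app's 13.5 minimum applies here too
        return os.minorVersion >= 5 ? .cpuOnly : .unsupported
    default:
        return .unsupported
    }
}
```

On a real machine the input would come from `ProcessInfo.processInfo.operatingSystemVersion`.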

On iOS, KeyVox Vibes requires the local Vibes AI model and a supported iOS version. You can try Vibes for 3 days, then unlock KeyVox Vibes once and use it without a subscription.

## What is KeyVox Speak? (iOS)

**Copy text. Hear it speak.** KeyVox Speak is a text-to-speech feature that runs entirely on your device using local AI voices. No cloud processing, no data sent anywhere. Just reliable, private playback of any text you copy.

### How to Access Speak

KeyVox Speak is available from multiple places on iOS:

- **Home Tab**: Tap the Speak button from the main screen
- **Keyboard Shortcut**: Trigger directly from the KeyVox keyboard
- **Share to Speak**: Share text, URLs, or images with text from any app
- **Shortcuts & Actions**: Map to Action Button or Control Center for quick access

### Fast Mode

Fast Mode starts speaking ~50% faster. Toggle Fast Mode in the toolbar when you need quicker playback and don't mind hanging out inside the app longer.

### Free to Start

KeyVox Speak is free to try with 2 speaks per day. Install the Theo voice (~19 MB) and start speaking right away. You can download up to 8 total voices in Settings.

To unlock unlimited speaks, purchase KeyVox Speak access once and use it across all your devices on the same Apple account.
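
A daily allowance like the one described above can be modeled with a counter that resets when the calendar day changes. This is only a sketch: `SpeakQuota` is a hypothetical type, and resetting at local midnight is an assumption, since the README does not say how the day boundary is defined.

```swift
import Foundation

/// Hypothetical free-tier counter (not a KeyVox type).
struct SpeakQuota {
    let dailyLimit: Int
    var isUnlocked = false          // true after the one-time purchase
    private var day: Date? = nil    // start of the day the counter applies to
    private var used = 0
    private let calendar: Calendar

    init(dailyLimit: Int = 2, calendar: Calendar = .current) {
        self.dailyLimit = dailyLimit
        self.calendar = calendar
    }

    /// Returns true if a speak is allowed at `now`, and records it.
    mutating func tryConsume(at now: Date = Date()) -> Bool {
        if isUnlocked { return true }
        let today = calendar.startOfDay(for: now)
        if day != today {           // assumption: the allowance resets each calendar day
            day = today
            used = 0
        }
        guard used < dailyLimit else { return false }
        used += 1
        return true
    }
}
```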

**For more information** on KeyVox Speak, visit [our website](https://keyvox.app/speak).

## Quick Start

### Requirements

macOS
- macOS Ventura (13.5) or later
- Apple Silicon recommended (Intel supported)
- ~190–480 MB of disk space, depending on the installed dictation model
- Optional KeyVox Vibes AI model is ~491 MB

iOS
- iOS 18.6 or later
- ~190–480 MB of disk space, depending on the installed dictation model
- Optional KeyVox Vibes AI model is ~491 MB
- Optional KeyVox Speak shared engine is ~642 MB
- Optional KeyVox Speak voices are ~17-19 MB each
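
To estimate the total iOS footprint from the approximate sizes listed above, the figures can simply be added up. The helper below is a hypothetical sketch; it assumes the Speak shared engine is only needed once at least one voice is installed, and caps voices at the 8 mentioned in this README.

```swift
/// Approximate on-device footprint in MB, using the sizes from the requirements list.
func estimatedFootprintMB(includeVibes: Bool, speakVoices: Int) -> ClosedRange<Int> {
    let voices = min(max(speakVoices, 0), 8)   // up to 8 voices can be downloaded
    var low = 190                              // smallest dictation model
    var high = 480                             // largest dictation model
    if includeVibes {
        low += 491                             // optional Vibes AI model
        high += 491
    }
    if voices > 0 {                            // assumption: engine installs with the first voice
        low += 642 + 17 * voices               // shared engine + smallest voices
        high += 642 + 19 * voices              // shared engine + largest voices
    }
    return low...high
}

// Dictation only: roughly 190 to 480 MB.
// Everything installed with 8 voices: roughly 1,459 to 1,765 MB.
print(estimatedFootprintMB(includeVibes: true, speakVoices: 8))
```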

### Install and Run

### Recommended (macOS Release DMG)

1. Download the `.dmg` from the [latest release](https://github.com/macmixing/keyvox/releases/latest).
2. Open the DMG and drag `KeyVox.app` to `Applications`.
3. Launch KeyVox and complete onboarding (Microphone, Accessibility, dictation model setup).

### Build From Source (macOS/iOS)

1. Clone the repo: `git clone https://github.com/macmixing/keyvox.git`
2. Open `macOS/KeyVox.xcodeproj` or `iOS/KeyVox iOS/KeyVox iOS.xcodeproj`.
3. Build and run in Xcode.
4. Complete onboarding: model download, Microphone permission, and Accessibility/keyboard permission.

## How to Use (macOS)

1. Configure your trigger key in Settings (default is **Right Option ⌥**).
2. Hold the trigger, speak, and release to transcribe and insert.
3. Hold **Shift** while releasing to continue recording hands-free.
4. Press **Esc** to cancel an active recording/transcription session.
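
The steps above describe a small state machine, which might look something like this. The type and function names are hypothetical, and one transition is an assumption: the README does not say how hands-free recording ends, so this sketch treats pressing the trigger again as returning to the held state, where the next release transcribes.

```swift
enum DictationState: Equatable {
    case idle, recordingHeld, recordingHandsFree, transcribing
}

enum DictationEvent {
    case triggerDown
    case triggerUp(shiftHeld: Bool)
    case escape
    case transcriptionFinished
}

func next(_ state: DictationState, on event: DictationEvent) -> DictationState {
    switch (state, event) {
    case (.idle, .triggerDown):
        return .recordingHeld
    case (.recordingHeld, .triggerUp(let shiftHeld)):
        // Shift on release keeps recording hands-free; otherwise transcribe and insert.
        return shiftHeld ? .recordingHandsFree : .transcribing
    case (.recordingHandsFree, .triggerDown):
        return .recordingHeld   // assumption: pressing the trigger again resumes held mode
    case (_, .escape):
        return .idle            // Esc cancels an active recording or transcription
    case (.transcribing, .transcriptionFinished):
        return .idle            // text has been inserted
    default:
        return state            // ignore events that do not apply in this state
    }
}
```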

## How to Use (iOS)

1. Tap the microphone icon on the keyboard to start recording; tap again to stop and transcribe.
2. Tap the cancel button on the keyboard toolbar to cancel recording.

## Dictionary & Settings

- Custom Dictionary entries can be added on either platform and will sync across devices via iCloud.
- Automatic **Paragraphs** and **Lists** are enabled by default and can be configured in Settings.
- On iOS, long press the Paragraphs or Lists keyboard key to undo/redo deterministic formatting on the latest untouched dictation.

## KeyVox Speak Components (iOS)

KeyVox Speak brings local AI text-to-speech to iOS, letting you copy text and hear it spoken aloud with natural-sounding voices powered by on-device PocketTTS.

### Requirements

- **PocketTTS CoreML** (~642 MB): The shared AI engine that powers all voices
- **Voice files** (~17-19 MB each): Individual voice models like Alba, Azelma, Cosette, and more

Both components install on-device and run locally with no internet connection required for playback.

## Troubleshooting

- ❌ No text inserted: Verify Accessibility permission in macOS System Settings, or Keyboard settings on iOS.
- 🎤 No input audio: On macOS, verify microphone permission and the selected input in Settings; on iOS, verify microphone access in iOS Settings.
- 📦 Dictation model missing: On macOS, open Settings and re-run the dictation model setup/download; on iOS, reinstall.

## Documentation

- 📘 macOS Engineering details: [`macOS/Docs/ENGINEERING.md`](macOS/Docs/ENGINEERING.md)
- 🗺️ macOS File/component map: [`macOS/Docs/CODEMAP.md`](macOS/Docs/CODEMAP.md)
- 📘 iOS Engineering details: [`iOS/Docs/ENGINEERING.md`](iOS/Docs/ENGINEERING.md)
- 🗺️ iOS File/component map: [`iOS/Docs/CODEMAP.md`](iOS/Docs/CODEMAP.md)
- 📜 License terms: [`LICENSE.md`](LICENSE.md)
- 📄 Trademark policy: [`TRADEMARK.md`](TRADEMARK.md)
- 📎 Third-party notices: [`THIRD_PARTY_NOTICES.md`](THIRD_PARTY_NOTICES.md)

## License

KeyVox uses a dual-license model:

- Source code is MIT-licensed.
- Branding and specified visual assets remain proprietary.
- Bundled third-party components/data/fonts remain under their original licenses.