https://github.com/macmixing/keyvox
KeyVox is a free local-first macOS and iOS dictation app. No accounts. No subscriptions. Big Dictation doesnβt stand a chance. Free Your Voice.
https://github.com/macmixing/keyvox
accessibility dictation macos mit-license parakeet speech-to-text swiftui whisper-ai
Last synced: 22 days ago
JSON representation
KeyVox is a free local-first macOS and iOS dictation app. No accounts. No subscriptions. Big Dictation doesnβt stand a chance. Free Your Voice.
- Host: GitHub
- URL: https://github.com/macmixing/keyvox
- Owner: macmixing
- License: other
- Created: 2026-02-11T01:17:27.000Z (4 months ago)
- Default Branch: main
- Last Pushed: 2026-05-25T12:19:12.000Z (29 days ago)
- Last Synced: 2026-05-25T14:16:56.702Z (29 days ago)
- Topics: accessibility, dictation, macos, mit-license, parakeet, speech-to-text, swiftui, whisper-ai
- Language: Swift
- Homepage: https://www.keyvox.app
- Size: 360 MB
- Stars: 24
- Watchers: 0
- Forks: 2
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- Funding: .github/FUNDING.yml
- License: LICENSE.md
- Code of conduct: CODE_OF_CONDUCT.md
- Security: SECURITY.md
Awesome Lists containing this project
README
KeyVox
KeyVox is a local-first macOS and iOS AI-powered dictation app with on-device Whisper and Parakeet transcription models.
KeyVox for Mac is simple. Hold your trigger key to record, release to transcribe on-device with Whisper or Parakeet, and insert text into the app you are currently using. Your custom dictionary, key dictation style settings, and weekly word total can also stay in sync across your devices with iCloud.
KeyVox for iOS brings the same speech-to-text workflow from the Mac app into a mobile experience, with on-device transcription, post-processing, shared dictionary via iCloud and synced preferences.
## Download for iOS
π **KeyVox Keyboard** is available: [**Free on the App Store**](https://apps.apple.com/us/app/keyvox-ai-voice-keyboard/id6760396964?ct=github-readme&mt=8)
## Why KeyVox
- π Fast local transcription (no cloud transcription path)
- π Includes on-device Whisper and Parakeet transcription models
- π₯οΈ Parakeet works on Sonoma and later, Whisper works on Ventura and later. Both on iOS 18.6+.
- β₯οΈ On-device, reversible writing styles on macOS and iOS
- π Privacy-first workflow with on-device inference
- β¨οΈ Global trigger-key dictation from anywhere on macOS
- π§ Smart post-processing for custom words, lists, and time formatting
- βοΈ iCloud sync for your custom dictionary and core dictation preferences
- π See your weekly spoken-word total across devices
- πͺ Reliable insertion flow with Accessibility-first + fallback paths on macOS
- πͺ Native and reliable iOS implementation with keyboard extension
## Core Features
- ποΈ Hold-to-talk dictation with optional hands-free mode on macOS
- ποΈ Tap-to-talk dictation on iOS
- π§Ύ Custom dictionary with phonetic-aware matching and iCloud sync
- β₯οΈ KeyVox Vibes (macOS/iOS) - Pick a style, change your mind later
- π KeyVox Speak (iOS) - On-device text-to-speech with 8 AI voices to choose from
- βοΈ Configurable trigger binding (Option, Command, Control, or Fn), synced across devices
- π Optional auto-paragraph splitting with Lists preferences with sync
- π§± Deterministic list formatting and safe text post-processing
- π Weekly word count that reflects how much you talk across all devices
- π Draggable recording overlay with persisted position
- π Optional system cue sounds with adjustable volume
- β οΈ Recovery and warning overlays for insertion/audio edge cases
https://github.com/user-attachments/assets/891f6354-55c2-4f7f-9ebc-2fa6bbfe7b0b
## What is KeyVox Vibes?
**Choose a style. Change your mind.** KeyVox Vibes lets you apply on-device, reversible writing styles to dictated text on Mac and iOS. Pick None, Casual, Polished, or Chill before dictation ends, and KeyVox applies that Vibe before inserting your text.
### Long Press or Tap to Vibe
Vibes are reversible when the latest inserted dictation is untouched:
- π± On iOS, **long press the Vibes key** to undo the last Vibe change.
- π± On iOS, **tap to choose another Vibe, then long press** to restyle the same untouched text.
- π» On Mac, **tap the trigger key** to apply or undo the current Vibe, and **double-tap** to cycle Vibes.
This means you can dictate first, decide later, and switch between clean dictation and styled text without re-recording.
### Local and Private
Vibes run on-device using KeyVox Vibes AI, a local rewrite model with bundled KeyVox style adapters, plus KeyVox's deterministic formatting pipeline. None keeps normal post-processed dictation, Casual performs light cleanup, Polished rewrites toward a professional tone, and Chill performs cleanup followed by lowercase formatting with limited punctuation.
On Mac, KeyVox Vibes is free and requires installing KeyVox Vibes AI (~491 MB). Mac Vibes can run on macOS Ventura and later: Sequoia and newer may use Metal/GPU acceleration, while Ventura and Sonoma run Vibes on CPU only.
On iOS, KeyVox Vibes requires the local Vibes AI model and supported iOS version. You can try Vibes for 3 days, then unlock KeyVox Vibes once and use it without a subscription.
## What is KeyVox Speak? (iOS)
**Copy text. Hear it speak.** KeyVox Speak is a text-to-speech feature that runs entirely on your device using local AI voices. No cloud processing, no data sent anywhere. Just reliable, private playback of any text you copy.
### How to Access Speak
KeyVox Speak is available from multiple places on iOS:
- **Home Tab**: Tap the Speak button from the main screen
- **Keyboard Shortcut**: Trigger directly from the KeyVox keyboard
- **Share to Speak**: Share text, URLs, or images with text from any app
- **Shortcuts & Actions**: Map to Action Button or Control Center for quick access
### Fast Mode
Fast Mode starts speaking ~50% faster. Toggle Fast Mode in the toolbar when you need quicker playback and don't mind hanging out inside the app longer.
### Free to Start
KeyVox Speak is free to try with 2 speaks per day. Install the Theo voice (~19 MB) and start speaking right away. You can download up to 8 total voices in Settings.
To unlock unlimited speaks, purchase KeyVox Speak access once and use it across all your devices on the same Apple account.
**For more information** on KeyVox Speak, visit [our website](https://keyvox.app/speak).
## Quick Start
### Requirements
macOS
- macOS Ventura (13.5) or later
- Apple Silicon recommended (Intel supported)
- ~190β480 MB of disk space, depending on the installed dictation model
- Optional KeyVox Vibes AI model is ~491 MB
iOS
- iOS 18.6 or later
- ~190β480 MB of disk space, depending on the installed dictation model
- Optional KeyVox Vibes AI model is ~491 MB
- Optional KeyVox Speak shared engine is ~642 MB
- Optional KeyVox Speak voices are ~17-19 MB each
### Install and Run
### Recommended ( macOS Release DMG)
1. Download the `.dmg` from the [latest release](https://github.com/macmixing/keyvox/releases/latest).
2. Open the DMG and drag `KeyVox.app` to `Applications`.
3. Launch KeyVox and complete onboarding (Microphone, Accessibility, dictation model setup).
### Build From Source (macOS/iOS):
1. Clone the repo:
`git clone https://github.com/macmixing/keyvox.git`
2. Open:
`macOS/KeyVox.xcodeproj` or `iOS/KeyVox iOS/KeyVox iOS.xcodeproj`
3. Build and run in Xcode.
4. Complete onboarding:
Model download, Microphone permission, and Accessibility/keyboard permission.
## How to Use (macOS)
1. Configure your trigger key in Settings (default is **Right Option β₯**).
2. Hold trigger, speak, release to transcribe and insert.
3. Hold **Shift** while releasing to continue recording hands-free.
4. Press **Esc** to cancel an active recording/transcription session.
## How to Use (iOS)
1. Tap microphone icon on keyboard to start recording, tap again to stop and transcribe.
2. Tap the cancel button on the keyboard toolbar to cancel recording.
## Dictionary & Settings
- Custom Dictionary entries can be added on either platform and will sync across devices via iCloud.
- Automatic **Paragraphs** and **Lists** can be configured in Settings. (Enabled by default)
- On iOS, long press the Paragraphs or Lists keyboard key to undo/redo deterministic formatting on the latest untouched dictation.
## KeyVox Speak (iOS) - Coming Soon!
KeyVox Speak brings local AI text-to-speech to iOS, letting you copy text and hear it spoken aloud with natural-sounding voices powered by on-device PocketTTS.
### Requirements
- **PocketTTS CoreML** (~642 MB): The shared AI engine that powers all voices
- **Voice files** (~17-19 MB each): Individual voice models like Alba, Azelma, Cosette, and more
Both components install on-device and run locally with no internet connection required for playback.
## Troubleshooting
- β No text inserted:
Verify Accessibility permission in macOS System Settings or Keyboard Settings on iOS.
- π€ No input audio:
Verify microphone permission and selected input in Settings on macOS or microphone access in iOS Settings.
- π¦ Dictation model missing:
Open Settings and re-run dictation model setup/download on macOS, reinstall on iOS.
## Documentation
- π macOS Engineering details: [`macOS/Docs/ENGINEERING.md`](macOS/Docs/ENGINEERING.md)
- πΊοΈ macOS File/component map: [`macOS/Docs/CODEMAP.md`](macOS/Docs/CODEMAP.md)
- π iOS Engineering details: [`iOS/Docs/ENGINEERING.md`](iOS/Docs/ENGINEERING.md)
- πΊοΈ iOS File/component map: [`iOS/Docs/CODEMAP.md`](iOS/Docs/CODEMAP.md)
- π License terms: [`LICENSE.md`](LICENSE.md)
- π Trademark policy: [`TRADEMARK.md`](TRADEMARK.md)
- π Third-party notices: [`THIRD_PARTY_NOTICES.md`](THIRD_PARTY_NOTICES.md)
## License
KeyVox uses a dual-license model:
- Source code is MIT-licensed.
- Branding and specified visual assets remain proprietary.
- Bundled third-party components/data/fonts remain under their original licenses.