https://github.com/sebsto/wispr

Privacy-first voice dictation for macOS — powered by on-device Whisper AI. No cloud, no tracking.
https://github.com/sebsto/wispr

accessibility local-first macos menubar-app privacy speech-to-text swift swiftui voice-dictation whisper

Last synced: 2 months ago
JSON representation

Privacy-first voice dictation for macOS — powered by on-device Whisper AI. No cloud, no tracking.

Host: GitHub
URL: https://github.com/sebsto/wispr
Owner: sebsto
License: apache-2.0
Created: 2026-02-27T19:49:25.000Z (4 months ago)
Default Branch: main
Last Pushed: 2026-04-10T08:13:43.000Z (2 months ago)
Last Synced: 2026-04-10T09:03:52.955Z (2 months ago)
Topics: accessibility, local-first, macos, menubar-app, privacy, speech-to-text, swift, swiftui, voice-dictation, whisper
Language: Swift
Homepage: https://wispr.stormacq.com
Size: 11.4 MB
Stars: 85
Watchers: 0
Forks: 10
Open Issues: 4
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

          # Wispr

A macOS menu bar app for local speech-to-text transcription powered by [OpenAI Whisper](https://github.com/openai/whisper) and [NVIDIA Parakeet](https://docs.nvidia.com/nemo-framework/user-guide/latest/nemotoolkit/asr/intro.html).

Wispr runs entirely on-device — your audio never leaves your Mac.

## Features

- **Hotkey-triggered dictation** — press a shortcut to start/stop recording, transcribed text is inserted at the cursor

- **Dual engine architecture** — choose between OpenAI Whisper and NVIDIA Parakeet models through a unified interface

- **Multiple models** — Whisper Tiny (~75 MB) to Large v3 (~3 GB), Parakeet V3 (~400 MB), and Realtime 120M (~150 MB)

- **Low-latency streaming** — Parakeet Realtime 120M provides end-of-utterance detection for near-instant results (English)

- **Model management** — download, activate, switch, and delete models from a single UI

- **Multi-language support** — Whisper supports 90+ languages, Parakeet V3 supports 25 languages

- **Menu bar native** — lives in your menu bar, stays out of the way

- **Onboarding flow** — guided setup for permissions, model selection, and a test dictation

- **Accessibility-first** — full keyboard navigation, VoiceOver support, and high-contrast mode

## Models

| Model | Engine | Size | Streaming | Languages | Notes |

|-------|--------|------|-----------|-----------|-------|

| Tiny | Whisper | ~75 MB | No | 90+ | Fastest, lower accuracy |

| Base | Whisper | ~140 MB | No | 90+ | Good balance for quick tasks |

| Small | Whisper | ~460 MB | No | 90+ | Solid general-purpose |

| Medium | Whisper | ~1.5 GB | No | 90+ | High accuracy |

| Large v3 | Whisper | ~3 GB | No | 90+ | Best Whisper accuracy |

| Parakeet V3 | Parakeet | ~400 MB | No | 25 | Fast, high accuracy, multilingual |

| Realtime 120M | Parakeet | ~150 MB | Yes | English | Low-latency with end-of-utterance detection |

## Installation

### Homebrew (Recommended)

```bash

brew tap sebsto/macos

brew install wispr

```

### Building from Source

Requires macOS 15.0+ and Xcode 16+

1. Clone the repo

2. Open `wispr.xcodeproj` in Xcode

3. Build and run (⌘R)

4. Follow the onboarding flow to grant permissions and download a model

### Xcode 26.4 build fix

~~Previously, Xcode 26.4 required a manual patch to FluidAudio's `AsrManager`

for Swift 6 concurrency compliance. This is no longer needed — FluidAudio

dropped its `swift-transformers` dependency (removing the version conflict with

WhisperKit) and resolved the concurrency issue in their latest release

([FluidInference/FluidAudio#448](https://github.com/FluidInference/FluidAudio/issues/448)).

No workaround is required; the project builds cleanly on Xcode 26.4.~~

See also: [argmaxinc/WhisperKit#451](https://github.com/argmaxinc/WhisperKit/issues/451).

## Requirements

- macOS 15.0+

- Microphone permission

## Architecture

| Layer | Path | Description |

|-------|------|-------------|

| Models | `wispr/Models/` | Data types — model info, permissions, app state, errors |

| Services | `wispr/Services/` | Core logic — audio engine, Whisper/Parakeet integration, hotkey monitoring, settings |

| UI | `wispr/UI/` | SwiftUI views — menu bar, recording overlay, settings, onboarding |

| Utilities | `wispr/Utilities/` | Logging, theming, SF Symbols, preview helpers |

The app uses a `CompositeTranscriptionEngine` that routes to the correct backend (WhisperService or ParakeetService) based on the selected model. Both engines conform to a shared `TranscriptionEngine` protocol, so switching between them is seamless.

## License

This project is licensed under the Apache License 2.0. See [LICENSE](LICENSE) for details.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/sebsto/wispr

Awesome Lists containing this project

README