https://github.com/bhanafee/alfredmeetings

CLI + Alfred workflow to record, transcribe, and summarize voice calls/meetings. This is completely vibe-coded using Claude 4.8, YMMV.
https://github.com/bhanafee/alfredmeetings

alfred-workflows vibe-coded voice-ai

Last synced: 2 days ago
JSON representation

CLI + Alfred workflow to record, transcribe, and summarize voice calls/meetings. This is completely vibe-coded using Claude 4.8, YMMV.

Host: GitHub
URL: https://github.com/bhanafee/alfredmeetings
Owner: bhanafee
License: mit
Created: 2026-06-19T23:47:01.000Z (7 days ago)
Default Branch: main
Last Pushed: 2026-06-24T05:52:22.000Z (3 days ago)
Last Synced: 2026-06-24T07:28:21.110Z (3 days ago)
Topics: alfred-workflows, vibe-coded, voice-ai
Language: Shell
Homepage:
Size: 134 KB
Stars: 0
Watchers: 0
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# Meeting Notes — Alfred workflow

[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](LICENSE)

Record a meeting, transcribe it locally with speaker separation, and turn the
transcript into clean notes / a summary / minutes — all on-device, no cloud, no
per-meeting cost. Built for Apple Silicon.

Three independent components, each available **both as an Alfred keyword and as a
`meetings` subcommand in your shell**. Each also accepts a file, so they chain but can
run standalone:

| Keyword | `meetings` command | Input | Output |
|---|---|---|---|
| `rec` | `meetings rec` | — (toggle; run again to stop) | stereo `.m4a` (left = your mic → **Me**, right = system audio → **Them**) |
| `transcribe [file]` | `meetings transcribe [file]` | newest recording, or a file you pass | timestamped, speaker-labeled `.md` transcript |
| `notes` | `meetings notes [file]` | newest transcript, or a file you pass | processed `.md`: **Minutes**, **Summary**, **Clean up**, or a **Custom** instruction |

Everything is written to `~/Desktop/Meeting Notes/` by default (configurable).

While a recording is active, a red ● appears in the **menu bar** — click it to stop
(handy when the `rec` toggle leaves you unsure whether you started or stopped).

## How it works

- **Recording** captures a stereo file with your mic on the left and the far side on
the right, using a **Core Audio process tap** clocked by the microphone — no BlackHole
and no Audio MIDI Setup. Your output device and volume are left untouched (nothing is
rerouted), and the tap can optionally be scoped to a single app. See
[ADR 0001](docs/adr/0001-system-audio-capture-via-core-audio-process-tap.md).
- **Transcription** splits the stereo file into two mono channels, transcribes each
independently with [`mlx-whisper`](https://github.com/ml-explore/mlx-examples)
(`whisper-large-v3-turbo`), and merges the segments chronologically with `Me` /
`Them` labels. The deterministic two-way split reliably separates **you** from the
remote side. To tell *individual remote speakers* apart — who all share the one
system-audio channel — it additionally runs
[`pyannote.audio`](https://github.com/pyannote/pyannote-audio) diarization on the
Them channel and labels each speaker `Them 1`, `Them 2`, … (see
[Speaker attribution](#speaker-attribution-them-side) below).
- **Notes** sends the transcript to a local Ollama model (`qwen3:4b-instruct` by
default) via its OpenAI-compatible endpoint, using one of the bundled prompts.

## Setup

### 1. Homebrew packages

```sh
brew install ffmpeg switchaudio-osx
brew install --cask ollama
```

(No BlackHole needed — the far side is captured with a Core Audio process tap. If you
installed `blackhole-2ch` for an earlier version, it's now unused and safe to remove.)

Then start Ollama and pull the model:

```sh
open -a Ollama
ollama pull qwen3:4b-instruct
```

### 2. Python environment

```sh
./setup/install.sh
```

Creates a venv at `~/Library/Application Support/AlfredMeetings/venv` and installs
`mlx-whisper` + `openai` + `pyannote.audio`, builds the signed **capture helper**
(`MeetingCapture.app`) and menu-bar recording indicator, and symlinks the `meetings`
CLI into `~/.local/bin`. (Needs Xcode Command Line Tools for `swiftc`; no Homebrew.)

### 3. Install the workflow

```sh
./build.sh # produces dist/AlfredMeetings.alfredworkflow
open dist/AlfredMeetings.alfredworkflow
```

There are **no audio devices to configure** — recording uses a process tap. The first
`rec` prompts once for **Microphone** access (the tap is gated by the Microphone service);
click Allow. The first `transcribe` run downloads the Whisper model (~1.5 GB, one time).

## Configuration

Defaults live in [`src/config.sh`](src/config.sh) and every value can be overridden
by an environment variable. The most common ones are also exposed in Alfred's
**Configure Workflow** panel:

| Variable | Default |
|---|---|
| `MEETINGS_OUTPUT_DIR` | `~/Desktop/Meeting Notes` |
| `MEETINGS_LLM_MODEL` | `qwen3:4b-instruct` |
| `MEETINGS_LLM_BASE_URL` | `http://localhost:11434/v1` |
| `MEETINGS_WHISPER_MODEL` | `mlx-community/whisper-large-v3-turbo` |
| `MEETINGS_MIC_DEVICE` | _(from Microphone choice)_ — CoreAudio UID or name |
| `MEETINGS_THEM_APP` | _(empty)_ — app name to scope the tap; blank = all system audio |
| `MEETINGS_DIARIZE` | `auto` (`off` to disable Them-side speaker labels) |
| `MEETINGS_HF_TOKEN` | _(empty)_ — Hugging Face token for diarization |

On a 24 GB Mac you can point `MEETINGS_LLM_MODEL` at a larger model (e.g.
`qwen2.5:7b`) for higher-quality minutes.

## Command line

The same three stages run from your shell via the `meetings` command (symlinked into
`~/.local/bin` by `setup/install.sh`):

```sh
meetings rec # start; run again (or click the menu-bar ●) to stop
meetings transcribe [audio] # newest recording, or a file you pass
meetings notes minutes [transcript] # also: summary | clean
meetings notes custom "List risks and open questions"
```

`rec` auto-transcribes on stop just as it does under Alfred. The CLI and the Alfred
keywords run the **same** scripts and honour the same `MEETINGS_*` configuration.

## Speaker attribution (Them side)

The mic channel is always **you** ("Me"). Everyone you hear shares the single
system-audio channel, so to label them individually the transcriber runs
`pyannote.audio` diarization on that channel and tags each voice `Them 1`, `Them 2`, …
(the labels are plain text — search-and-replace them with real names afterward).

This needs a free Hugging Face token and one-time access to the gated model
(`pyannote/speaker-diarization-community-1`, the pyannote.audio 4.x flagship pipeline):

1. Create a token at (a **Read** token).
2. Accept the model terms at
.
3. Set the token: Alfred → **Configure Workflow → Hugging Face token**, or export
`MEETINGS_HF_TOKEN` in your shell.

Override the pipeline with `MEETINGS_DIARIZE_MODEL` if you pin a different pyannote
version (e.g. `pyannote/speaker-diarization-3.1` under pyannote.audio 3.x).

If the token, model access, or `pyannote.audio` is missing, transcription **falls back
to a single `Them` label** — nothing breaks, you just lose the per-speaker split. Set
`MEETINGS_DIARIZE=off` (or the **Speaker labels** popup) to skip it entirely.
Diarization adds a CPU pass over the Them channel, so a long meeting takes a few extra
minutes; set `MEETINGS_DIARIZE_DEVICE=mps` to try the GPU.

## Repository layout

```
build.sh Package src/ into dist/AlfredMeetings.alfredworkflow
setup/install.sh Build the venv, indicator app, and `meetings` CLI symlink
setup/audio-setup.md One-time audio device guide
src/info.plist Alfred workflow graph
src/config.sh Shared, env-overridable configuration
src/bin/ Component entrypoints (record / transcribe / notes)
src/bin/meetings Single-entrypoint CLI dispatcher for the shell
src/engine/ Python engines + prompt templates
src/indicator/ Native menu-bar "recording now" app (Swift)
```

## License

This project's source is released under the [MIT License](LICENSE).

The runtime dependencies are **not** bundled or redistributed here — `setup/install.sh`
fetches them on your machine (the Python venv via `pip`; `ffmpeg`, `ollama`, and
`switchaudio-osx` via Homebrew). They keep their own licenses (mostly MIT / BSD /
Apache-2.0). `ffmpeg` is only invoked as a separate process, so its GPL terms do not
apply to this code. See [THIRD-PARTY-NOTICES.md](THIRD-PARTY-NOTICES.md) for the full
dependency list and per-package licenses.

That file is kept honest by CI: `setup/check-notices.sh` fails the build if a dependency
declared in `setup/install.sh` isn't documented. After changing dependencies, run
`./setup/check-notices.sh --refresh` on your Mac to regenerate the version table from the
installed venv, then commit the result.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/bhanafee/alfredmeetings

Awesome Lists containing this project

README