https://github.com/vasovagal/corti

Auditory addon for the vagus second brain: auto-records meeting audio (Zoom/Slack/Meet/Discord) and files diarized transcript notes. macOS, Apple Silicon.

audio-capture core-audio macos rust second-brain transcription vagus

Host: GitHub
URL: https://github.com/vasovagal/corti
Owner: vasovagal
License: MIT
Created: 2026-05-30T16:49:18.000Z (about 1 month ago)
Default Branch: main
Last Pushed: 2026-06-18T00:32:00.000Z (14 days ago)
Last Synced: 2026-06-18T01:07:16.053Z (14 days ago)
Topics: audio-capture, core-audio, macos, rust, second-brain, transcription, vagus
Language: Rust
Size: 945 KB
Stars: 2
Watchers: 0
Forks: 0
Open Issues: 22
Metadata Files:
- Readme: README.md
- License: LICENSE

README

# corti

**Auditory addon for the [vagus](https://github.com/vasovagal/vagus) second brain — Granola
without the SaaS.**

corti is a macOS menu-bar app that turns your meetings into searchable notes, automatically.
When another app grabs the microphone — Zoom, Slack huddles, Google Meet, Discord — corti
starts recording (a blinking tray icon); when the mic is released, it transcribes the call to
diarized, timestamped Markdown and files it straight into your vagus inbox. No "join meeting"
bot, no upload, no account.
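
The start/stop behaviour described above can be pictured as a two-state machine driven by mic-in-use events. The sketch below is illustrative only: `MicEvent`, `RecorderState`, `Action`, and `step` are hypothetical names, not taken from corti's source, and the real app may track more states (for example, a transcription-in-progress state).

```rust
/// Mic-in-use transitions, as some OS-level listener might report them.
/// (Hypothetical type; not corti's actual API.)
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum MicEvent {
    /// Another app started using the microphone.
    Acquired,
    /// The microphone is no longer in use.
    Released,
}

#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum RecorderState {
    Idle,
    Recording,
}

/// Side effect the caller should perform after a transition.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Action {
    None,
    StartRecording,
    StopAndTranscribe,
}

/// Pure transition function: current state + event -> next state + action.
pub fn step(state: RecorderState, event: MicEvent) -> (RecorderState, Action) {
    match (state, event) {
        (RecorderState::Idle, MicEvent::Acquired) => {
            (RecorderState::Recording, Action::StartRecording)
        }
        (RecorderState::Recording, MicEvent::Released) => {
            (RecorderState::Idle, Action::StopAndTranscribe)
        }
        // Duplicate or out-of-order events leave the state unchanged.
        (unchanged, _) => (unchanged, Action::None),
    }
}
```

Keeping the transition function pure makes the "zero-touch" logic easy to unit-test without touching any audio APIs.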

Named after the **Organ of Corti** — the cochlear receptor that transduces sound into neural
signals. Exactly what this app does: audio → notes.

## Features

- **Zero-touch capture.** Detects calls automatically from the macOS mic-in-use signal — no
buttons, no scheduling, no per-app integrations.
- **Both sides, cleanly separated.** Records your mic and the far-end audio as a synchronized
2-track WAV through a single CoreAudio aggregate device — no virtual audio drivers, no
ScreenCaptureKit.
- **Knows who you were talking to.** Attributes each recording to the app that owned the call.
- **Echo cancellation on lossless audio.** An FDAF adaptive filter strips speaker bleed from
the mic track when you're on speakers, plus a residual echo suppressor that eliminates the
faint ghosts the adaptive filter can't model — so the transcriber doesn't misattribute bleed
as "Me." The mic and far-end are captured as separate lossless tracks (the far-end as a mono
echo reference) so the filter sees bit-exact audio.
- **Diarized, timestamped transcripts.** Clean Markdown with word-level timestamps and
**Me** / **Them** speaker labels, dropped right into your vagus vault.
- **Two transcription backends, picked at runtime.** **AWS Transcribe** for cloud accuracy,
or a **fully offline on-device** backend (NVIDIA Parakeet-TDT-0.6B-v3 via ONNX) when nothing
may leave the machine. Both compile in; choose with `CORTI_TRANSCRIBE_BACKEND`.
- **Standalone `corti-tap` CLI.** Force-tap system audio to a WAV on demand
(`corti-tap --inbox` to transcribe + file, `--no-mic` for listen-only webinars,
`--label` to name it).

**How it works:** detect → capture (mic + system tap) → echo-cancel → transcribe → file to
vagus (via the `vagus` CLI; corti never writes your vault directly).
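
Between capture and echo cancellation, the two tracks have to be handed to the filter separately. A minimal sketch of that split, assuming the 2-track audio arrives as interleaved `f32` frames with the mic in channel 0 and the far-end reference in channel 1 (both the layout and the channel order are assumptions, and `split_tracks` is a hypothetical helper):

```rust
/// Split interleaved 2-channel samples into (mic, far_end) tracks.
/// Assumes channel 0 = mic and channel 1 = far-end; corti's real layout
/// may differ. A trailing incomplete frame is ignored.
pub fn split_tracks(interleaved: &[f32]) -> (Vec<f32>, Vec<f32>) {
    let frames = interleaved.len() / 2;
    let mut mic = Vec::with_capacity(frames);
    let mut far_end = Vec::with_capacity(frames);
    for frame in interleaved.chunks_exact(2) {
        mic.push(frame[0]);
        far_end.push(frame[1]);
    }
    (mic, far_end)
}
```

Because both tracks come from the same frames, sample `n` of the mic track lines up with sample `n` of the far-end track, which is what an adaptive echo canceller needs.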

## Install

```sh
brew tap vasovagal/corti
brew install --cask corti
```

corti launches into the menu bar. On first run macOS prompts once for **Microphone** and
**System Audio Recording** — grant both and you're done. (Distributed via a private cask;
pre-1.0.)

**Requirements:** Apple Silicon, macOS 15+.

## Speed & privacy

The offline backend runs **~30–60× realtime** on Apple Silicon — a 1-hour call transcribes
in roughly **1–2 minutes**, CPU-only, with no GPU dependency. Models are a ~0.7 GB one-time
download cached under `~/Library/Caches/corti/models/`; after that the local path makes **zero
network calls**: audio and transcript never leave the device. Prefer cloud accuracy? Set
`CORTI_TRANSCRIBE_BACKEND=aws` to use AWS Transcribe instead. Either way, capture and
transcription run in the background; the UI never blocks.
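
A runtime backend switch of this kind might look like the sketch below. Only the variable name `CORTI_TRANSCRIBE_BACKEND` and the value `aws` come from this README; the value `local`, the choice of the offline backend as the default, and the names `Backend`, `parse_backend`, and `backend_from_env` are assumptions made for illustration.

```rust
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Backend {
    /// Cloud transcription via AWS Transcribe.
    Aws,
    /// Offline, on-device transcription.
    Local,
}

/// Parse the value of `CORTI_TRANSCRIBE_BACKEND`.
/// "aws" is documented; "local" and the unset-means-local default are
/// assumptions of this sketch.
pub fn parse_backend(value: Option<&str>) -> Result<Backend, String> {
    match value.map(|v| v.trim().to_ascii_lowercase()).as_deref() {
        Some("aws") => Ok(Backend::Aws),
        Some("local") | None => Ok(Backend::Local),
        Some(other) => Err(format!("unknown transcription backend: {other}")),
    }
}

/// Read the backend choice from the process environment.
pub fn backend_from_env() -> Result<Backend, String> {
    let raw = std::env::var("CORTI_TRANSCRIBE_BACKEND").ok();
    parse_backend(raw.as_deref())
}
```

Splitting parsing from the environment lookup keeps the selection logic testable without mutating process-wide state.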

## AEC tuning

The echo canceller works out of the box for typical laptop-speaker setups. Individual
parameters can be set via environment variables (`CORTI_AEC_FILTER_LEN`, `CORTI_AEC_MU`,
`CORTI_AEC_POWER_SMOOTHING`, `CORTI_AEC_DOUBLE_TALK_RATIO`, `CORTI_AEC_SUPPRESS_RESIDUAL`) or
persisted in `config.toml`.
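
To give a feel for what a filter length and a step size control, here is a minimal time-domain NLMS echo canceller. It is a simpler relative of the FDAF filter corti uses, not corti's implementation, and the mapping of `filter_len` and `mu` onto `CORTI_AEC_FILTER_LEN` and `CORTI_AEC_MU` is an assumption. The `Nlms` type is hypothetical.

```rust
/// Time-domain NLMS adaptive filter (illustrative stand-in for FDAF).
pub struct Nlms {
    weights: Vec<f32>, // estimated echo path, one tap per sample of delay
    history: Vec<f32>, // recent far-end samples, newest first
    mu: f32,           // step size: larger adapts faster but is less stable
}

impl Nlms {
    pub fn new(filter_len: usize, mu: f32) -> Self {
        assert!(filter_len > 0, "filter needs at least one tap");
        Self {
            weights: vec![0.0; filter_len],
            history: vec![0.0; filter_len],
            mu,
        }
    }

    /// Process one (mic, far_end) sample pair; returns the mic sample
    /// with the predicted echo subtracted.
    pub fn process(&mut self, mic: f32, far_end: f32) -> f32 {
        // Shift the reference history and insert the newest sample.
        self.history.rotate_right(1);
        self.history[0] = far_end;

        // Predicted echo is the dot product of weights and history.
        let echo: f32 = self
            .weights
            .iter()
            .zip(&self.history)
            .map(|(w, x)| w * x)
            .sum();
        let error = mic - echo;

        // Normalise the update by reference power; the small constant
        // avoids division by zero during silence.
        let power = self.history.iter().map(|x| x * x).sum::<f32>() + 1e-6;
        let gain = self.mu * error / power;
        for (w, x) in self.weights.iter_mut().zip(&self.history) {
            *w += gain * x;
        }
        error
    }
}
```

The filter can only model echo paths shorter than `filter_len` samples, which is one reason a longer filter helps in reverberant rooms at the cost of more computation and slower convergence.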

> The automated `corti --calibrate-aec` parameter sweep was dropped per
> [ADR 0007](./design/adr/0007-streaming-aec-first.md) (streaming AEC-first): it depended on
> persisted raw 2-track recordings, which the no-durability/no-raw-retention stance removes.
> See [`design/04a-aec-calibration.md`](./design/04a-aec-calibration.md) for the original
> tuning analysis.

## Status

The full pipeline is built and tested end-to-end (detect → capture → AEC → transcribe →
vagus, crash-recoverable), with both transcription backends working. The live join-call → note
loop needs a signed `.app` to exercise the macOS audio-capture (TCC) grant. See
[`design/STATUS.md`](./design/STATUS.md).

## More

- [`CONTRIBUTING.md`](./CONTRIBUTING.md) — build, test, workspace layout, dev setup.
- [`design/`](./design/) — ADRs, guardrails, and the rationale behind the stack.
