{"id":47698254,"url":"https://github.com/cdeust/cortex","last_synced_at":"2026-05-09T12:12:07.020Z","repository":{"id":346266914,"uuid":"1185234162","full_name":"cdeust/Cortex","owner":"cdeust","description":"C.O.R.T.E.X. — Cognitive profiling system for Claude Code","archived":false,"fork":false,"pushed_at":"2026-04-02T23:46:57.000Z","size":15959,"stargazers_count":8,"open_issues_count":0,"forks_count":1,"subscribers_count":0,"default_branch":"main","last_synced_at":"2026-04-03T03:26:27.212Z","etag":null,"topics":["agent-memory-system","agent-skills","artificial-intelligence","causal-inference","claude-code","claude-code-plugin","cognitive-architecture","cognitive-science","hopfield-network","knowledge-representation","long-term-memory","mcp-client","mcp-server","model-context-protocol","neural-network","neuroscience","persistent-memory","predictive-coding","retrieval-augmented-generation","vector-search"],"latest_commit_sha":null,"homepage":"https://ai-architect.tools","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/cdeust.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null,"notice":null,"maintainers":null,"copyright":null,"agents":null,"dco":null,"cla":null}},"created_at":"2026-03-18T11:24:37.000Z","updated_at":"2026-04-03T02:10:14.000Z","dependencies_parsed_at":null,"dependency_job_id":null,"html_url":"https://github.com/cdeust/Cortex","commit_stats":null,"previous_names":["cdeust/jarvis","cdeust/cortex"],"tags_count":17,"template":false,"template_full_name":null,"purl":"pkg:github/cdeust/Cortex","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cdeust%2FCortex","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cdeust%2FCortex/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cdeust%2FCortex/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cdeust%2FCortex/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/cdeust","download_url":"https://codeload.github.com/cdeust/Cortex/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cdeust%2FCortex/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":31577448,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-04-08T14:31:17.711Z","status":"ssl_error","status_checked_at":"2026-04-08T14:31:17.202Z","response_time":54,"last_error":"SSL_connect returned=1 errno=0 peeraddr=140.82.121.5:443 state=error: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["agent-memory-system","agent-skills","artificial-intelligence","causal-inference","claude-code","claude-code-plugin","cognitive-architecture","cognitive-science","hopfield-network","knowledge-representation","long-term-memory","mcp-client","mcp-server","model-context-protocol","neural-network","neuroscience","persistent-memory","predictive-coding","retrieval-augmented-generation","vector-search"],"created_at":"2026-04-02T16:56:42.754Z","updated_at":"2026-05-09T12:12:06.999Z","avatar_url":"https://github.com/cdeust.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"\u003cp align=\"center\"\u003e\n  \u003cimg src=\"docs/assets/cortex-workflow-graph.png\" alt=\"Cortex workflow graph — each project becomes a dense brain-region cloud whose shape IS its code: files, commands, agents, memories and AST symbols (functions, methods, classes, modules, constants across 10 languages) are pulled into position by the real edges between them (defined_in, calls, imports, member_of, tool_used_file). Symbols touched by two projects sit in the inter-project space between their hubs; long threads mark shared files and MCPs.\" width=\"100%\"/\u003e\n\u003c/p\u003e\n\n\u003cp align=\"center\"\u003e\n  \u003ca href=\"https://github.com/cdeust/Cortex/actions/workflows/ci.yml\"\u003e\u003cimg src=\"https://github.com/cdeust/Cortex/actions/workflows/ci.yml/badge.svg\" alt=\"CI\"\u003e\u003c/a\u003e\n  \u003ca href=\"LICENSE\"\u003e\u003cimg src=\"https://img.shields.io/badge/License-MIT-blue.svg\" alt=\"MIT License\"\u003e\u003c/a\u003e\n  \u003cimg src=\"https://img.shields.io/badge/python-3.10+-blue.svg\" alt=\"Python 3.10+\"\u003e\n  \u003cimg src=\"https://img.shields.io/badge/tests-2500_passing-brightgreen.svg\" alt=\"Tests\"\u003e\n  \u003cimg src=\"https://img.shields.io/badge/citations-45_papers-orange.svg\" alt=\"Citations\"\u003e\n  \u003cimg src=\"https://img.shields.io/badge/version-3.15.0-brightgreen.svg\" alt=\"Version 3.15.0\"\u003e\n  \u003ca href=\"https://glama.ai/mcp/servers/cdeust/Cortex\"\u003e\u003cimg src=\"https://glama.ai/mcp/servers/cdeust/Cortex/badges/score.svg\" alt=\"Glama score: security A, license A\"\u003e\u003c/a\u003e\n\u003c/p\u003e\n\n\u003cp align=\"center\"\u003e\n  \u003ca href=\"#getting-started\"\u003eGetting Started\u003c/a\u003e · \u003ca href=\"#write-papers-in-cortex\"\u003eWrite Papers\u003c/a\u003e · \u003ca href=\"#what-this-actually-feels-like\"\u003eWhat It Feels Like\u003c/a\u003e · \u003ca href=\"#retrieval-that-actually-works\"\u003eBenchmarks\u003c/a\u003e · \u003ca href=\"#the-science-under-the-hood\"\u003eScience\u003c/a\u003e · \u003ca href=\"#neural-graph\"\u003eViews\u003c/a\u003e\n\u003c/p\u003e\n\n\u003cp align=\"center\"\u003e\n  \u003cstrong\u003eCompanion projects:\u003c/strong\u003e\u003cbr\u003e\n  \u003ca href=\"https://github.com/cdeust/cortex-know-when-to-stop-training-model\"\u003ecortex-beam-abstain\u003c/a\u003e — community-trained retrieval abstention model for RAG systems\u003cbr\u003e\n  \u003ca href=\"https://github.com/cdeust/zetetic-team-subagents\"\u003ezetetic-team-subagents\u003c/a\u003e — specialist Claude Code agents Cortex orchestrates with\u003cbr\u003e\n  \u003ca href=\"https://github.com/cdeust/automatised-pipeline\"\u003eautomatised-pipeline\u003c/a\u003e — automated 11-stage pipeline (findings → PRs) that Cortex drives via \u003ccode\u003erun_pipeline\u003c/code\u003e\u003cbr\u003e\n  \u003ca href=\"https://github.com/cdeust/prd-spec-generator\"\u003eprd-spec-generator\u003c/a\u003e — stateless reducer that turns a feature description into a 9-file PRD (consumes Cortex memory + the pipeline's graph intel)\n\u003c/p\u003e\n\n---\n\nClaude Code forgets you every time you close the tab. Every architecture decision you explained. Every debugging session where you traced a bug through four layers of abstraction. Every \"remember, we decided to use event sourcing, not CRUD\" correction. Gone. Next session, you're a stranger to your own tools.\n\nCortex is a persistent memory engine for Claude Code built on computational neuroscience. It remembers what you worked on, how you think, what you decided and why. Not as a dumb text dump shoved into context, but as a living memory system that consolidates, forgets intelligently, and reconstructs the right context at the right time.\n\n**26 biological mechanisms. 47 MCP tools. 9 automatic hooks. Runs entirely on your machine. PostgreSQL + pgvector.**\n\n**v3.15.0 — verification campaign + arXiv-ready papers**: 45 per-mechanism ablation rows across LongMemEval-S (17 rows, n=500) and LoCoMo (14 rows × 2 sweeps, n=1986). Headline numbers stay verified — LongMemEval R@10 = 98.4% / MRR = 0.9124, LoCoMo R@10 = 94.3% / MRR = 0.8279 — and every figure now traces to a JSON in `benchmarks/results/ablation/` with code SHAs, dirty flags, and per-row category breakdowns. The thermodynamic memory paper (`docs/arxiv-thermodynamic/main.pdf`, 30 pages, all 45 citations resolved) and the structured context assembly paper (`docs/arxiv-context-assembly/main.pdf`, 37 pages) are arXiv-ready. Two production fixes surfaced during verification: consolidation cadence is now ingest-relative instead of wall-clock (recovers MRR 0.222 → 0.8264 on backdated corpora), and the plasticity ablation no-op preserves the result-shape contract (no more silent KeyError). HOPFIELD, HDC, SPREADING_ACTIVATION, DENDRITIC_CLUSTERS, EMOTIONAL_RETRIEVAL, MOOD_CONGRUENT_RERANK, and RECONSOLIDATION are now wired end-to-end on the production read path; 23 mechanisms have CORTEX_ABLATE_\u003cMECH\u003e hooks reading at the hot path. BEAM-10M LLM head-to-head harness scaffolded at `benchmarks/llm_head_to_head/`. [Release notes →](https://github.com/cdeust/Cortex/releases/tag/v3.15.0)\n\n**v3.14.2 — call graph lit + queryable**: the workflow graph now renders the actual call and import edges between symbols — not just the AST shells. Every edge carries a *confidence* (0.0–1.0) and a *reason* tag (`direct-ast`, `import-scope-lookup`, `memory-entities-link`, …) so you can tell a resolved call from a same-name guess at a glance. Knowledge-graph entities ship as a first-class layer: ~10k entities extracted from memory text land between the memory ring and the file shell, heat-weighted centroid-placed near the memories that mention them. And a new `query_workflow_graph` MCP tool returns typed subgraphs on demand — filter by `node_kind`, `edge_kind`, `neighbour_of \u003cid\u003e + depth`, or `domain`, so downstream agents can reason over graph slices without rebuilding from scratch.\n\n**v3.14.0 — neural graph \u0026 AST integration**: the workflow graph reveals itself one layer at a time — first your projects, then their tools, then the files those tools touched, then the code itself (functions, methods, classes) parsed from 10 languages (Rust, Python, TypeScript, Java, Kotlin, Swift, Objective-C, C, C++, Go) via the [automatised-pipeline](https://github.com/cdeust/automatised-pipeline) Rust AST backend. A symbol that is imported by two projects literally sits in the space between those two projects on the map, so the picture of *what connects to what* is the picture of your codebase. Each project is indexed once and cached on disk; reopening the graph hydrates in milliseconds, and only projects whose source actually changed are re-read. Click any node — a file, a function, a command — and the side panel lists the *named* things it is connected to (callers, imports, the files that used it) instead of a bare count. [Release notes →](https://github.com/cdeust/Cortex/releases/tag/v3.14.0)\n\n## Getting Started\n\n```bash\nclaude plugin marketplace add cdeust/Cortex\nclaude plugin install cortex\n```\n\nRestart your Claude Code session, then run:\n\n```\n/cortex-setup-project\n```\n\nThis handles everything: PostgreSQL + pgvector installation, database creation, embedding model download, cognitive profile building from session history, codebase seeding, conversation import, and hook registration. Zero manual steps.\n\nAfter install, verify everything is wired correctly:\n\n```bash\npython3 -m mcp_server.doctor\n```\n\n(or, from inside the marketplace clone: `python3 ~/.claude/plugins/cache/cortex-plugins/cortex/*/mcp_server/doctor.py`)\n\nSeven checks in two seconds: Python, PG driver, DATABASE_URL, PG connection, extensions, writable methodology dir, I10 pool-capacity invariant. Exit 0 means ready.\n\n\u003e **Using Claude Cowork?** Install [Cortex-cowork](https://github.com/cdeust/Cortex-cowork) instead — uses SQLite, no PostgreSQL required.\n\n\u003cdetails\u003e\n\u003csummary\u003e\u003cstrong\u003eMore options\u003c/strong\u003e (Clone, Docker, Manual setup)\u003c/summary\u003e\n\n**Clone + setup script:**\n```bash\ngit clone https://github.com/cdeust/Cortex.git \u0026\u0026 cd Cortex\nbash scripts/setup.sh        # macOS / Linux\npython3 scripts/setup.py     # Windows / cross-platform\n```\n\n**Docker:**\n```bash\ngit clone https://github.com/cdeust/Cortex.git \u0026\u0026 cd Cortex\ndocker build -t cortex-runtime -f docker/Dockerfile .\ndocker run -it \\\n  -v $(pwd):/workspace \\\n  -v cortex-pgdata:/var/lib/postgresql/17/data \\\n  -v ~/.claude:/home/cortex/.claude-host:ro \\\n  cortex-runtime\n```\n\n**Manual:** See [detailed manual setup instructions](docs/manual-setup.md).\n\n\u003c/details\u003e\n\n---\n\n## Write papers in Cortex\n\nCortex doesn't just remember — it authors. Every memory that passes the pipeline becomes a structured wiki page, editable in place with a full scientific writing environment:\n\n\u003cp align=\"center\"\u003e\n\u003cimg src=\"docs/wiki-edit.png\" width=\"100%\" alt=\"Cortex Wiki editor — CodeMirror 6 source pane on the left, live-preview pane on the right with headings, lists, and structured sections rendered via the project's LaTeX-inspired typography\" /\u003e\n\u003c/p\u003e\n\n- **CodeMirror 6 inline editor** with live preview; save round-trips atomically to the `.md` file on disk (git-diffable).\n- **LaTeX math** — `$E=\\nabla \\cdot F$` and `$$…$$` blocks rendered live via KaTeX.\n- **BibTeX citations** — drop `.bib` files under `wiki/_bibliography/`, use `[@friston2010]` inline, and Citation.js resolves them to `(Friston 2010)` with an auto-generated APA bibliography.\n- **Figure / equation / table auto-numbering** — `{#fig:arch}` labels, `{@fig:arch}` cross-refs, resolved to `Figure 1` / `Equation 3` / `Section 2.1`.\n- **Pandoc export** — one click produces PDF (via LaTeX), TEX, DOCX, or HTML. Journal-submittable from the same markdown that feeds the memory pipeline.\n\nThe source stays markdown. Your `.md` files remain grep-able, diffable, and interoperable with any external tool. Cortex adds a rendering + editing + export layer on top without stealing your content into a proprietary format.\n\n---\n\n## What this actually feels like\n\n**Monday.** You spend an hour debugging a webhook handler. After tracing through four layers, you find the root cause: a race condition in the Redis session store where TTL expiry can fire between the auth check and the permission lookup. You discuss the fix with Claude, decide on an approach, and implement it. Session ends.\n\n**Thursday.** Different project, but a user reports intermittent logouts. You open Claude Code. Before you even describe the bug, Cortex has already injected three memories: Monday's race condition analysis, a decision from two weeks ago to use Redis for all session state, and a lesson from an older session about TTL edge cases in distributed caches.\n\nClaude doesn't just have your conversation history. It has *context*. It connects the current problem to past decisions, surfaces lessons you forgot you learned, and skips the part where you re-explain your entire architecture.\n\n**Three weeks later.** Those individual debugging sessions have been consolidated into a general pattern: \"authentication edge cases involving TTL-based caches.\" The specific Redis commands compressed to a summary. The debugging steps faded. The principle survived. Your next auth issue starts with institutional knowledge, not a blank page.\n\nThat's the difference. Not \"here's what you said last time.\" Real recall — the kind where your tools understand the *shape* of what you've been building.\n\n---\n\n## Retrieval that actually works\n\nWe tested Cortex against three published benchmarks. All scores are **retrieval-only** — no LLM reader in the evaluation loop. We measure whether the right memory shows up, not whether a model can generate a good answer from it.\n\n### LongMemEval — can you find a fact from 40 sessions ago?\n\nLongMemEval (Wu et al., ICLR 2025): 500 human-curated questions embedded in ~40 sessions of conversation history (~115k tokens). The paper's best retrieval hit 78.4% Recall@10.\n\n| | Cortex | What it means |\n|---|---|---|\n| Recall@10 | **98.4%** | The right memory shows up in the top 10 results for nearly every question |\n| MRR | **0.9124** | The correct answer is usually the first or second result |\n\n| Category | MRR | R@10 | Why this score |\n|---|---|---|---|\n| Single-session (assistant) | 1.000 | 100.0% | Verbatim assistant responses are easy to match |\n| Multi-session reasoning | 0.962 | 100.0% | Entity graph connects evidence across sessions |\n| Knowledge updates | 0.925 | 100.0% | Heat decay naturally surfaces the newest version of a fact |\n| Temporal reasoning | 0.926 | 98.5% | Time anchors embedded directly in memory content |\n| Single-session (user) | 0.814 | 94.3% | User phrasing varies more than assistant responses |\n| Single-session (preference) | 0.668 | 93.3% | Preferences are implicit — harder to retrieve by keyword |\n\nKnowledge updates scored highest because heat-based decay naturally pushes newer information above older versions of the same fact. This wasn't designed for the benchmark. It's just how the thermodynamic model works.\n\n### LoCoMo — can you handle trick questions and multi-hop reasoning?\n\nLoCoMo (Maharana et al., ACL 2024): 1,986 questions across 10 conversations, including adversarial trick questions designed to confuse retrieval, multi-hop queries requiring evidence from multiple turns, and temporal reasoning about when things happened.\n\n| | Cortex | What it means |\n|---|---|---|\n| Recall@10 | **94.3%** | Right memory in top 10 over 9 times out of 10 (n=1986, BASELINE_NO_CONSOLIDATION, post-plasticity-fix) |\n| MRR | **0.8279** | Correct answer is typically the first result |\n\n| Category | MRR | R@10 | Why this score |\n|---|---|---|---|\n| Adversarial | 0.881 | 96.0% | Trick questions can't fool five fused signals |\n| Open-domain | 0.875 | 96.9% | Broad questions benefit from multi-signal coverage |\n| Multi-hop | 0.779 | 90.3% | Entity graph connects evidence across turns |\n| Single-hop | 0.741 | 94.0% | Direct factual questions — strong but room to improve |\n| Temporal | 0.577 | 78.3% | \"When did X happen?\" is the hardest category — needs better time-series matching |\n\nNo LLM at query time. No API calls. Just a 22MB embedding model, PostgreSQL with pgvector, and neuroscience algorithms doing the heavy lifting. Five retrieval signals fused server-side (vector similarity, full-text search, trigram matching, thermodynamic heat, recency), then reranked by a cross-encoder.\n\n### BEAM — 10 million tokens of conversation, one memory system\n\nBEAM (Tavakoli et al., ICLR 2026) is the hardest long-term memory benchmark published. 10 conversations, each spanning 10 million tokens. 200 probing questions across 10 memory abilities, including three that no prior benchmark tests: contradiction resolution, event ordering, and instruction following.\n\nEvery system in the paper collapses at this scale. The best result reported (LIGHT on Llama-4-Maverick) scores 0.266. Context-window approaches can't fit it. Standard RAG drowns in noise.\n\n| Split | WRRF baseline | With Context Assembler | What happened |\n|---|---|---|---|\n| BEAM-100K | 0.591 | **0.602** | Flat search still works at small scale |\n| **BEAM-10M** | 0.353 | **0.471 (+33.4%)** | Structured assembly dominates when flat search drowns |\n\n**BEAM-10M per-ability breakdown (Temporal Context Assembler — no oracle labels, timestamps only):**\n\n| Ability | MRR | R@10 | Δ vs WRRF | What happened |\n|---|---|---|---|---|\n| knowledge_update | **0.950** | 100.0% | +0.115 | Day-level grouping keeps knowledge updates tighter than topic labels |\n| contradiction_resolution | **0.892** | 95.0% | +0.259 | Temporal proximity catches contradictions better than topic boundaries |\n| information_extraction | **0.592** | 75.0% | +0.144 | Same-day memories cluster the right facts |\n| preference_following | **0.508** | 60.0% | +0.096 | Preferences cluster by time, not topic |\n| abstention | **0.600** | 60.0% | +0.500 | Temporal scoping correctly empties irrelevant stages |\n| temporal_reasoning | **0.460** | 50.0% | +0.090 | Time anchors naturally align with temporal stages |\n| multi_session_reasoning | 0.425 | 60.0% | +0.010 | Cross-day bridging via entity graph — marginal gain |\n| instruction_following | 0.150 | 15.0% | +0.082 | Instructions still look like normal questions |\n| summarization | 0.083 | 11.1% | −0.103 | Temporal scoping too narrow for broad summary queries |\n| event_ordering | 0.050 | 5.0% | −0.017 | Chronological sequencing needs more than retrieval |\n\nEight of ten abilities improve. The key finding: **temporal day-level partitioning outperforms BEAM's ground-truth topic labels** (0.471 vs 0.429 with oracle plan_id). This was not predicted — it means temporal proximity is a stronger stage signal than topic boundaries for conversational memory, and the architecture generalizes without any oracle metadata.\n\nAt 10 million tokens per conversation, you have ~7,500 memories that all look similar to a vector search engine. The [Structured Context Assembly](docs/research-post-context-assembly.md) architecture fixes this by breaking the conversation into stages (distinct topics), retrieving within the current stage first, following entity graph connections to related stages, and falling back to summaries for everything else. 8 of 10 memory abilities improve.\n\nThis architecture was originally designed in September 2025 for generating coherent 9-page PRDs on Apple Intelligence's 4096-token context window ([ai-prd-builder](https://github.com/cdeust/ai-prd-builder), commit [`462de01`](https://github.com/cdeust/ai-prd-builder/commit/462de01) — one month before the BEAM paper existed). It works because the problem is the same at both scales: you can't fit everything in context, so you need to be smart about what goes in.\n\n**Honest caveat:** BEAM doesn't define a retrieval MRR metric — the paper uses LLM-as-judge nugget scoring. Our \"MRR\" is a retrieval proxy (rank of first substring-matching memory). The paper's \"LIGHT\" scores are end-to-end QA, shown for directional reference.\n\n\u003cdetails\u003e\n\u003csummary\u003eRunning benchmarks yourself\u003c/summary\u003e\n\n```bash\npip install -e \".[postgresql,benchmarks,dev]\"\n\npython benchmarks/beam/run_benchmark.py --split 100K          # ~10 min\npython benchmarks/beam/run_benchmark.py --split 10M           # ~50 min\nCORTEX_USE_ASSEMBLER=1 python benchmarks/beam/run_benchmark.py --split 10M\npython benchmarks/locomo/run_benchmark.py                     # ~40 min\npython benchmarks/longmemeval/run_benchmark.py --variant s    # ~45 min\n```\n\nAll scores on fresh database (DROP + CREATE per run), TRUNCATE between conversations, FlashRank preflight verified. See [full methodology](docs/research-post-context-assembly.md).\n\n\u003c/details\u003e\n\n---\n\n## The science under the hood\n\nCortex doesn't store memories the way a database stores rows. It treats them more like a brain treats experiences.\n\n**Memories have temperature.** Every memory starts hot. Access it and it stays hot. Ignore it and it cools. Below a threshold, it compresses: full text → summary → keywords → fades entirely. This isn't a bug — it's [rate-distortion optimal forgetting](docs/papers/science.md), the same mathematical framework your brain uses to decide what's worth keeping. Important memories resist compression. Surprising ones get a heat boost. Boring, redundant ones quietly disappear. *(Anderson \u0026 Lebiere 1998; Ebbinghaus 1885)*\n\n**Storage has a gatekeeper.** Not everything deserves to be remembered. Cortex maintains a predictive model of what it already knows, and only stores information that violates its expectations. Tell it the same thing twice and the write gate blocks the second attempt. This is predictive coding — the same mechanism your neocortex uses to filter sensory input. Only prediction errors get through. *(Friston 2005; Bastos et al. 2012)*\n\n**Retrieval changes the memory.** When you recall a memory in a new context, Cortex doesn't just passively hand it back. It compares the retrieval context against the storage context, and if there's enough mismatch, it reconsolidates — updates the memory to reflect what's true now. This is real neuroscience. Nader et al. showed in 2000 that retrieved memories become labile and can be rewritten. Your codebase evolves, and so do Cortex's memories of it. *(Dudai 2012; Nader et al. 2000)*\n\n**Emotional memories are stronger.** Frustration during debugging, excitement when a test passes, urgency in a production incident — Cortex detects emotional valence and encodes those memories with more force. They decay slower, compress later, and surface faster. Like how you remember your worst production outage in vivid detail but can't recall last Tuesday's standup. *(Wang \u0026 Bhatt 2024; Yerkes-Dodson 1908)*\n\n**Background consolidation runs like sleep.** When you're away, a consolidation cycle processes recent memories: decays old ones, compresses verbose ones, promotes recurring patterns into general knowledge (episodic → semantic transfer), discovers entity relationships, and runs \"dream replay\" where related memories are compared and new connections emerge. *(McClelland et al. 1995; Foster \u0026 Wilson 2006; Buzsáki 2015)*\n\n**Similar memories stay distinct.** Pattern separation — modeled on the dentate gyrus, which keeps \"Tuesday's standup\" separate from \"Wednesday's standup\" even though they're almost identical. Without this, retrieval returns the same generic match for every similar query. *(Leutgeb et al. 2007; Yassa \u0026 Stark 2011)*\n\n**45 papers total.** Every algorithm, constant, and threshold traces to a published source. Full citations, equations, ablation data, and per-module implementation audit: **[docs/papers/science.md](docs/papers/science.md)** | **[Thermodynamic memory paper (PDF, 30 pages)](docs/arxiv-thermodynamic/main.pdf)** | **[Structured context assembly paper (PDF, 37 pages)](docs/arxiv-context-assembly/main.pdf)** | **[Research post on structured context assembly](docs/research-post-context-assembly.md)**\n\n---\n\n## Hippocampal Replay: context that survives compaction\n\nClaude Code has a 200k/1M token context window. During long sessions, when that window fills up, it compacts: summarizes older messages, strips tool outputs, paraphrases your instructions. Important nuance evaporates. Decisions you anchored early in the conversation dissolve into vague summaries.\n\nHippocampal Replay fixes this. Named after the neuroscience phenomenon where your brain replays important experiences during sleep to consolidate them, it treats context compaction as \"sleep\" and replays what matters when Claude \"wakes up.\"\n\n**Before compaction hits,** a hook fires. Cortex drains your active context — what you were working on, which files were open, what decisions you'd made, what errors were unresolved — and stores it as a checkpoint.\n\n**After compaction,** a second hook fires. Cortex reconstructs your context intelligently. Not by dumping everything back in, but by assembling the right pieces: your latest checkpoint, any facts you'd anchored as critical, the hottest project memories, and predictions about what you'll need next.\n\nYou can be explicit about what matters:\n\n```\ncortex:anchor({ content: \"We're using the event-sourcing pattern. All state changes go through the event bus.\", reason: \"Architecture constraint\" })\n```\n\nAnchored memories get maximum protection. They always survive compaction, no matter what.\n\n---\n\n## Auto-generated project wiki\n\nEvery time you store a memory, Cortex doesn't just save text — it extracts entities, builds relationships, detects schemas, and links the new memory into a growing knowledge graph. Over time, this becomes a **living wiki of your project**: decisions and their rationale, patterns that emerged, lessons learned, architectural constraints, and how they all connect.\n\nExplore it through:\n- **`/cortex-visualize`** — opens the interactive workflow graph in your browser (Graph is the default view; Knowledge / Wiki / Board / Pipeline tabs over the same data)\n- **`get_causal_chain`** — trace how one decision led to another\n- **`get_project_story`** — auto-generated narrative of your project's evolution\n- **`detect_gaps`** — find areas where knowledge is thin or isolated\n\nThis isn't documentation you write. It's documentation that writes itself from how you work.\n\n---\n\n## Neural Graph\n\nLaunch with `/cortex-visualize`. The default landing view is **Graph** — a live, radial-hierarchical map of everything Claude has ever done in your projects. Knowledge / Wiki / Board / Pipeline tabs sit over the same data for different reading angles.\n\n\u003cp align=\"center\"\u003e\n\u003cimg src=\"docs/assets/cortex-workflow-graph.png\" width=\"100%\" alt=\"Cortex workflow graph — many brain-region clouds, one per project, with inner radial shells grouping nodes by Claude surface (setup → tools → files → discussions → memories)\" /\u003e\n\u003c/p\u003e\n\n**Graph View — the Claude workflow map.** Each project becomes a **cloud of nodes** around one gold domain hub. Inside every cloud, nodes are arranged in six concentric levels by the Claude surface (or the code itself) that produced them:\n\n| Level | What's there | How to click through |\n|---|---|---|\n| **L1 · Claude setup** | Skills · Commands · Hooks · Agents · MCPs | Click a skill for its file path; click an MCP to see which domains share it (thin indigo threads bridge clouds) |\n| **L2 · Tools** | One hub per Claude tool per domain (Edit · Write · Read · Grep · Glob · Bash · Task) | Click a hub for files touched + total uses |\n| **L3 · Files** | Every file Claude ever opened, read, edited, searched, or referenced in a Bash command — colored by primary tool (green edited / cyan read / fuchsia searched / orange bash-only) | Click for `first_seen`, `last_accessed`, `last_modified`, and a **See diff against HEAD** button that renders new/modified/deleted/historical content inline |\n| **L4 · Discussions** | One node per Claude Code session | Click for `started_at`, duration, message count, and a **View full conversation** button that replays every turn (including tool calls) |\n| **L5 · Memories** | Persistent memories, colored by consolidation stage (labile → early LTP → late LTP → consolidated → semantic) | Click for full content, tags, and every scientific measurement |\n| **L6 · AST symbols** | The code itself — functions (cyan), methods (sky), classes/structs/enums/traits/protocols (violet), modules/packages/namespaces (amber), constants/fields/properties (slate) — parsed from 10 languages (Rust, Python, TypeScript, Java, Kotlin, Swift, Objective-C, C, C++, Go) and laid out as petals around their parent file in L3 | Click for qualified name, symbol type, parent file, and the named edges: `defined_in`, `calls`, `imports`, `member_of`. A symbol imported by two projects sits in the space between their clouds, making `what connects to what` literally the shape of the code |\n\n**What L6 is for.** L5 and below tell you *what Claude did*; L6 tells you *what the code is*. Once AST symbols are on the map, three things become visible for free: (1) **shared code** — any function, class or module referenced by two projects drifts into the inter-project gap, so reused primitives reveal themselves without a dependency audit; (2) **impact** — clicking a symbol surfaces every caller, importer, and member edge, so \"if I change this, what breaks?\" is a graph neighbourhood, not a grep; (3) **the picture of the codebase itself** — because the forces come from real `defined_in` / `calls` / `imports` / `member_of` edges, a dense petal around a file means a fat internal API and a thin one means a leaf module. Click any node and the side panel lists the *named* callers, imports, and members instead of a bare count. L6 nodes are the only ones without a fixed radial slot — they orbit their parent file, so the layer collapses cleanly when you filter it out.\n\nThin dashed **violet threads** between clouds mark cross-domain files and shared MCPs. A single **grouped filter select** (`All` / `L1–L6` / by kind / by file cluster / by AST edge kind / `Cross-domain`) isolates any slice; a text search narrows within that slice.\n\nEverything Claude touches live is visible: Edit, Write, Read, Grep, Glob, NotebookRead, NotebookEdit, and Bash paths inside commands — captured via the `PostToolUse` hook with compact markers so the graph rebuilds every ~2 minutes with fresh data.\n\n\u003cp align=\"center\"\u003e\n\u003cimg src=\"docs/assets/cortex-consolidation-board.png\" width=\"100%\" alt=\"Cortex Board view — five columns for labile, early LTP, late LTP, consolidated, and reconsolidating memories, each column header showing total count and per-bucket stage metrics (decay, vulnerability, plasticity, heat, importance, encoding, interference, hippo, replay) plus cards grouped by stage\" /\u003e\n\u003c/p\u003e\n\n**Board View** — consolidation stages as kanban columns (`labile` · `early_ltp` · `late_ltp` · `consolidated` · `reconsolidating`). Each column header reads live bucket metrics: **decay rate**, **vulnerability**, **plasticity**, **heat / importance / encoding / interference** medians, **hippocampal dependency**, and **replay count** — with the advancement rule (`replay ≥ 3`, `DA ≥ 1 or imp \u003e 0.3`, etc.) printed under the bar. \"At-risk\" counter flags memories near promotion or decay. Cards inside each column carry heat, importance, surprise, valence, arousal, and the exact tool that created the memory.\n\n\u003cp align=\"center\"\u003e\n\u003cimg src=\"docs/assets/cortex-memory-detail.png\" width=\"100%\" alt=\"Cortex memory detail modal — stage pill, tags, valence chip, full body, then a Scientific measurements grid with plain-language explanations of consolidation stage, activity (heat), baseline activity, importance, surprise, emotional tone, emotional intensity, confidence, plasticity, stability\" /\u003e\n\u003c/p\u003e\n\n**Detail panel — every measurement explained.** Clicking a memory (or a file, skill, command, agent, hook, MCP, discussion) opens a modal with the raw value **and** a one-line plain-language explanation. Consolidation stage, activity (heat), baseline activity, importance, surprise, emotional tone, emotional intensity, confidence, plasticity, stability — each is a labeled bar with a sentence like *\"How unexpected this memory was when it arrived. Surprises stick in the mind better than routine events.\"* No more staring at opaque numbers.\n\n**Knowledge View** — curated memory cards with heat-based left border, emotion tag, consolidation stage, and evidence file references. Filter by domain or emotion; click any card for a full-screen detail panel with Markdown + JSON pretty-print.\n\n**Wiki View** — every memory admitted by the grounded-theory pipeline lands here as a structured page (ADR / spec / lesson / convention / note) with:\n\n- EB Garamond body, IBM Plex Mono code, centered academic-paper layout\n- **Heat bar**, lifecycle pill (`active` / `area` / `archived` / `evergreen`), staleness flag, backlinks footer\n- **Inspector drawer** — full audit trail (memos, source claim events, draft history) for every page\n- **Inline CodeMirror 6 editor** + live preview with KaTeX math (see [Write Papers in Cortex](#write-papers-in-cortex) above)\n- **BibTeX citations**, figure/equation/table auto-numbering, cross-references\n- **Pandoc export** → PDF / LaTeX / DOCX / HTML\n\n**Pipeline View** — horizontal Sankey flow from domains through the write gate into consolidation stages. Width of each ribbon = memory volume. Makes retention and drop-off across stages visible at a glance.\n\n---\n\n## Agent Integration\n\nCortex works with teams of specialized agents. Each agent has scoped memory (`agent_topic`) while sharing critical decisions across the team — based on Wegner's transactive memory theory (1987): teams store more knowledge than individuals because each member specializes.\n\n\u003cp align=\"center\"\u003e\n\u003cimg src=\"docs/diagram-team-memory.svg\" alt=\"Transactive Memory System\" width=\"80%\"/\u003e\n\u003c/p\u003e\n\n**Specialization** — each agent writes to its own topic. Engineer's debugging notes don't clutter tester's recall.\n\n**Coordination** — decisions auto-protect and propagate. When engineer decides \"use Redis over Memcached,\" every agent sees it at next session start.\n\n**Directory** — entity-based queries span all topics. \"What do we know about the reranker?\" returns results from engineer, tester, and researcher.\n\nWorks with any custom agents. See [zetetic-team-subagents](https://github.com/cdeust/zetetic-team-subagents) for a ready-made team of **27 specialists** — each with scoped memory that doesn't clutter the others.\n\n---\n\n## Architecture\n\nClean Architecture with strict dependency rules. Inner layers never import outer layers.\n\n\u003cp align=\"center\"\u003e\n\u003cimg src=\"docs/diagram-architecture.svg\" alt=\"Clean Architecture layers\" width=\"80%\"/\u003e\n\u003c/p\u003e\n\n| Layer | What lives here | Count |\n|---|---|---|\n| **core/** | Neuroscience + retrieval logic | 118 modules |\n| **context_assembly/** | Structured context assembler | 10 modules |\n| **infrastructure/** | PostgreSQL, embeddings, file I/O | 33 modules |\n| **handlers/** | MCP tools | 62 tools |\n| **hooks/** | Lifecycle automation | 7 hooks |\n| **observability/** | Prometheus text-format metrics | 1 module |\n\n**Storage:** PostgreSQL 15+ with pgvector (HNSW) and pg_trgm. All retrieval in PL/pgSQL stored procedures.\n\n**Concurrency (v3.13+):** `psycopg_pool.ConnectionPool` with two latency classes — `interactive_pool` (min=2, max=8) for recall/remember/anchor, `batch_pool` (min=1, max=2) for consolidate/ingest. Tool handlers run on worker threads via `asyncio.to_thread`; per-tool admission semaphores bound fan-out. Heat is a *function* computed at read time by `effective_heat()` — homeostatic writes one scalar per domain per run instead of N rows.\n\n**Configuration:** Set `DATABASE_URL` (default: `postgresql://localhost:5432/cortex`). All parameters use `CORTEX_MEMORY_` prefix — see `mcp_server/infrastructure/memory_config.py` for the full list (~40 parameters).\n\n---\n\n## Security\n\nRuns **100% locally** — MCP over stdio, PostgreSQL on localhost, visualization on 127.0.0.1. No data leaves your machine. Audit score: **91/100**.\n\n---\n\n## Development\n\n```bash\npytest                    # 2500+ tests\nruff check .              # Lint\nruff format --check .     # Format\n```\n\n---\n\n## Verification\n\nEvery benchmark headline number above is backed by a per-mechanism ablation campaign on the appropriate benchmark for each mechanism's mechanism-of-action. The campaign comprises three artefact sets at full n on a single-seed protocol with code SHAs, dirty flags, manifests, and per-row JSON outputs preserved alongside the writeups:\n\n- **LongMemEval-S, 17 rows, n=500** — `tasks/e1-v3-results.md`. Per-mechanism deltas across the integrated stack at the calibrated equilibrium; category-specialization analysis.\n- **LoCoMo, 14 rows, n=1986 (pre-plasticity-fix bytes)** — `tasks/e1-v3-locomo-results.md`. Two-baseline (NO_CONSOLIDATION / WITH_CONSOLIDATION) design; empirical resolution of the architectural-mismatch hypothesis (RECONSOLIDATION ΔMRR = +0.0076, ADAPTIVE_DECAY ΔMRR = -0.0163).\n- **LoCoMo, 14 rows, n=1986 (post-plasticity-fix bytes)** — `tasks/e1-v3-locomo-results-post-fix.md`. Re-run on commit `2f45bcb` (descendant of plasticity result-shape fix `5f737fe`); cadence-fix anchor agreement re-validated identically (ΔvsNO = +0.0014); two consolidation-only rows (HOMEOSTATIC_PLASTICITY, SCHEMA_ENGINE) recover positive contributions previously masked by the contract bug.\n\nTotal: 45 per-mechanism evidence rows across 26 enum mechanisms (17 read-path + 9 consolidation-only routed to LoCoMo). The full thermodynamic memory paper, including §6.3 per-mechanism evidence and §6.3.4.1 plasticity-fix re-run subsection, is at `docs/arxiv-thermodynamic/main.pdf` (30 pages, all 45 citations resolved). The companion structured context assembly paper is at `docs/arxiv-context-assembly/main.pdf` (37 pages).\n\n## License\n\nMIT\n\n## Citation\n\nIf you reference the system, the paper PDFs on `main` are the canonical artefacts (arXiv IDs forthcoming, endorsement in progress):\n\n```bibtex\n@software{cortex2026,\n  title={Cortex: Persistent Memory for Claude Code},\n  author={Deust, Clement},\n  year={2026},\n  url={https://github.com/cdeust/Cortex}\n}\n\n@unpublished{deust2026thermodynamic,\n  title={Thermodynamic Memory for Conversational Agents:\n         A Per-Mechanism Ablation Study on LongMemEval and LoCoMo},\n  author={Deust, Clement},\n  year={2026},\n  note={arXiv ID forthcoming, endorsement in progress},\n  url={https://github.com/cdeust/Cortex/blob/main/docs/arxiv-thermodynamic/main.pdf}\n}\n\n@unpublished{deust2026context,\n  title={Structured Context Assembly for Long-Horizon Conversational Memory},\n  author={Deust, Clement},\n  year={2026},\n  note={arXiv ID forthcoming, endorsement in progress},\n  url={https://github.com/cdeust/Cortex/blob/main/docs/arxiv-context-assembly/main.pdf}\n}\n```\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcdeust%2Fcortex","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fcdeust%2Fcortex","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcdeust%2Fcortex/lists"}