https://github.com/arcadamarket/rag-runtime-kernel

Persistent memory and deterministic state control for any LLM, in two tiers. Tier 1: a single markdown spec you paste into any chat — no install. Tier 2: a formally-verified Python runtime (676 tests, TLA+) that runs state off-LLM. LLM proposes, system decides, state persists.

Host: GitHub
URL: https://github.com/arcadamarket/rag-runtime-kernel
Owner: arcadamarket
License: agpl-3.0
Created: 2026-05-03T01:36:00.000Z (about 2 months ago)
Default Branch: main
Last Pushed: 2026-06-02T06:58:47.000Z (16 days ago)
Last Synced: 2026-06-02T07:22:13.124Z (16 days ago)
Topics: agent-memory, crash-recovery, deterministic, llm, mcp, rag, state-machine, tla-plus, token-efficiency
Language: Python
Homepage:
Size: 2.58 MB
Stars: 1
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Roadmap: docs/ROADMAP.md

# RAG Runtime Kernel

> **LLM proposes. System decides. State persists.**

Persistent memory and deterministic state control for any LLM. The kernel keeps state management **out of the language model** — bootstrap, validation, persistence, and crash recovery run as deterministic code, so the model spends its tokens on reasoning, not on bookkeeping.

It ships in **two tiers** so it fits both a non-technical user pasting one file into a chat, and a developer running a serious, long-lived, token-critical project on a hardened Python backbone.

---

## Choose Your Path

| | **Tier 1 — Simple** | **Tier 2 — Enforced** |
|---|---|---|
| **Who it's for** | Anyone. No Python, no Node, no install. | Builders of large, multi-session, token-critical projects who want hard guarantees. |
| **What you run** | One markdown specification, dropped into a chat session. | The `rag_kernel` Python runtime (MCP or HTTP server) alongside the spec. |
| **How rules are applied** | The LLM **self-enforces** the spec by instruction (autonomous). | The Python kernel **intercepts and validates** every state change. The LLM cannot bypass it. |
| **Determinism** | As reliable as the model following instructions. | Deterministic state machine — formally verified (TLA+) and covered by 1,235 passing tests. |
| **Token cost of state ops** | The model reads and reasons over the spec (~100 KB). | **Zero LLM tokens** for bootstrap, validation, persistence, and recovery — they run in Python. |
| **Version** | Specification **v3.2.2** | Runtime kernel **v0.4.7** |
| **Setup effort** | Seconds. Paste a file. | Minutes. Copy `rag_kernel/`, run one command. |

> **Same project, same RAG files.** Start in Tier 1 and graduate to Tier 2 without rewriting anything — the enforced runtime reads and writes the exact same `RAG/` state. Tier 2 is a strict superset of Tier 1.

> **On the two version numbers.** This repo tracks two things on separate version lines: the **specification** (the protocol the LLM follows — currently `v3.2.2`) and the **runtime kernel** (the Python engine that enforces it — currently `v0.4.7`). Tier 1 uses the spec alone; Tier 2 uses the runtime to enforce that spec.

---

## What Problem This Solves

Every LLM session starts from zero. Close the tab, lose the state. The common workarounds are fragile: chat-history dumps, vector stores that retrieve the wrong thing, and framework lock-in that breaks when you switch platforms. Underneath all of them sits a deeper problem — **the language model is doing its own bookkeeping.** Tracking what's done, what's pending, which decision superseded which, whether a write actually landed: every one of those is reasoning the model has to redo each session, and every one of them costs tokens and invites drift.

RAG Runtime Kernel moves that bookkeeping out of the model. State lives in plain files on disk. The lifecycle is a fixed state machine. And in **Enforced mode** the transitions, validation, and persistence are executed by deterministic Python — not proposed by the model and hoped for.

**The shift this represents (Tier 2):**

- **State management leaves the LLM entirely.** The model proposes a JSON action; the kernel validates it against policy and either commits or rejects it. The model never directly mutates state.
- **Bootstrap costs zero LLM tokens.** `rag_kernel init` parses the ~100 KB specification and produces `RAG_MASTER.json` deterministically — no model call. The work that used to mean "feed the model a 20K-token spec and ask it to build the RAG" is now a function call.
- **Determinism is proven, not asserted.** The state machine is verified with TLA+ (the same class of formal method Amazon uses for AWS) and exercised by 1,235 unit tests — all passing.
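
The propose, validate, commit-or-reject flow described above can be sketched in a few lines. This is a minimal illustration, not the kernel's real API: the `Kernel` class, the `ALLOWED_ACTIONS` policy, and the action names are all hypothetical and exist only to show that the model's JSON never touches state directly.

```python
import json
from dataclasses import dataclass, field

# Hypothetical policy: action names and required fields are illustrative only,
# not the kernel's real schema.
ALLOWED_ACTIONS = {"add_task": {"title"}, "close_task": {"task_id"}}


@dataclass
class Kernel:
    state: dict = field(default_factory=lambda: {"tasks": {}, "next_id": 1})
    audit: list = field(default_factory=list)

    def submit(self, raw_proposal: str) -> bool:
        """Validate a JSON proposal, then commit or reject it. Never raises."""
        try:
            proposal = json.loads(raw_proposal)
            self._validate(proposal)
        except (ValueError, KeyError, TypeError) as exc:
            # Malformed JSON, wrong shape, or a policy violation: state is untouched.
            self.audit.append(("REJECTED", str(exc)))
            return False
        self._commit(proposal)
        self.audit.append(("COMMITTED", proposal["action"]))
        return True

    def _validate(self, proposal: dict) -> None:
        action = proposal["action"]
        if action not in ALLOWED_ACTIONS:
            raise ValueError(f"unknown action: {action}")
        missing = ALLOWED_ACTIONS[action] - proposal.keys()
        if missing:
            raise ValueError(f"missing fields: {sorted(missing)}")
        if action == "close_task" and proposal["task_id"] not in self.state["tasks"]:
            raise ValueError("no such task")

    def _commit(self, proposal: dict) -> None:
        # Only code reached after validation is allowed to mutate state.
        if proposal["action"] == "add_task":
            task_id = self.state["next_id"]
            self.state["tasks"][task_id] = {"title": proposal["title"], "open": True}
            self.state["next_id"] = task_id + 1
        else:  # close_task
            self.state["tasks"][proposal["task_id"]]["open"] = False
```

A rejected proposal leaves `state` exactly as it was and adds a `REJECTED` entry to `audit`, so the model can be told why and asked to propose again.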

**What you get in both tiers:**

- **Persistence** — project state survives across sessions, tabs, and platforms.
- **Lean context** — HOT/COLD memory tiers keep only active state in the window; archival data loads on demand.
- **Audit trail** — every state transition, decision, and conflict is logged and traceable.
- **Conflict ledger** — when a new fact contradicts a stored one, both are preserved, never silently overwritten.
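
To make the conflict-ledger idea concrete, here is a minimal sketch of a store that refuses to overwrite silently. The `FactStore` class and its field names are assumptions made for illustration; the real ledger format is defined by the specification.

```python
from dataclasses import dataclass, field


@dataclass
class FactStore:
    facts: dict = field(default_factory=dict)      # key -> {"value", "source"}
    conflicts: list = field(default_factory=list)  # ledger of contradictions

    def record(self, key: str, value: str, source: str) -> None:
        incoming = {"value": value, "source": source}
        existing = self.facts.get(key)
        if existing is None:
            self.facts[key] = incoming
        elif existing["value"] != value:
            # Contradiction: keep the stored fact and log both sides for review.
            self.conflicts.append(
                {"key": key, "stored": dict(existing), "incoming": incoming, "status": "OPEN"}
            )
        # An identical value from another source is not a conflict; nothing to do.
```

Recording `("deadline", "June 1")` and then `("deadline", "June 15")` leaves the first value in `facts` and places both versions in `conflicts`, where a later decision can resolve them explicitly.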

---

## Quick Start

### Tier 1 — Simple (no install)

Best for Claude Projects, ChatGPT, or any chat interface.

1. Open a new project or conversation.
2. Add [`INIT_UNIVERSAL_RUNTIME_KERNEL_v3.2.2.md`](INIT_UNIVERSAL_RUNTIME_KERNEL_v3.2.2.md) to the session as a file (it's a full specification, ~100 KB — it goes into a **project/session**, not the short system-prompt field).
3. Send: **"Initialize the project."** The LLM self-bootstraps, scans your folder if it has file access, and builds the `RAG/` state.
4. On ChatGPT / GPT Web without file tools: download the generated RAG files at session end and re-upload them at the start of each session to restore state.

That's it — no Python, no dependencies.

### Tier 2 — Enforced (Python runtime)

Best for long-lived, multi-session, token-critical projects where you want hard guarantees.

**1. Copy the runtime into your project:**

```bash
git clone https://github.com/arcadamarket/rag-runtime-kernel.git temp-clone
cp -r temp-clone/rag_kernel YOUR_PROJECT/rag_kernel
rm -rf temp-clone
```

**PowerShell / CMD equivalents:**

```powershell
# PowerShell
git clone https://github.com/arcadamarket/rag-runtime-kernel.git temp-clone
Copy-Item -Recurse temp-clone\rag_kernel YOUR_PROJECT\rag_kernel
Remove-Item -Recurse -Force temp-clone
```

```cmd
:: CMD
git clone https://github.com/arcadamarket/rag-runtime-kernel.git temp-clone
xcopy temp-clone\rag_kernel YOUR_PROJECT\rag_kernel\ /E /I
rmdir /s /q temp-clone
```

**2. Bootstrap the RAG deterministically (zero LLM tokens):**

```bash
# assumes the spec file has been copied into RAG/ (step 1 copies only rag_kernel/)
python -m rag_kernel init --spec RAG/INIT_UNIVERSAL_RUNTIME_KERNEL_v3.2.2.md --output RAG/
# optional: merge project-specific context
python -m rag_kernel configure --rag RAG/RAG_MASTER.json --context your_context.json
```

**3. Run the kernel as a server:**

```bash
python -m rag_kernel mcp --project /path/to/your/RAG # MCP mode (Claude Desktop)
python -m rag_kernel serve --project /path/to/your/RAG --port 7437 # HTTP mode (GPT Custom Actions / any client)
```

Every state mutation now flows through the kernel's proposal → validation → commit pipeline. Full platform-specific setup: [`docs/LAUNCH_MANUAL.md`](docs/LAUNCH_MANUAL.md).

> Works for **both new projects and existing ones**. On an existing project, the boot scan inventories your files, classifies them by tier, and extracts knowledge into COLD storage — your prior work becomes queryable, trackable, and persistent.

---

## What's Actually Proven

This section states only what is measured or formally verified — no marketing percentages.

**Determinism (Tier 2):**

- **1,235 / 1,235 unit tests passing** (runtime v0.4.7) across 19 runtime modules (state machine with TLA+-enforced transition guards, persistence/WAL, COLD manager, concurrency, conflict engine, schemas, HTTP API, MCP transport, spec parser, session logger, generated guards, guard generator, context-truncation policy, graph orchestrator, agent/session supervisor, and the DRIFT-ELIM project-state layer: item-lifecycle core, atomic mutation store, deterministic renders, and the fail-loud session auditor).
- **TLA+ formal verification:** the TLC model checker exhaustively explored **389,522 states (168,520 distinct)** to depth 19 and confirmed **8 safety invariants + 3 liveness properties with zero violations**. The TLA+ spec is a 1:1 transcription of the Python state machine. Two genuine liveness bugs were found and fixed during verification.
- **The verified model is now mechanically enforced at runtime (FV-PHASE4):** the state machine's transition table is *generated* from the TLA+ model and legality is checked through the generated predicate — the runtime can no longer drift from what TLC proved. A `guardgen --check` gate detects any model/code divergence.
- Unit tests prove "these 1,235 scenarios work." TLA+ proves "no reachable state can violate the invariants, and the system always makes progress." The second is a strictly stronger guarantee.

**Token economy (Tier 2):**

- **Bootstrap: 0 LLM tokens.** `rag_kernel init` parses the ~100 KB / ~20K-token specification in Python. No model is involved.
- **State operations: 0 LLM reasoning tokens.** Validation, atomic writes, WAL, checkpointing, COLD partitioning, and crash recovery all execute as code. The model's only job is to *propose*; it never spends tokens managing or re-deriving state.
- **Lean active context.** HOT memory holds only live state (on the order of ~15 KB); archival data is loaded on demand rather than carried in every prompt.
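
The HOT/COLD split can be pictured as eager loading for live state and lazy loading for archives. The sketch below assumes a hypothetical layout (`hot.json` plus a `cold/` folder of JSON partitions) and a hypothetical `TieredMemory` class; the kernel's actual file names and partitioning rules are defined elsewhere.

```python
import json
import tempfile
from pathlib import Path


class TieredMemory:
    def __init__(self, root: Path):
        self.root = root
        # HOT state is small and always loaded.
        self.hot = json.loads((root / "hot.json").read_text())
        self._cold_cache = {}
        self.cold_loads = 0  # counts actual disk reads of COLD partitions

    def cold(self, partition: str) -> dict:
        """Load a COLD partition only when it is first asked for."""
        if partition not in self._cold_cache:
            path = self.root / "cold" / f"{partition}.json"
            self._cold_cache[partition] = json.loads(path.read_text())
            self.cold_loads += 1
        return self._cold_cache[partition]

    def prompt_context(self) -> str:
        """Only HOT state is serialized into the model's context."""
        return json.dumps(self.hot, sort_keys=True)


def demo() -> TieredMemory:
    # Build a throwaway project folder so the sketch is runnable anywhere.
    root = Path(tempfile.mkdtemp())
    (root / "cold").mkdir()
    (root / "hot.json").write_text(json.dumps({"open_tasks": ["ship v1"]}))
    (root / "cold" / "sessions.json").write_text(json.dumps({"S1": "kickoff notes"}))
    return TieredMemory(root)
```

With `mem = demo()`, `mem.prompt_context()` contains only the open tasks; the session archive is read from disk once, on the first `mem.cold("sessions")` call.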

We deliberately do **not** publish a single headline "X% token savings" number — the honest claim is structural: the entire state-management layer is removed from the model's token budget. Your actual savings depend on your project size and platform.

---

## How It Compares

A positioning comparison, not a controlled benchmark. Full notes: [`docs/benchmark_comparison.md`](docs/benchmark_comparison.md).

| Capability | RAG Runtime Kernel | Claude Code | lean-ctx | LLM Wiki |
|---|---|---|---|---|
| **Cross-session memory** | Full: HOT/COLD + WAL + crash recovery | Partial: CLAUDE.md + auto-memory, no crash recovery | None (compresses I/O, doesn't persist state) | Pattern only |
| **Deterministic state machine** | Yes — formally verified (TLA+), 1,235 tests | No | No | No |
| **Where state work runs** | Off the LLM, in Python (Tier 2) | In-session, model-mediated | N/A — I/O compression layer | In the LLM / external tooling |
| **Token approach** | State ops cost **0 LLM tokens**; lean HOT boot | Grows without curation | **60–99% raw I/O compression (best in class)** | Depends on wiki quality |
| **Cross-platform** | Claude + GPT + any LLM, one spec | Claude Code CLI only | Editor-focused | Platform-agnostic pattern |
| **Dependencies** | Tier 1: none. Tier 2: Python only | Node.js + CLI | Rust binary | Varies |
| **Crash recovery** | WAL replay + .bak rotation + RECOVERY state | File-history checkpoints | N/A | None |
| **Conflict tracking** | Explicit ledger — both sources preserved | None | N/A | None |

**Honest take:** if raw token compression is your only goal, **lean-ctx wins** — it's purpose-built for that and pairs cleanly with this kernel (lean-ctx compresses the I/O layer; the kernel manages the state layer). Where this project is genuinely distinct is the combination of a **formally-verified deterministic state machine, atomic persistence with crash recovery, a conflict ledger, and one spec that runs across Claude and GPT** — no other system in this list offers that set.

---

## What This Is

A **specification** plus an optional **runtime that enforces it**. Together they turn any LLM into a controlled, auditable agent with persistent project memory. Three layers sit on top of the filesystem:

```
LLM (reasoning engine)
| JSON proposals
Policy Layer (the specification)
| validated transitions
Runtime Kernel (state + persistence) <- enforced by Python in Tier 2
| atomic writes
Filesystem (source of truth)
```

In Tier 1 the LLM plays the role of the runtime by following the spec. In Tier 2 the Python kernel *is* the runtime, and the LLM can only propose.

---

## Formally Verified with TLA+

The state machine is verified using [TLA+](https://lamport.azurewebsites.net/tla/tla.html) and the TLC model checker — the same formal-methods technique [Amazon uses to verify AWS infrastructure](https://lamport.azurewebsites.net/tla/amazon-excerpt.html).

TLC exhaustively explored **389,522 states** (168,520 distinct) at depth 19 and verified all 8 safety invariants + 3 liveness properties with zero violations:

| Safety Invariant | What It Proves |
|---|---|
| TypeInvariant | All state variables hold valid types at all times |
| TransitionSafety | Every reachable state is legal per the transition graph |
| SingleWriter | At most one proposal staged at any time (no concurrent mutations) |
| WALConsistency | Write-ahead log is append-only, monotone, never lags behind state |
| TerminalSafety | CLOSING is irreversible — no exit, no crash, no pending proposals |
| NoDeadlock | Every non-terminal state has at least one enabled action |
| CrashRecoveryConsistency | Crash flag is only true when state is RECOVERY |
| WALPrecedesStateChange | WAL entry exists before any state transition commits |

| Liveness Property | What It Proves |
|---|---|
| EventualProgress | The system always eventually returns to READY from any reachable state |
| EventualTermination | CLOSING is stable — once reached, it stays (no infinite loops) |
| ProposalEventuallyResolved | A staged proposal always reaches COMMITTED, REJECTED, or NONE |

Phase 2 verification found and fixed two genuine liveness bugs: a BOOTING↔RECOVERY direct-transition loop, and a crash-at-full-WAL deadlock. The TLA+ specification (`formal/RAGKernel.tla`) maps 1:1 to the runtime code. Full results in [`formal/TLC_RESULTS.md`](formal/TLC_RESULTS.md).
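
For readers new to model checking, the following toy explorer shows what "exhaustively explored every reachable state" means for safety invariants. It is a sketch under loud assumptions: the `NEXT` transition graph is invented for illustration (the real model is `formal/RAGKernel.tla`), the state is reduced to a `(mode, crashed)` pair, and liveness properties are not covered at all.

```python
from collections import deque

# Assumed toy transition graph, loosely based on the lifecycle names in this
# README. It is NOT the verified model.
NEXT = {
    "BOOTING": ["READY", "RECOVERY"],
    "READY": ["WORKING", "CLOSING"],
    "WORKING": ["CHECKPOINTING", "RECOVERY"],
    "CHECKPOINTING": ["READY", "RECOVERY"],
    "RECOVERY": ["READY"],
    "CLOSING": [],
}


def successors(state):
    mode, _crashed = state
    for target in NEXT[mode]:
        # Entering RECOVERY models a crash; leaving it clears the flag.
        yield (target, target == "RECOVERY")


def crash_flag_consistent(state):
    mode, crashed = state
    return (not crashed) or mode == "RECOVERY"


def no_deadlock(state):
    mode, _crashed = state
    return mode == "CLOSING" or bool(NEXT[mode])


def check(initial, invariants):
    """Breadth-first search over reachable states.

    Returns (counterexample_or_None, number_of_states_discovered).
    """
    seen = {initial}
    queue = deque([initial])
    while queue:
        state = queue.popleft()
        for invariant in invariants:
            if not invariant(state):
                return state, len(seen)
        for nxt in successors(state):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return None, len(seen)
```

Calling `check(("BOOTING", False), [crash_flag_consistent, no_deadlock])` visits every reachable state of the toy graph and returns no counterexample. TLC does the same kind of search over a far larger state space and also checks temporal (liveness) properties, which this sketch does not attempt.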

---

## Core Features

**Structured Memory (HOT/COLD)** — Active state stays lean; archival data loads on demand with automatic partitioning.

**Deterministic State Machine** — `BOOTING → READY → WORKING → CHECKPOINTING → CLOSING` with a `RECOVERY` path.

**Proposal → Validation → Commit** — The LLM proposes JSON actions; the system validates against policy, then commits or rejects.

**Atomic Persistence** — All writes are atomic and hash-verified. A write-ahead log enables crash recovery.

**COLD Partitioning** — Auto-splits into sessions / inventory / conflicts / evidence with sub-partitioning and integrity-preserving chopping.

**Conflict Engine** — Auto-categorizes conflicts into 7 types, scores confidence, and auto-resolves low-risk cases; preserves both sides otherwise.

**Tool Fallback Chain** — Ordered fallback for file operations across platform tools.

**Cross-Platform** — Claude Projects, ChatGPT, Cowork, Claude Code, any LLM.

**Multi-Account Safety** — Session identity tagging, write-collision detection, anti-corruption guards.

**Full Audit Trail** — Every state transition, decision, and conflict logged.
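
The write-ahead log behind Atomic Persistence follows a simple rule: record the intent first, change state second, and replay anything unapplied at boot. The sketch below is an assumption-laden illustration. `WalStore`, the `wal.jsonl` and `state.json` file names, and the `crash_before_commit` switch are hypothetical, and a production WAL would also fsync and write the snapshot atomically.

```python
import json
import tempfile
from pathlib import Path


class WalStore:
    """Toy key/value store. Call recover() at boot before any apply()."""

    def __init__(self, directory: Path):
        self.wal_path = directory / "wal.jsonl"
        self.state_path = directory / "state.json"

    def load(self):
        """Return (data, last_applied_seq) from the snapshot, or an empty state."""
        if not self.state_path.exists():
            return {}, 0
        snapshot = json.loads(self.state_path.read_text())
        return snapshot["data"], snapshot["seq"]

    def apply(self, key, value, crash_before_commit=False):
        state, seq = self.load()
        entry = {"seq": seq + 1, "key": key, "value": value}
        # 1. Log the intent before touching state.
        with self.wal_path.open("a", encoding="utf-8") as wal:
            wal.write(json.dumps(entry) + "\n")
        if crash_before_commit:
            return  # simulate a crash between the WAL write and the commit
        # 2. Commit the change to the snapshot.
        state[key] = value
        self.state_path.write_text(json.dumps({"seq": entry["seq"], "data": state}))

    def recover(self):
        """Replay WAL entries newer than the snapshot, then save the result."""
        state, seq = self.load()
        if self.wal_path.exists():
            for line in self.wal_path.read_text().splitlines():
                entry = json.loads(line)
                if entry["seq"] > seq:
                    state[entry["key"]] = entry["value"]
                    seq = entry["seq"]
        self.state_path.write_text(json.dumps({"seq": seq, "data": state}))
        return state
```

If a write is logged but the process dies before the commit, the snapshot still shows the old value; `recover()` finds the newer WAL entry and brings the snapshot forward.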

---

## Using with Cowork

[Cowork](https://docs.claude.com) is Anthropic's desktop tool for non-developers to automate file and task management. Its direct file access lets the kernel read and write `RAG/` files with no manual copy-paste, and its task automation pairs naturally with the kernel's checkpoint and audit system. For a new project, add the specification file (`INIT_UNIVERSAL_RUNTIME_KERNEL_v3.2.2.md`) and the system bootstraps and scans your folder; for an existing one, point it at the folder during bootstrap and your work becomes tracked state.

## Using with Claude Code

[Claude Code](https://docs.claude.com) is Anthropic's CLI for agentic coding. The kernel adds context persistence across its stateless sessions, a deterministic state machine to structure long-running development, zero-token file ops via direct filesystem access, and a conflict ledger that preserves both sides when new code contradicts a prior decision. Add a `RAG/` directory, bootstrap, and it starts tracking state.

---

## Prerequisites

**Tier 1 minimum:** an LLM that supports file uploads or long-form input, plus a project folder.

**Tier 2:** Python 3.10+. [Filesystem MCP](https://github.com/modelcontextprotocol/servers) recommended for direct file read/write; a shell/PowerShell MCP is optional.

## Repository Structure

```
rag-runtime-kernel/
├── INIT_UNIVERSAL_RUNTIME_KERNEL_v3.2.2.md   # The specification (Tier 1 + Tier 2)
├── INIT_UNIVERSAL_RUNTIME_KERNEL_v3.2.0.md   # Previous spec version (archived)
├── CONTRIBUTING.md                   # How to report issues
├── CHANGELOG.md                      # Version history
├── docs/
│   ├── architecture.md               # System architecture
│   ├── benchmark_comparison.md       # Positioning vs alternatives
│   ├── design_principles.md          # Core design philosophy
│   ├── test_analysis_gpt_web.md      # GPT Web platform findings
│   ├── LAUNCH_MANUAL.md              # Full setup guide (all platforms + tiers)
│   ├── LOCAL_TESTING_GUIDE.md        # Local dev testing & GPT Custom Actions
│   ├── v3.2_ARCHITECTURE_DESIGN.md   # Runtime architecture design doc
│   └── ROADMAP.md                    # Development roadmap
├── rag_kernel/                       # Tier 2 runtime kernel (v0.4.7)
│   ├── __init__.py                   # Package entry, discover() capability registry
│   ├── __main__.py                   # CLI (init / configure / verify / health / serve / mcp / session / checkpoint / gc / graph / resolve / defer / render / note)
│   ├── api.py                        # HTTP API (FastAPI)
│   ├── state_machine.py              # Deterministic state engine
│   ├── persistence.py                # Atomic writes, WAL, hash verification
│   ├── cold_manager.py               # COLD partition manager
│   ├── concurrency.py                # Lock manager, write-collision guard
│   ├── conflict_engine.py            # Conflict auto-categorization (7 categories, auto-resolve)
│   ├── mcp_transport.py              # MCP tool interface
│   ├── schemas.py                    # Pydantic models for proposals/state
│   ├── session_logger.py             # Universal JSONL session observability
│   ├── spec_parser.py                # Deterministic MD→RAG parser (zero LLM); single self-version stamp across HOT+COLD, fail-loud on survivor (FIX-2); substitution + _-template-key strip so a fresh deploy is born clean (FIX-3, v0.4.6)
│   ├── guardgen.py                   # Deterministic TLA+ → Python guard generator (build-time)
│   ├── generated_guards.py           # Generated, runtime-enforced transition table + guards
│   ├── context_policy.py             # Kernel-enforced context-truncation policy (M-009)
│   ├── graph_orchestrator.py         # Graph Orchestrator: DAG core + execution engine (v0.4.0)
│   ├── agent_supervisor.py           # Graph Orchestrator: observable off-process worker supervisor / AgentView (v0.4.0)
│   ├── drift_control.py              # DRIFT-ELIM: item-lifecycle core (ItemStatus enum + LIFECYCLE guards + immutable TrackedItem) (v0.4.0)
│   ├── drift_store.py                # DRIFT-ELIM: atomic mutation API over tracked_items + backlog migration; lifecycle CLI (v0.4.0)
│   ├── drift_render.py               # DRIFT-ELIM: deterministic renders of open_tasks/deferred_items/backlog/ERROR_LOG from tracked_items (sole authority); render CLI (v0.4.0)
│   └── drift_audit.py                # DRIFT-ELIM: fail-loud session-boundary auditor (render parity, supersede refs, note/status, side-store scan, current_status freshness) + FIX-1 integrity invariants (WAL/.bak/COLD/placeholder/template-key/session-id) (v0.4.4)
├── tests/                            # 1,235 tests (v0.4.7 release)
├── .github/                          # FUNDING.yml + issue templates
├── formal/
│   ├── RAGKernel.tla                 # TLA+ state machine specification
│   ├── RAGKernel.cfg                 # TLC model checker configuration
│   └── TLC_RESULTS.md                # Verification results (389K states, 8 safety + 3 liveness)
├── LICENSE                           # AGPL-3.0
└── README.md
```

## Session Lifecycle

1. **BOOTING** — Load HOT, verify consistency, check WAL, probe tools.
2. **READY** — Accept tasks.
3. **WORKING / INGESTING** — Execute tasks, ingest files, extract knowledge.
4. **CHECKPOINTING** — Save atomically with backup rotation.
5. **CLOSING** — Audit findings, final save.
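
The CHECKPOINTING step ("save atomically with backup rotation") can be approximated with a temp file, an atomic rename, and a hash check. This is a sketch of the general technique, not the kernel's `persistence.py`: the `checkpoint` and `verify` helpers and the `.sha256` sidecar file are assumptions, and the sidecar itself is not written atomically here.

```python
import hashlib
import json
import os
import shutil
import tempfile
from pathlib import Path


def checkpoint(path: Path, state: dict) -> str:
    """Write state to path atomically, keeping the previous file as .bak."""
    payload = json.dumps(state, sort_keys=True).encode("utf-8")
    digest = hashlib.sha256(payload).hexdigest()
    if path.exists():
        # Backup rotation: copy (not move) so the live file never disappears.
        shutil.copy2(path, path.with_name(path.name + ".bak"))
    # Write to a temp file in the same directory, then rename over the target.
    fd, tmp_name = tempfile.mkstemp(dir=path.parent, suffix=".tmp")
    with os.fdopen(fd, "wb") as tmp:
        tmp.write(payload)
        tmp.flush()
        os.fsync(tmp.fileno())
    os.replace(tmp_name, path)  # atomic when source and target share a filesystem
    path.with_name(path.name + ".sha256").write_text(digest)
    return digest


def verify(path: Path) -> bool:
    """Recompute the hash and compare it with the recorded one."""
    expected = path.with_name(path.name + ".sha256").read_text()
    return hashlib.sha256(path.read_bytes()).hexdigest() == expected
```

A reader that sees `verify()` fail at boot has a clear signal to fall back to the `.bak` copy or to WAL replay instead of trusting a half-written file.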

## Disclaimer & Known Limitations

- **Tier 1 is self-enforced** — the LLM follows the spec by instruction, not by hard runtime constraints. For hard guarantees, use Tier 2.
- **Persistence depends on platform** — full atomic writes with file/MCP access; manual file management on GPT Web (no atomic writes, no real token counter).
- **Context window ceiling** — the spec is ~100 KB / ~20K tokens; in Tier 1 it occupies the window, so very large projects may hit limits. Tier 2 keeps the spec out of the model via deterministic parsing.
- **Single-writer** — concurrent writes are detected and halted, not auto-merged.
- **Not a database** — this is structured file-based memory, not a production database replacement.
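
The single-writer limitation ("detected and halted, not auto-merged") is commonly implemented with an exclusive lock file. The sketch below shows that general pattern only; `SingleWriterLock`, `WriteCollision`, and the lock-file name are hypothetical, and the kernel's own `concurrency.py` may work differently. A stale lock left behind by a crashed process would need separate handling.

```python
import os
import tempfile
from pathlib import Path


class WriteCollision(RuntimeError):
    """Raised when another session already holds the write lock."""


class SingleWriterLock:
    def __init__(self, lock_path: Path, session_id: str):
        self.lock_path = lock_path
        self.session_id = session_id

    def __enter__(self):
        try:
            # O_EXCL makes creation fail if the file already exists.
            fd = os.open(self.lock_path, os.O_CREAT | os.O_EXCL | os.O_WRONLY)
        except FileExistsError:
            holder = self.lock_path.read_text() or "unknown"
            raise WriteCollision(f"state is locked by session {holder}") from None
        with os.fdopen(fd, "w") as handle:
            handle.write(self.session_id)
        return self

    def __exit__(self, *exc_info):
        self.lock_path.unlink()
        return False  # never swallow exceptions from the protected block
```

A second session that tries to enter the lock while the first holds it gets a `WriteCollision` naming the holder, and nothing is written or merged.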

See [`docs/test_analysis_gpt_web.md`](docs/test_analysis_gpt_web.md) for platform-specific findings.

## Roadmap

See [`docs/ROADMAP.md`](docs/ROADMAP.md) for the complete roadmap.

| Line | Version | Status | Focus |
|---|---|---|---|
| Spec | **v3.2.2** | Released | **ENV-NORM** — §3a tool hierarchy rewritten to **tmux-mcp primary** for shell/git/test (composed commands verbatim, no orphan `1`); `wsl-exec` demoted to atomic fallback; new `session_start_shell_rule`; §3 `doctor` boot preflight. Prior v3.2.1: known-issues registry reconciled to 12 universal keys (INS-044 fetch-to-disk), §37 fetch/VCS/shell tooling enumeration + `audit-env` (INS-045), §31 Step 0 environment audit (INS-043). |
| Runtime | **v0.4.7** | Released | **FIX-4 — parity-mirror `.bak` contract (K6)** from the eBay Session-Zero deploy audit: settles + enforces the `.bak` semantics FIX-1 left ambiguous (where the eBay backup sat 3 checkpoints stale, HOT seq 3 / `.bak` seq 0). The `.bak` is now a **byte-identical parity-mirror** of the last committed HOT, refreshed via an opt-in `mirror_bak=True` on the canonical writers (full checkpoint/close, `drift_store`, `drift_render`); the generic write path keeps its prior-file crash backup. `check_bak_parity` now asserts true byte-parity (the rollback-prev one-behind allowance is gone). `DRIFT_AUDIT_VERSION` → 1.3.0. No new module (19), health 20/20, 1,235 tests. |
| Runtime | **v0.4.6** | Released | **FIX-3 — init/configure build-time hygiene (K3+K5+K7)** from the eBay Session-Zero deploy audit: `spec_parser` substitutes the build-deterministic `` placeholder and strips `_`-prefixed `:template` keys from `operating_protocol` so a fresh deploy is born clean, and `KernelApp` mints a canonical `S` session id (not `S-{pid}-…`) and stamps `meta.written_by_session` on every checkpoint — preventing at build the defects FIX-1 could only detect. Dogfooded: `init --spec v3.2.2` went from 3 audit findings to 0. No new module (19), health 20/20, 1,219 tests. |
| Runtime | **v0.4.5** | Released | **FIX-2 — single self-version token + `verify` gate (K4+K8)** from the eBay Session-Zero deploy audit: the spec's HOT/COLD templates now carry one `` token that `spec_parser` substitutes and stamps into the COLD `init_prompt_reference` from the spec's own version — root-causing the COLD↔HOT version drift FIX-1 could only detect. New deterministic `rag_kernel verify` post-init coherence gate; `init` fails loud on any unsubstituted token; SESSION_ZERO verify gate rewritten off the file-size heuristic onto `verify`/`audit`. No new module (19), health 20/20, 1,202 tests. |
| Runtime | **v0.4.4** | Released | **FIX-1 — integrity auditor + WAL hardening (K1+K2)** from the eBay Session-Zero deploy audit: seven fail-loud integrity invariants (WAL monotonicity, RAG↔.bak parity, COLD↔HOT spec-version, unsubstituted-placeholder, leaked-template-key, non-empty `written_by_session`, session-id coherence) + a `health` WAL-replay self-test. Dogfooded live — caught a real latent COLD↔HOT drift in this repo's own RAG. No new module (19), health 20/20, 1,180 tests. |
| Runtime | **v0.4.3** | Released | **AUDIT-CS-FRESHNESS** — `audit` now guards the `current_status` narrative against the live runtime version + git HEAD (E-043), failing loud on a stale snapshot; new `audit --git-head` flag with best-effort auto-resolution. No new module (19), health 20/20, 1,159 tests. |
| Runtime | **v0.4.2** | Released | **ENV-NORM** — `doctor` preflight (env summary, fail-closed stale-`.git/index.lock` `--fix`, shell-policy first-move, `--emit-runner`) + guarded `add` verb (closes the no-ADD-verb gap), paired with spec v3.2.2 tmux-primary tool hierarchy. No new module (19), health 20/20, 1,142 tests. |
| Runtime | **v0.4.1** | Released | Kernel hardening from the eBay S0 deployment audit: `audit-env` fetch/VCS/shell tooling enumeration (INS-045) + `init` fail-loud on missing `--spec` (INS-046), bundling DRIFT-ELIM inc 6. No new module (19), health 20/20, 1,123 tests. |
| Runtime | **v0.4.0** | Released | **Graph Orchestrator**: DAG execution, dependency tracking, deterministic-levels + OS-process parallel scheduling, checkpoint-per-node, transactional rollback, and an observable agent/session supervisor; runtime-wired via `KernelApp.run_graph`, CLI `rag_kernel graph run`, and MCP `rag_graph_run`. **DRIFT-ELIM** (deterministic project-state layer): item-lifecycle core, atomic mutation API over `tracked_items` + backlog migration, the `rag_kernel resolve\|defer\|…` lifecycle CLI, deterministic **renders** making `tracked_items` the sole authority (legacy `open_tasks`/`deferred_items`/backlog become projections via `rag_kernel render`), and a fail-loud session auditor that asserts render == canonical. 19 modules, health 20/20, 1,082 tests. |
| Runtime | **v0.3.0** | Released | 13 modules, 758 tests. TLA+ guards **enforced** at runtime (FV-PHASE3/4): transition table generated from the model, `guardgen`/`generated_guards` registered; **M-009** kernel-enforced context-truncation policy (per-region token accounting, deterministic eviction, HOT never evicted, checkpoint/evict/halt). |
| Runtime | **v0.2.7** | Released | 12 modules, 676 tests. Graduated POV, delta checkpoints, conflict auto-categorization engine, session logger, session/checkpoint/gc CLI, spec enforcement. |
| Runtime | **v0.2.0** | Released | Zero-touch bootstrap (`rag_kernel init`), capability self-discovery (`discover()`), project configuration (`rag_kernel configure`). |

## Reporting Issues

Found a bug? Please [open an issue](../../issues/new/choose) using the provided templates. See [`CONTRIBUTING.md`](CONTRIBUTING.md).

## Support

**Developer:** Artem Pakhol
**LinkedIn:** [linkedin.com/in/pakhol](https://www.linkedin.com/in/pakhol)

## License

Licensed under the [GNU Affero General Public License v3.0](https://www.gnu.org/licenses/agpl-3.0.html) — see [LICENSE](LICENSE).

**What this means:** you may use, modify, and distribute this software, but any modified version you deploy (including as a network service) must also be released under AGPL-3.0 with attribution to the original project.
