https://github.com/ntholm86/ai-steward

Autonomous code improvement loop - harness-captured LLM evidence, operator-held Commander's Intent, CI-enforced. Reference implementation of Principles of Earned Autonomy.
https://github.com/ntholm86/ai-steward

ai-agents anthropic audit-trail autonomous-agents autonomous-coding claude code-improvement commanders-intent harness-protocol improvement-loop llm-agents observable-autonomy principles-of-earned-autonomy python self-improving

Last synced: 9 days ago
JSON representation

Autonomous code improvement loop - harness-captured LLM evidence, operator-held Commander's Intent, CI-enforced. Reference implementation of Principles of Earned Autonomy.

Host: GitHub
URL: https://github.com/ntholm86/ai-steward
Owner: ntholm86
License: mit
Created: 2026-06-20T11:39:09.000Z (12 days ago)
Default Branch: main
Last Pushed: 2026-06-22T06:48:02.000Z (10 days ago)
Last Synced: 2026-06-22T08:10:39.206Z (10 days ago)
Topics: ai-agents, anthropic, audit-trail, autonomous-agents, autonomous-coding, claude, code-improvement, commanders-intent, harness-protocol, improvement-loop, llm-agents, observable-autonomy, principles-of-earned-autonomy, python, self-improving
Language: Python
Homepage: https://github.com/ntholm86/principles-of-earned-autonomy
Size: 1.54 MB
Stars: 0
Watchers: 0
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# ai-steward

An autonomous code improvement loop with structural accountability.

**Purpose 1 — Proof:** A reference implementation of [Principles of Earned Autonomy](https://github.com/ntholm86/manifesto). Demonstrates that autonomous delegation can be structurally trustworthy — not through promises, but through Observable Autonomy (harness-captured LLM evidence), Commander's Intent (operator-written destination), and Convergence Is Silence (stop when done, not when tired).

**Purpose 2 — Tool:** Genuinely useful. Write a destination → run the loop → review the staged diff → commit or discard. Works on any codebase. Cost is measured, not claimed (~$0.018/cycle on claude-haiku-4-5).

## How it works

```
┌──────────────────────────────────┐
│ OPERATOR │
│ .acm/destination.md │ ← Commander's Intent: what + why
│ Reviews staged diffs │
│ Commits or discards │
└──────────────┬───────────────────┘
│
▼
┌──────────────────────────────────┐
│ PIPELINE │
│ PRE-FLIGHT → SCAN → IMPLEMENT │
│ → VERIFY → RECORD │
│ │
│ • PRE-FLIGHT: git clean, tests │ ← zero LLM tokens
│ • SCAN: read files + dest → LLM │ ← one cheap model call
│ • IMPLEMENT: apply change → LLM │ ← one cheap model call
│ • VERIFY: syntax, size, tests │ ← zero LLM tokens
│ • RECORD: append trail entry │ ← zero LLM tokens
└──────────────┬───────────────────┘
│
▼
┌──────────────────────────────────┐
│ HARNESS (localhost:8474) │
│ Intercepts all LLM API calls │
│ Writes .acm/sessions/*.jsonl │ ← BEFORE response is processed
│ Hash-chained, tamper-evident │ ← agent cannot fabricate evidence
└──────────────────────────────────┘
```

Every cycle produces two records in the target repo's `.acm/` directory:
- `audit-trail.md` — the agent's reasoning (what it proposed and why)
- `sessions/.jsonl` — the harness-captured LLM evidence (independent, unmodifiable)

The agent cannot claim it did something it did not do.

## Quickstart

**Prerequisites:** Python 3.12+, [llm-harness-proxy](https://github.com/ntholm86/llm-harness-proxy) running on `localhost:8474`, `ANTHROPIC_API_KEY` set.

```bash
pip install -e ".[dev]"
```

**In your target repo**, create `.ai-steward.yaml`:

```yaml
models:
analyze: claude-haiku-4-5
propose: claude-haiku-4-5
implement: claude-haiku-4-5
verify: claude-haiku-4-5
judge: claude-haiku-4-5
```

And `.acm/destination.md` — write what you want the codebase to become and why:

```markdown
# Destination — my-project

What this is for and what a good improvement looks like.
The agent reads this before every SCAN. Be honest about trade-offs.
```

**Run:**

```bash
ai-steward run /path/to/your/repo
```

The loop proposes one change, applies it, verifies tests pass, and stages the diff. You inspect and commit or discard.

## V1 status

- Self-targeting proven: ai-steward runs against its own repository
- Observable Autonomy structural: harness sessions co-located with trail in `.acm/`
- Commander's Intent structural: SCAN reads `.acm/destination.md` before every proposal
- 66 tests, mypy-clean, CI on GitHub Actions
- Cost baseline: ~$0.018/cycle (SCAN + IMPLEMENT, claude-haiku-4-5)

**V2 (not yet implemented):** model-family independence (proposer and verifier from different families), configurable verify commands (for non-Python repos), multi-cycle convergence tracking.

## Observable Autonomy

The harness proxy writes the session ledger **before** the LLM response is processed. The agent cannot selectively omit calls, retroactively modify evidence, or claim a reasoning path it did not take. The trail entry (agent-authored) and the JSONL session (harness-captured) are co-located but distinct — two trust levels, one directory.

```
.acm/
audit-trail.md ← agent's claim about what happened
sessions/
01KV....jsonl ← independent proof of what the LLM was asked and answered
```

## Development

```bash
pip install -e ".[dev]"
pytest # 66 tests
mypy src/ # type-check
```

CI runs both on push via `.github/workflows/ci.yml`.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/ntholm86/ai-steward

Awesome Lists containing this project

README