https://github.com/410979729/proofrail-openclaw

Execution harness for OpenClaw agents: evidence-first changes, verification gates, dangerous command controls, and codex-harness / claude-code-harness discoverability.
https://github.com/410979729/proofrail-openclaw

agent-harness claude-code-harness codex-harness evidence-first execution-harness openclaw proofrail runtime-guardrails

Last synced: 2 days ago
JSON representation

Execution harness for OpenClaw agents: evidence-first changes, verification gates, dangerous command controls, and codex-harness / claude-code-harness discoverability.

Host: GitHub
URL: https://github.com/410979729/proofrail-openclaw
Owner: 410979729
License: mit
Created: 2026-05-27T13:27:59.000Z (5 days ago)
Default Branch: main
Last Pushed: 2026-05-27T14:51:18.000Z (5 days ago)
Last Synced: 2026-05-27T15:26:28.375Z (5 days ago)
Topics: agent-harness, claude-code-harness, codex-harness, evidence-first, execution-harness, openclaw, proofrail, runtime-guardrails
Language: TypeScript
Size: 86.9 KB
Stars: 1
Watchers: 0
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Security: SECURITY.md

Awesome Lists containing this project

README

          # Proofrail for OpenClaw

An execution harness for OpenClaw agents. Evidence gates, post-change verification, risk controls — making agents work in a disciplined, reviewable way.

It focuses on the runtime behaviors that matter in practice:

- evidence before mutation

- mutation before verification

- repeated low-signal probe blocking

- dangerous command approval or blocking

- large output summarization

- per-session compaction anchors

- session-scoped task ledger

- post-mutation validation suggestions

- JSONL audit trails for guardrail decisions

## Why Proofrail exists

Prompt quality matters, but workflow control matters just as much. Proofrail is not a prompt pack. It is a control-and-correction plugin that sits around tool use and enforces a tighter engineering loop:

- no evidence, no mutation

- after changes, validate before continuing

- risky actions are blocked by default; approval mode is opt-in when the host has a working plugin approval route

This is what turns “just answer” behavior into “inspect, act, verify, recover” behavior. The goal is simple: less blind mutation, less state drift, and more verifiable work.

## Release Status

This tree is prepared for release: `v0.0.2`.

Notable `v0.0.2` scope includes:

- broadened read-result success detection for plain-text and text-bearing object outputs

- readback of the touched target now clears pending verification instead of deadlocking the session

- stronger blocked-turn anti-bypass guidance and same-target evidence gating

- expanded runtime smoke coverage for readback validation regressions

The current scope is intentionally OpenClaw-first:

- make the OpenClaw plugin production-grade first

- keep module boundaries clean enough to extract reusable core later

- keep host-specific integration narrow so future variants can land cleanly

## Naming

The public product name is **Proofrail**.

The OpenClaw variant is published as:

- package name: `proofrail-openclaw`

- plugin id: `proofrail`

- release title: **Proofrail for OpenClaw**

In OpenClaw config, use `plugins.entries.proofrail.config`.

This naming keeps the Proofrail brand consistent across docs, config, runtime artifacts, and future host-specific variants.

## Configuration

Plugin-specific settings are read from `plugins.entries.proofrail.config`.

Config surface exposed in `openclaw.plugin.json`:

- `dangerousCommandAction`: `block` or `approve` (default: `block`)

- `summaryThresholdChars`: summarization threshold for oversized tool output

- `lowSignalBlockThreshold`: repeated low-signal probe threshold

Hook decisions prefer the live per-handler config injected by OpenClaw (`event.context.pluginConfig`) and fall back to `api.pluginConfig` when the hook payload does not carry context.

Host-global state such as `tools.exec.security` still comes from the runtime config snapshot.

## Installation

From a local working tree:

```bash

openclaw plugins install 

```

From a packed tarball:

```bash

npm pack

openclaw plugins install npm-pack:./proofrail-openclaw-0.0.2.tgz

```

After install, enable the plugin if needed and restart the serving Gateway before expecting hook behavior to change.

## Runtime Model

Current runtime state intentionally supports only `observe`, `execute`, and `review`.

`plan` and `wait_user` remain reserved contract states for future workflow tools.

## Design Goals

1. Keep deterministic guardrails in runtime hooks, not fragile prompt prose.

2. Keep the plugin modular so future `plan`, `task`, and `ask-user` features do not collapse into one file.

3. Keep host-specific code narrow so shared abstractions can be extracted later.

4. Keep runtime artifacts out of source control.

5. Keep the public release tree clean: typed, packable, and reviewer-friendly.

## Module Layout

- `index.ts`: plugin entry only

- `lib/register-hooks.ts`: host-facing hook wiring and orchestration

- `lib/constants.ts`: policy constants and appended behavior rules

- `lib/types.ts`: local contracts for hook-facing code

- `lib/text.ts`: pure text extraction and normalization helpers

- `lib/path.ts`: file/path hint helpers

- `lib/tooling.ts`: shared runtime configuration and normalization helpers

- `lib/tool-normalize.ts`: canonical tool-name and category normalization

- `lib/command-risk.ts`: shell command risk classification, mutating exec detection, and validation exec detection

- `lib/evidence-policy.ts`: evidence, low-signal, intent, and mutation labeling policies

- `lib/result-status.ts`: normalized tool result success/failure detection

- `lib/session-state.ts`: in-memory session state lifecycle

- `lib/validation.ts`: changed-path derivation and narrow validation suggestions

- `lib/task-ledger.ts`: session task context, close summaries, and final review checklist

- `lib/audit.ts`: best-effort JSONL audit trail

- `lib/compaction.ts`: compaction snapshot persistence

- `lib/workflow/contracts.ts`: forward-looking workflow interfaces for tasks, plans, ask-user, and policy decisions

- `schemas/`: forward-looking contracts for planned workflow primitives

- `evals/`: reviewer-facing evaluation stubs for safety, workflow, and recovery

- `docs/`: maintainer-facing architecture notes

## Runtime Artifacts

The plugin writes runtime artifacts under the OpenClaw state directory:

- compaction snapshots under `state/plugins/proofrail/sessions//last-compaction-snapshot.json`

- JSONL audit entries under `state/plugins/proofrail/audit.jsonl`

If the host runtime does not expose `api.runtime.state.resolveStateDir()`, Proofrail falls back to a local `.proofrail/` directory under the plugin root for best-effort development smoke tests.

These paths are runtime artifacts and must stay out of version control.

## Planned Next Steps

Phase 1, still OpenClaw-only:

- add durable task records

- add plan mode primitives

- add structured ask-user state

- add worktree isolation policy

- add broader eval coverage for safety and recovery paths

Phase 2:

- extract shared runtime core from stable OpenClaw modules

- add additional host integrations on top of the stable core

## Non-Goals

- copying proprietary prompts or closed-system internals

- implying official affiliation with OpenClaw, any model vendor, or any provider

- mixing local instance identity, credentials, or private ops policy into the plugin

## Release Verification

Before publishing a release bundle, run from a clean checkout:

```bash

npm install

npm run typecheck

npm run smoke

npm pack --dry-run

```

Release notes for this tree live in `CHANGELOG.md`.

## Review Bundle Expectations

A reviewable bundle should include:

- source files and maintainer docs

- a deterministic smoke test for hook behavior

- package metadata for OpenClaw plugin discovery

- no machine-specific absolute paths or private deployment notes

- no dead repository links before the public repo exists

- no mixed-language user-facing runtime text

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/410979729/proofrail-openclaw

Awesome Lists containing this project

README