https://github.com/byte271/aosp

AOSP: AI-Operable Software Protocol — Shadow-first execution, temporal branching, and tamper-evident audit for AI actions
https://github.com/byte271/aosp
Last synced: 29 days ago
JSON representation
AOSP: AI-Operable Software Protocol — Shadow-first execution, temporal branching, and tamper-evident audit for AI actions
Host: GitHub
URL: https://github.com/byte271/aosp
Owner: byte271
License: mit
Created: 2026-03-20T18:15:51.000Z (3 months ago)
Default Branch: main
Last Pushed: 2026-03-20T19:31:58.000Z (3 months ago)
Last Synced: 2026-04-03T18:53:09.938Z (3 months ago)
Language: TypeScript
Homepage: https://aosp-cli.pages.dev
Size: 111 KB
Stars: 0
Watchers: 0
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project

README

          # AOSP — AI-Operable Software Protocol

A TypeScript framework for controlled AI execution. AOSP wraps every AI-initiated action in a structured lifecycle: **shadow-execute first, verify the diff, then commit to reality** — with full audit trails, policy enforcement, and rollback capability.

## What It Does

When an AI agent wants to take an action (write a file, call an API, modify a database), AOSP interposes a safety layer:

1. **Shadow execution** — run the action in a sandbox, capture predicted side effects

2. **Policy check** — evaluate against declarative rules (DSL) and risk thresholds

3. **Approval gate** — route high-risk actions for human review

4. **Commit** — apply the change to the real system

5. **Trace** — record everything in a tamper-evident Merkle chain

This is the core loop. On top of it, AOSP adds **temporal branching** — the ability to fork execution into multiple parallel timelines, score all outcomes, and only commit the winner.

## Core Modules

### Kernel

The orchestrator. Wires together all modules and drives the 10-step execution lifecycle (identity validation, state fetch, risk assessment, policy evaluation, blast radius analysis, shadow execution, approval, commit, state update, trace recording).

### Safety (`safe/`)

- **PolicyEngine** — programmatic policy rules with risk-level gating

- **PolicyDSL** — declarative rules like `DENY WHEN action MATCHES "delete_*"`

- **ApprovalGateway** — human-in-the-loop approval for high-risk actions

- **BlastRadiusAnalyzer** — graph-based impact analysis with transitive dependency tracking

### Execution (`exec/`)

- **ShadowExecutor** — preview actions without real side effects via pluggable adapters

- **TransactionCoordinator** — atomic multi-step execution with ordered rollback

- **PipelineEngine** — multi-stage pipelines with conditional branching and parallel stages

- **TemporalBranchEngine** — parallel timeline exploration (see below)

### Identity (`id/`)

- **IdentityManager** — actor registration, trust levels, scope-based authorization

### Trace & Audit (`trace/`)

- **TraceStore** — structured event recording with query, export, and statistics

- **MerkleChain** — append-only hash chain for tamper-evident audit trails with per-entry proofs

### Observability (`info/`)

- **StateManager** — surface state snapshots with history and diffing

- **DriftDetector** — detect unauthorized out-of-band changes by comparing expected vs. actual state

### Recovery (`magic/`)

- **MagicRecovery** — automatic recovery strategies for failed operations

### API (`api/`)

- **ApiLayer** — JSON message transport for remote AOSP operations

- **HttpServer** — HTTP server wrapping the API layer

### Adapters (`adapters/`)

- **FilesystemAdapter** — reference implementation for local filesystem operations (read, write, delete, list, mkdir) with shadow support and rollback

### Persistence (`persist/`)

- **PersistenceStore** — minimal, dependency-free key/value contract for durable audit state

- **InMemoryPersistenceStore** — volatile backend for tests and ephemeral runs

- **JsonFilePersistenceStore** — one JSON document per key with atomic writes (temp file + rename) so a crash mid-write never corrupts an existing snapshot

The `TraceStore` can `snapshot()` / `restore()` its full state (traces, event log, and Merkle chain), and `persist()` / `hydrate()` against any `PersistenceStore` — so the tamper-evident audit trail survives process restarts.

### Compliance (`compliance/`)

- **NistComplianceProfile** — maps AOSP's runtime safety mechanisms onto the four functions of the **NIST AI RMF 1.0** (Govern / Map / Measure / Manage), grades each control as satisfied / partial / unsatisfied, scores coverage, and emits an exportable JSON report with a prioritized gap list

- **gatherEvidence** — derives a compliance evidence snapshot directly from a live AOSP instance

### Bridge (`bridge/`)

- **A2ASafetyBridge** — the trust layer for the Agent-to-Agent (A2A) protocol. A2A standardizes how agents *communicate*; the bridge decides whether a delegation should be *trusted*. It attributes a trust level to the originating agent (never honoring self-claimed trust by default) and routes the requested action through the full kernel lifecycle (shadow → policy → approval → commit) before any real effect occurs. *"A2A handles communication. AOSP handles trust."*

## Temporal Branching

The standout feature. Given an intent (e.g., "set up a project"), AOSP can:

- **Fork** the intent into multiple competing strategies (minimal, standard, full-scaffold, etc.)

- **Shadow-execute** all strategies in parallel, capturing predicted diffs for each

- **Score** each timeline across 6 weighted dimensions: confidence, safety, minimality, completeness, speed, and reversibility — with auto-normalizing weights

- **Rank** timelines with risk-aware tiebreaking (score → risk level → reversibility)

- **Commit** only the winning timeline to reality

- **Prune** losing timelines (with cascade to child branches)

Additional temporal features:

- **Reality Merge Requests** — PR-style review objects with quorum-based approval and auto-expiry

- **Pairwise Timeline Diffs** — side-by-side comparison of any two timelines with per-dimension breakdown

- **Counterfactual Analysis** — "what if we'd picked the other timeline?" with avoided risks and sacrificed benefits

- **Prevented Futures Report** — tracks timelines that were blocked (high risk, low confidence, budget exceeded, policy denied)

- **Non-Selection Proofs** — SHA-256 hashed evidence of why each losing timeline lost, chained via Merkle-style hashing

- **Branch Budgeting** — limits on timeline count, fork depth, total intents, and wall-clock time

- **4 Pruning Strategies** — `score_threshold`, `confidence_decay`, `risk_ceiling`, `diminishing_returns` (independently configurable)

- **Commit-Time Revalidation** — re-checks reality drift before merging

- **Sandbox Mode** — explore alternate futures with no real mutations

- **Future Search Engine** — query/filter timelines by score, risk, confidence, phase, name, surface

- **Custom Safety Gate Checks** — register dynamic pre-merge validation functions

- **Action Diffs** — structured per-action diff with content preview and size estimation

- **Auto-Strategy Inference** — generate strategy variants from a single intent

- **Exploration Statistics** — avg score, standard deviation, risk distribution, budget utilization

## Quick Start

```bash

# Install dependencies

npm install

# Build

npm run build

# Run the interactive demo (12 steps covering the full feature set)

npm run demo

```

### Programmatic Usage

```typescript

import { createAosp, createFilesystemSurface, FilesystemAdapter } from 'aosp';

const aosp = createAosp();

// Register a surface (the thing being acted upon)

const surface = createFilesystemSurface('my-fs', 'My Filesystem', '/tmp/workspace');

aosp.kernel.registerSurface(surface);

aosp.executor.registerAdapter(new FilesystemAdapter('my-fs', '/tmp/workspace'));

// Register an actor

aosp.identity.register({

  id: 'agent-1', type: 'ai', name: 'My Agent',

  trust: 'elevated', scopes: ['my-fs/*'],

});

// Execute with full lifecycle

const trace = await aosp.kernel.execute({

  id: 'intent-1',

  surface: 'my-fs',

  action: 'write_file',

  params: { path: 'hello.txt', content: 'Hello, world!' },

  initiator: { id: 'agent-1', type: 'ai', name: 'My Agent', trust: 'elevated', scopes: ['my-fs/*'] },

  timestamp: Date.now(),

  metadata: {},

});

console.log(trace.status); // 'committed'

```

### Temporal Branching Usage

```typescript

const { temporal, executor } = createAosp();

temporal.setExecModule(executor as any);

temporal.registerSurface(surface);

temporal.setBudget({ maxTimelines: 10, maxDepth: 3, maxTotalIntents: 50 });

const result = await temporal.explore([

  { name: 'approach-a', intents: [/* ... */], metadata: {} },

  { name: 'approach-b', intents: [/* ... */], metadata: {} },

]);

console.log(result.comparison.winner.name);  // best strategy

console.log(result.stats.avgScore);           // exploration statistics

console.log(result.explorationProofHash);     // tamper-evident hash

// Commit the winner

await temporal.merge(result.comparison.winner.id);

```

### Durable Audit (Persistence)

```typescript

import { createAosp, JsonFilePersistenceStore } from 'aosp';

const aosp = createAosp();

// ... execute some actions ...

const backend = new JsonFilePersistenceStore('./aosp-audit');

await aosp.trace.persist(backend);          // snapshot traces + Merkle chain to disk

// Later, in a fresh process:

const next = createAosp();

await next.trace.hydrate(backend);          // restore the verifiable audit trail

console.log(next.trace.verifyChain().valid); // true

```

### NIST AI RMF Compliance

```typescript

import { createAosp, gatherEvidence } from 'aosp';

const aosp = createAosp();

const report = aosp.nist.generateReport(gatherEvidence(aosp));

console.log(report.summary.coverageScore); // 0–100

console.log(report.summary.readiness);      // 'non-compliant' | 'developing' | 'substantial' | 'compliant'

console.log(report.gaps);                   // prioritized recommendations

```

### A2A Safety Bridge

```typescript

const aosp = createAosp();

// ... register surface + adapter ...

// Operator grants trust to a known remote agent (self-claimed trust is ignored).

aosp.bridge.registerAgent({ id: 'agent-a', name: 'Planner', trust: 'elevated', scopes: ['*'] });

aosp.identity.register({ id: 'agent-a', type: 'agent', name: 'Planner', trust: 'elevated', scopes: ['*'] });

const result = await aosp.bridge.mediate({

  from: { id: 'agent-a', name: 'Planner', trust: 'elevated', scopes: ['*'] },

  surface: 'my-fs',

  action: 'write_file',

  params: { path: 'out.txt', content: 'delegated work' },

  task: 'persist the result',

});

console.log(result.decision);  // 'trusted' | 'rejected' | 'pending_approval' | 'error'

console.log(result.committed); // whether it reached reality

```

## CLI

```bash

# Full lifecycle execution

aosp execute   [params...]

# Shadow-only (preview)

aosp shadow   [params...]

# Trace management

aosp trace list

aosp trace show 

aosp trace export [id]

# Merkle chain

aosp chain verify

aosp chain show

# Policy DSL

aosp policy add ''

aosp policy list

# NIST AI RMF compliance report (add --json for the full report)

aosp nist

aosp nist --json

# System status

aosp status

# Rollback

aosp rollback 

# Interactive demo

aosp demo

```

## Project Structure

```

src/

  kernel/       Kernel orchestrator + type definitions

  safe/         Policy engine, DSL, approval gateway, blast radius

  exec/         Shadow executor, transactions, pipelines, temporal branching

  id/           Identity and authorization

  trace/        Trace store + Merkle chain (with snapshot/restore)

  info/         State management + drift detection

  magic/        Recovery strategies

  api/          Transport layer + HTTP server

  adapters/     Reference filesystem adapter

  persist/      Durable persistence stores (in-memory + JSON file)

  compliance/   NIST AI RMF compliance profile

  bridge/       A2A safety bridge (cross-agent trust layer)

  spec/         Protocol versioning

  cli/          CLI with full lifecycle demo + `nist` report

  index.ts      Public API exports + convenience factory

```

Zero runtime dependencies beyond `chalk`, `commander`, and `uuid`. The test suite

(`npm test`, Vitest) covers the kernel, safety, execution, trace/Merkle, persistence,

compliance, and A2A bridge modules.

## Limitations

- This is a protocol framework, not a production-ready system. It demonstrates the architecture for controlled AI execution.

- The shadow execution model depends on adapters faithfully simulating actions. The included filesystem adapter is a reference implementation.

- Temporal branching scores timelines based on shadow execution results. The scoring is only as good as the shadow predictions.

- The Merkle chain and trace store are in-memory by default, but can be snapshotted and persisted via a `PersistenceStore` (a JSON-file backend ships in `persist/`); a higher-throughput deployment would add a database-backed store implementing the same interface.

- The NIST compliance profile maps a curated, representative subset of AI RMF subcategories; it is a self-assessment aid, not a certified attestation.

- Policy DSL supports basic patterns (`MATCHES`, `CONTAINS`, `==`, `!=`, `>`, `>=`, `<`, `<=`, `IN`). Complex policies should use programmatic rules.

## License

MIT
ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/byte271/aosp

Awesome Lists containing this project

README