https://github.com/edycutjong/litmus

🧪 Output-grading quality gate agent — grades any deliverable 0-100 with a rubric, on-chain
https://github.com/edycutjong/litmus

a2a agent croo grading quality

Last synced: 3 days ago
JSON representation

🧪 Output-grading quality gate agent — grades any deliverable 0-100 with a rubric, on-chain

Host: GitHub
URL: https://github.com/edycutjong/litmus
Owner: edycutjong
License: mit
Created: 2026-06-13T12:27:09.000Z (11 days ago)
Default Branch: main
Last Pushed: 2026-06-14T02:32:34.000Z (10 days ago)
Last Synced: 2026-06-14T03:14:13.815Z (10 days ago)
Topics: a2a, agent, croo, grading, quality
Language: TypeScript
Size: 12.8 MB
Stars: 0
Watchers: 0
Forks: 0
Open Issues: 7
Metadata Files:
- Readme: README.md
- License: LICENSE
- Security: SECURITY.md

Awesome Lists containing this project

README

          


  

  
Litmus 🧪

  Output-grading quality gate agent — grades any deliverable 0-100 with a rubric, on-chain

  

  


  [![Live Demo](https://img.shields.io/badge/🚀_Live-Demo-06b6d4?style=for-the-badge)](https://mock.croo.network)

  [![Built for CROO Hackathon](https://img.shields.io/badge/DoraHacks-CROO_Hackathon_2026-8b5cf6?style=for-the-badge)](https://dorahacks.io)

  


  ![TypeScript](https://img.shields.io/badge/TypeScript-3178C6?style=flat&logo=typescript&logoColor=white)

  ![Node.js](https://img.shields.io/badge/Node.js-339933?style=flat&logo=node.js&logoColor=white)

  [![CI](https://github.com/edycutjong/litmus/actions/workflows/ci.yml/badge.svg)](https://github.com/edycutjong/litmus/actions/workflows/ci.yml)



---

## 📸 See it in Action



  



> **The Quality Gate Workflow.** Deliverable Received → Litmus Applies Grading Rubric → Score (0-100) Calculated → Feedback & On-Chain Grade Delivered.

---

## 💡 The Problem & Solution

In an autonomous agent economy, output quality varies wildly. How do you trust an agent's work without manual human review?

**Litmus** is an AI Quality Gate Agent. It acts as an automated, impartial grader that evaluates deliverables against strict, predefined rubrics. If an agent submits subpar code, writing, or analysis, Litmus rejects it, ensuring only high-quality work passes the gate.

**Key Features:**

- ⚖️ **Objective Grading:** Evaluates work across multiple rubric categories, assigning a deterministic score from 0-100.

- 🚧 **Quality Gatekeeper:** Automatically rejects work that falls below the acceptable threshold.

- ⛓️ **On-Chain Attestation:** Cryptographically signs the grade to ensure the evaluation is immutable and verifiable.

## 🌌 The Constellation — On-Chain A2A Graph

Litmus is the constellation's **quality oracle**: other agents pay it on-chain to grade a deliverable 0–100 against a rubric. A two-model "tribunal" (with a tiebreaker) keeps scoring stable (σ < 4). Verifiable, paid, impartial grading-as-a-service is a primitive a normal API marketplace can't offer.

```mermaid

graph LR

    User([Any Agent / User]) -->|hires to grade| L[Litmus 🧪]

    M[Maestro 🎼] -->|grade + re-grade in its reflection loop| L

    G[Gauntlet 🧤] -.->|certifies| L

    classDef hot fill:#F59E0B,stroke:#111,color:#111,font-weight:bold;

    class L hot;

```

- **Depth:** Maestro hires Litmus **twice** per pipeline — once to grade, once to re-grade the self-corrected draft — making it a high-traffic A2A node.

- **Anti-gaming:** rubric weights are validated and Format/Clarity is capped at 15% so agents can't farm a passing grade on style alone.

## 🔗 Live Run Log — On-Chain Proof (Base Mainnet)

Real CAP grading orders Litmus fulfilled as a **provider**.

**Total real CAP orders: _0_** · _last updated: 2026-06-__

| # | Date | Counterparty (requester) | Amount (USDC) | Order ID | Tx (BaseScan) | Score |

|---|------|--------------------------|---------------|----------|---------------|-------|

| 1 | _2026-06-__ | _Maestro / external_ | _0.00_ | `_ord_…_` | [0x…](https://basescan.org/tx/0x…) | _N_/100 |

> Order IDs + pay tx are in the provider logs and the CROO dashboard. Delete this note once populated.

## 🏗️ Architecture & Tech Stack

| Layer | Technology |

|---|---|

| **Runtime** | Node.js (TypeScript) |

| **Ecosystem** | Constellation A2A (croo-core) |

| **Testing** | Vitest |

## 🚀 Getting Started

### Prerequisites

- Node.js ≥ 20

- npm

### Installation

1. Clone: `git clone https://github.com/edycutjong/litmus.git`

2. Install: `npm install`

3. Configure: `cp .env.example .env.local` and fill in your service ID + an LLM key (skip for mock mode)

### ▶️ Run it now — offline mock mode (no wallet, no USDC)

```bash

npm install

CROO_MOCK=true npm run dev   # boots the grader provider with no on-chain calls

```

Grading works with **no API key** (deterministic mock grade); set `OPENAI_API_KEY` and/or `ANTHROPIC_API_KEY` to enable the live LLM tribunal. Run `npm run stability` to reproduce the σ < 4 scoring-variance harness.

## 🧪 Testing & CI

**4-stage pipeline:** Quality → Security → Build → Deploy Gate

```bash

# ── Code Quality ────────────────────────────

make lint          # ESLint

make typecheck     # TypeScript check

make test          # Run tests

make test-coverage # Coverage report

make ci            # Full quality gate

# ── Security ────────────────────────────────

make security-scan # npm audit + license check

```

| Layer | Tool | Status |

|---|---|---|

| Code Quality | ESLint + TypeScript | ✅ |

| Unit Testing | Vitest | ✅ |

| Security (SAST) | CodeQL | ✅ |

| Security (SCA) | Dependabot + npm audit | ✅ |

| Secret Scanning | TruffleHog | ✅ |

## 📁 Project Structure

```text

dorahacks-croo-litmus/

├── docs/              # README assets (hero, screenshots)

├── src/               # Application source code

├── scripts/           # Build and run scripts

├── __tests__/         # Vitest test suites

├── .github/           # CI workflows

└── README.md          # You are here

```

## 🚢 Deploy

Containerized for any PaaS. Litmus is a background **worker** (connects out to the CROO WebSocket — no inbound port):

```bash

docker build -t litmus .

docker run --env-file .env.local litmus

```

## 📄 License

[MIT](LICENSE) © 2026 Edy Cu

## 🙏 Acknowledgments

Built for the DoraHacks CROO Hackathon 2026.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/edycutjong/litmus

Awesome Lists containing this project

README

Litmus 🧪