https://github.com/randomittin/superx

One prompt → finished project. Autonomous Claude Code agent with wave-based parallel execution, pixel-art dashboard, and ~75% token savings.
https://github.com/randomittin/superx
agent-orchestration ai-agent ai-coding ai-orchestration anthropic automation autonomous-agent claude-agent claude-ai claude-code claude-plugin claude-skill cli coding-assistant developer-tools devtools llm-agent llm-framework llm-tools multi-agent
Last synced: about 2 months ago
JSON representation
One prompt → finished project. Autonomous Claude Code agent with wave-based parallel execution, pixel-art dashboard, and ~75% token savings.
Host: GitHub
URL: https://github.com/randomittin/superx
Owner: randomittin
License: mit
Created: 2026-04-06T09:57:34.000Z (2 months ago)
Default Branch: main
Last Pushed: 2026-04-19T06:18:29.000Z (about 2 months ago)
Last Synced: 2026-04-19T07:02:14.365Z (about 2 months ago)
Topics: agent-orchestration, ai-agent, ai-coding, ai-orchestration, anthropic, automation, autonomous-agent, claude-agent, claude-ai, claude-code, claude-plugin, claude-skill, cli, coding-assistant, developer-tools, devtools, llm-agent, llm-framework, llm-tools, multi-agent
Language: JavaScript
Homepage: https://github.com/randomittin/superx#readme
Size: 5 MB
Stars: 3
Watchers: 0
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- License: LICENSE
Awesome Lists containing this project

README

          # superx

**One prompt → finished project.** Autonomous [Claude Code](https://code.claude.com) agent that plans, executes in parallel, verifies, and ships.

[![Claude Code](https://img.shields.io/badge/Claude%20Code-plugin-e056a0?style=flat-square)](https://code.claude.com)

[![License: MIT](https://img.shields.io/badge/License-MIT-9b59b6?style=flat-square)](LICENSE)

[![Version](https://img.shields.io/badge/version-1.0.0-00d4ff?style=flat-square)](CHANGELOG.md)

[![Install](https://img.shields.io/badge/install-curl-4ecca3?style=flat-square)](#quick-start)

```bash

curl -fsSL https://raw.githubusercontent.com/randomittin/superx/main/install.sh | bash

```

What is superx?

superx is an autonomous superskill manager for [Claude Code](https://code.claude.com) by [Anthropic](https://www.anthropic.com). It assesses task complexity, creates structured plans with acceptance criteria, executes in parallel waves with fresh context per wave, verifies everything, and ships — with the judgment of a senior dev / CTO.

A real-time **pixel-art dashboard** ships with the plugin so you can watch the work happen: a live isometric city map of your project, a war room of agents, streaming logs, and a timeline of every decision.

**Keywords:** Claude Code plugin, Claude AI agent, Anthropic, autonomous coding agent, multi-agent orchestration, AI pair programmer, LLM agent framework, developer automation, CTO-level AI, parallel agent execution, pixel-art dashboard, wave-based execution, acceptance criteria gates, token compression, caveman mode, cross-session memory, autonomous repo maintenance.

Features

- **One prompt → finished project.** Drop a task in, get a working repo back.

- **Hybrid planning pipeline.** Complexity-aware: simple tasks execute directly, medium/complex go through planning → parallel execution → verification.

- **Wave-based parallel execution.** Tasks grouped into dependency waves. Each wave runs in parallel with fresh 200K-token context — no context rot.

- **Acceptance criteria as blocking gates.** Every task has runnable checks (grep, curl, test commands). Tasks cannot advance until all criteria pass.

- **Plan verification before execution.** Plans validated up to 3 iterations before code is written.

- **File-based state in `.planning/`.** Human-readable Markdown committed to git. Survives `/clear`.

- **Question-mark protocol.** Claude only stops to ask when it actually needs you. Dashboard surfaces the question with quick-pick buttons.

- **Conversation continuity.** Replies use `claude --resume ` for full context.

- **Real-time pixel dashboard.** Isometric city map, war room, streaming logs, day/night theme, history drawer.

- **Companion plugins auto-installed.** caveman (~75% token savings), superpowers (brainstorming), claude-mem (persistent memory).

- **Quality gates.** Tests, lint, code review — enforced via hooks before any push.

- **Atomic commits per task.** Each task = one git commit. Bisectable, revertable.

- **Auto-checkpointing.** Background git commits + crash recovery.

- **Maintainer mode.** Autonomous repo maintenance — triage, fix, test, batch release.

- **Token budgets.** Set a budget, get warned at 80%.

- **GitHub integration.** One-click commit + push from the dashboard.

---

## Quick start

### One-line install

```bash

curl -fsSL https://raw.githubusercontent.com/randomittin/superx/main/install.sh | bash

```

This installs Node.js (if missing) → Claude Code → superx → companion plugins (caveman, superpowers, claude-mem). Then:

```bash

source ~/.zshrc   # or open a new terminal

cd /path/to/your/project

superx "build a real-time dashboard with auth and charts"

```

### Or install via Claude Code plugin marketplace

```bash

claude plugins marketplace add randomittin/superx-marketplace

claude plugins install superx

```

### Or install manually

```bash

git clone https://github.com/randomittin/superx.git ~/.superx

export PATH="$PATH:$HOME/.superx/bin"

```

### Prerequisites

- **Claude Code** 1.0+ with valid auth ([install guide](https://docs.claude.com/en/docs/claude-code/setup))

- **Python** 3.11+ (for the dashboard; stdlib only, no pip install)

- **Git**

---

## Usage

```bash

superx "deploy to vercel"     # Run a task end-to-end

superx                        # Interactive Claude session with superx powers

superx --dashboard            # Start the pixel dashboard (http://localhost:8080)

superx --update               # Pull latest version from GitHub

superx --setup                # Re-run companion plugin setup

superx --help                 # Show help

```

superx launches Claude Code with:

- `--dangerously-skip-permissions` — full autonomy

- `--plugin-dir ` — all superx agents + skills loaded

- `--agent superx` — CTO-level orchestrator as main agent

- **caveman** — token compression (~65-75% savings)

- **superpowers** — brainstorming, debugging, skill-creator

- **claude-mem** — persistent memory across sessions

### Dashboard mode

```bash

cd /path/to/your/project

superx --dashboard

# open http://localhost:8080

```

Auto-sets the current directory as the project. Click `+ NEW` to start a fresh project with a native folder picker.

---

## How it works — the hybrid pipeline

superx assesses every incoming task and routes it through the right level of planning:

```

User prompt

    ↓

Assess complexity

    ├── Simple (single-file fix, config, question)

    │     → Execute directly. No planning overhead.

    │

    ├── Medium (feature addition, 2-5 file bug)

    │     → Lightweight plan with acceptance criteria → execute → verify

    │

    └── Complex (new project, major feature, 6+ files)

          → Full pipeline ↓

Init .planning/ in project dir

    ↓

Discuss — analyze codebase, surface assumptions → CONTEXT.md

    ↓

Plan — planner agent creates PLAN-{phase}.md

    • Tasks grouped into dependency waves

    • Each task: read_first, action, acceptance_criteria, verify, done

    • Self-verification loop (max 3 iterations)

    ↓

Execute — wave by wave, parallel within each wave

    • Wave 1 (no deps) → fresh context, parallel subagents

    • Wave 2 (depends on 1) → fresh context, parallel subagents

    • Each task = atomic git commit

    • Acceptance criteria BLOCK progression

    ↓

Verify — verifier checks ALL criteria + requirement coverage

    • PASS → ship

    • FAIL → diagnosis + fix plan → re-execute

```

### Why fresh context per wave?

After 4-5 agents complete work and report back, accumulated conversation history can exceed 100K+ tokens. The next wave's agents would be working in degraded context. Instead, each wave gets a clean 200K-token window — just the plan, the context doc, and the source files it needs. No garbage from prior waves.

### Why acceptance criteria as gates?

A task like "Create login API" isn't done when code is written. It's done when:

```

grep "export const login" src/api.ts       # function exists

curl -s localhost:3000/api/login | jq .    # endpoint responds

npm test -- --grep "auth"                   # tests pass

```

These are blocking — the wave-executor won't commit until all pass. If they fail after 2 fix attempts, the task is marked BLOCKED and the orchestrator escalates.

---

## `.planning/` state directory

All planning state lives as human-readable Markdown in your project's `.planning/` directory:

| File | Purpose |

|---|---|

| `PROJECT.md` | Vision, constraints, tech stack |

| `REQUIREMENTS.md` | v1 (must-have), v2 (next), out-of-scope |

| `STATE.md` | Living memory: current phase, decisions, blockers, metrics. YAML frontmatter derived from body content (rebuilt on every write). |

| `CONTEXT.md` | Per-phase user preferences + codebase assumptions |

| `PLAN-{phase}.md` | Task specifications with waves, acceptance criteria, dependencies |

| `SUMMARY-{phase}.md` | Execution results with commit hashes |

These files are:

- **Human-readable** — open them in any editor

- **Git-committable** — track planning decisions in version history

- **Session-surviving** — persist across `/clear` and server restarts

- **Concurrent-safe** — lockfile-based mutual exclusion for parallel agents

---

## Agent roster

| Agent | Role | Model | Effort | When spawned |

|---|---|---|---|---|

| `superx` | Main orchestrator | Opus | max | Always (session agent) |

| `architect` | Decomposition + planning | Opus | high | Complex tasks |

| `planner` | Wave-grouped plans + acceptance criteria | Opus | high | Medium + complex tasks |

| `wave-executor` | Execute one wave (up to 10 parallel) | Opus | high | Per wave during execution |

| `verifier` | Sentinel gate + truth scoring | Opus | high | After each phase |

| `coder` | Feature implementation | Opus | high | Simple + within waves |

| `design` | UI/UX design | Opus | high | When UI work detected |

| `security-auditor` | OWASP, secrets scan, auth review | Opus | max | Complex tasks with auth/API |

| `database-architect` | Schema, migrations, query optimization | Opus | high | Data layer tasks |

| `incident-responder` | Triage, diagnose, mitigate, postmortem | Opus | max | Production fires |

| `reviewer` | Code review before push | Opus | high | Quality gate |

| `test-runner` | Test writing and execution | Sonnet | default | Quality gate |

| `docs-writer` | Documentation | Sonnet | default | Post-execution |

| `lint-quality` | Lint and formatting | Haiku | low | Quality gate |

---

## Architecture

```

superx/

├── .claude-plugin/plugin.json    # Plugin manifest (v1.0.0)

├── agents/                       # 14 specialized agent definitions

│   ├── superx.md                 # Main orchestrator (Opus)

│   ├── architect.md              # Decomposition + planning (Opus)

│   ├── planner.md                # Structured plans with acceptance criteria

│   ├── wave-executor.md          # Per-wave parallel execution

│   ├── verifier.md               # Post-execution verification

│   ├── coder.md                  # Implementation (Opus/high)

│   ├── design.md                 # UI/UX (Opus/high)

│   ├── security-auditor.md       # OWASP + secrets scan (Opus/max)

│   ├── database-architect.md     # Schema + migrations (Opus/high)

│   ├── incident-responder.md     # Production fires (Opus/max)

│   ├── reviewer.md               # Code review (Opus/high)

│   ├── test-runner.md            # Tests (Sonnet/default)

│   ├── docs-writer.md            # Docs (Sonnet/default)

│   └── lint-quality.md           # Lint (Haiku/low)

├── skills/superx/                # Main skill + reference docs

├── commands/                     # Slash commands

├── hooks/

│   ├── hooks.json                # Quality gate hooks (4 event types)

│   └── statusline.sh             # HUD for Claude Code status bar

├── bin/                          # CLI tools

│   ├── superx                    # Main launcher (self-bootstrapping)

│   ├── lib/planning.sh           # .planning/ state management

│   ├── lib/dispatch.sh           # File-based task dispatch queue

│   ├── superx-state              # State CRUD

│   ├── detect-skills             # Skill inventory

│   ├── conflict-log              # Conflict tracking

│   └── authenticity-check        # Package verification

├── ui/                           # Pixel dashboard

│   ├── server.py                 # Stdlib HTTP + SSE server (auto-updates on startup)

│   └── static/                   # HTML/CSS/JS + 35 sprite tiles

├── docs/superpowers/specs/       # Design docs

├── install.sh                    # One-line installer

├── settings.json                 # Default settings

├── CHANGELOG.md

├── LICENSE                       # MIT

└── README.md                     # This file

```

---

## Companion plugins (auto-installed)

| Plugin | What it does | Impact |

|---|---|---|

| **[caveman](https://github.com/juliusbrussee/caveman)** | Terse output compression | ~65-75% output token savings |

| **[superpowers](https://github.com/anthropics/claude-plugins-official)** | Brainstorming, debugging, skill-creator | Better planning + design quality |

| **[claude-mem](https://github.com/thedotmack/claude-mem)** | Persistent memory across sessions | No repeated context between sessions |

All three are installed automatically on first run (`superx --setup` to re-run).

---

## Dashboard

### How it works

1. **Set the project directory** — click `+ NEW` or the GitHub icon, browse for a folder or type a path.

2. **Type a task** in the prompt bar and hit RUN.

3. Watch the **timeline** (left) and **logs panel** (right) update in real time. The **map tab** renders your project as an isometric city.

4. If Claude needs input, the **awaiting-input panel** opens with the question, auto-detected option buttons, and a SEND button.

5. When done, the session is archived to the **history drawer** (clock icon).

### Map controls

| Control | Action |

|---|---|

| **+ / −** | Zoom |

| **Sun/Moon** | Day / dawn / dusk / night theme |

| **⛶** | Fullscreen |

| **Drag** | Pan |

| **Hover** | Building tooltip with package name + file count |

### Status badges

| Badge | Meaning |

|---|---|

| `IDLE` | Nothing running |

| `RUNNING` | Claude subprocess actively streaming |

| `AWAITING INPUT` | Claude finished with a question, waiting on you |

| `ERROR` | Subprocess exited non-zero |

---

## Autonomy levels

| Level | Name | Behavior |

|---|---|---|

| 1 | Guided | Asks approval on every action |

| 2 | Checkpoint | Runs autonomously, pauses at milestones (default) |

| 3 | Full Auto | Runs until complete or blocked |

Change with `/superx:level <1|2|3>` or cycle with `/superx:level +` / `-`.

---

## Slash commands

| Command | Description |

|---|---|

| `/superx:level <1\|2\|3\|+\|->` | Set or cycle autonomy level |

| `/superx:status` | Show project state and quality gates |

| `/superx:maintain [on\|off\|status]` | Activate maintainer mode |

| `/superx:maintain-check [--dry-run]` | Run one maintenance cycle |

| `/superx:reflect` | Force a conflict reflection pass |

---

## Quality gates

Every push must pass:

1. All tests passing

2. Lint clean (zero warnings)

3. Conflict reflection done

4. Code review completed

5. No dirty (untested) changes

Enforced via `hooks/hooks.json` PreToolUse hook on `git push`.

---

## Maintainer mode

```bash

/superx:maintain

```

Continuous repo maintenance: triage → fix → test → review → batch release.

| Severity x Confidence | Action |

|---|---|

| Critical x Any | Alert + hotfix + human approval |

| High x High | Auto-fix + PR + request review |

| Medium/Low x High | Auto-fix, batch into patch release |

| Any x Low | Escalate with context |

---

## Requirements

- **Claude Code** 1.0+ ([install guide](https://docs.claude.com/en/docs/claude-code/setup))

- **Python** 3.11+ (stdlib only)

- **Git**

- **`jq`** (for state helpers): `brew install jq`

- **`gh` CLI** (optional): `brew install gh`

---

## Troubleshooting

| Problem | Fix |

|---|---|

| "Set a project directory first" | Click `+ NEW` or GitHub icon, enter path |

| Map shows old project | Click `+ NEW`, set the new path — map auto-refreshes |

| Awaiting panel never appears | Claude's response must end with `?`. Check system preamble isn't overridden |

| "Resuming from checkpoint" loop | Click STOP. Delete `superx-checkpoint.json` + `superx-session.json`. Restart server |

| Page won't load | Default port 8080. Override: `SUPERX_PORT=9090 python ui/server.py` |

| `superx: command not found` | Run `source ~/.zshrc` or add `export PATH="$PATH:$HOME/.superx/bin"` to your shell profile |

| SSH permission denied on install | The marketplace uses HTTPS. Run `claude plugins marketplace update superx-marketplace` |

---

## Tips and tricks

### Safer alternative to skip-permissions

```bash

superx --auto "build X"     # uses --permission-mode auto instead

```

Auto-mode uses a background safety classifier — blocks prompt injection and risky escalation while still letting superx work autonomously.

### Project memory via CLAUDE.md

superx auto-generates a `CLAUDE.md` in your project root on first run. This file survives `/clear` and session restarts — Claude reads it on every new session for persistent project conventions, architecture notes, and style rules.

### Flicker-free rendering

superx sets `CLAUDE_CODE_NO_FLICKER=1` automatically for stable alt-screen rendering with mouse support. No configuration needed.

### Recovery from errors

```bash

# Inside Claude Code:

/rewind                    # undo recent changes

# or press Esc Esc

# Outside:

superx --resume            # continue last conversation with full context

```

### Voice input

```bash

# Inside a superx session:

/voice                     # push-to-talk, 20 languages

```

### Remote control

```bash

# Inside a superx session:

/rc                        # continue from phone/tablet/browser

```

### Parallel sessions with git worktrees

superx's wave-executor already uses `isolation: worktree` for parallel tasks. For manual multi-session work:

```bash

# Terminal 1:

cd /project && superx "build auth"

# Terminal 2:

cd /project && superx "build dashboard"

```

Each gets its own git worktree — no conflicts.

### Chrome integration

```bash

claude --chrome --plugin-dir ~/.superx   # browser automation

```

Test web apps, debug console, automate forms, extract data.

### Scheduled background tasks

```bash

# Inside a superx session:

/loop "run tests and report failures"     # local, recurring

/schedule "check for dependency updates"  # cloud, Anthropic infrastructure

```

---

## Inspired by

- **[get-shit-done](https://github.com/gsd-build/get-shit-done)** — wave-based execution, acceptance criteria gates, `.planning/` state files, plan verification loops

- **[caveman](https://github.com/juliusbrussee/caveman)** — token compression via terse communication

- **[claude-mem](https://github.com/thedotmack/claude-mem)** — persistent cross-session memory

- **[claude-code-best-practice](https://github.com/shanraisshan/claude-code-best-practice)** — comprehensive tips, tricks, and workflow patterns

---

## Contributing

PRs welcome. See [CHANGELOG.md](CHANGELOG.md) for release history and [docs/superpowers/specs/](docs/superpowers/specs/) for design docs.

---

## License

[MIT](LICENSE)
ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/randomittin/superx

Awesome Lists containing this project

README