https://github.com/lukaskellerstein/dex

Autonomous feature implementation engine — spawns fresh Claude Code agents per phase (Ralph Wiggum loop) with spec-kit planning, real-time trace UI, and git automation

ai-agents autonomous-coding claude-agent-sdk claude-code electron react spec-kit sqlite typescript vite

Last synced: about 1 month ago

Host: GitHub
URL: https://github.com/lukaskellerstein/dex
Owner: lukaskellerstein
Created: 2026-04-13T20:07:28.000Z (3 months ago)
Default Branch: main
Last Pushed: 2026-05-24T13:07:20.000Z (about 2 months ago)
Last Synced: 2026-05-24T15:06:27.979Z (about 2 months ago)
Topics: ai-agents, autonomous-coding, claude-agent-sdk, claude-code, electron, react, spec-kit, sqlite, typescript, vite
Language: TypeScript
Size: 4.37 MB
Stars: 3
Watchers: 0
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

# Dex

Autonomous feature engine — fresh agents per phase, structured planning, full trace visibility.

---

Dex is an Electron desktop app that automates feature implementation using the [Ralph Wiggum loop](https://ghuntley.com/ralph/) philosophy, the [Claude Agent SDK](https://docs.anthropic.com/en/docs/agents-and-tools/claude-agent-sdk), and [spec-kit](https://github.com/anthropics/claude-code/tree/main/skills). It spawns a fresh Claude Code agent per **phase** of work — each with clean context to prevent token bloat — while streaming every tool call, subagent spawn, and thinking step to a real-time trace UI.

## Why Dex? The Three-Pillar Synthesis

Each pillar alone has critical gaps. Dex exists because the combination eliminates what each one lacks:

| Used alone | What breaks |
|---|---|
| **Ralph Wiggum loop** | No structured specs — the agent picks tasks from a free-form TODO, builds the wrong thing, and burns budget. No UI, no abort, no cost tracking, no failure recovery beyond "kill the terminal." |
| **Spec-kit** | No autonomous loop — specs are great but someone has to manually run each phase, check results, and decide what's next. It's a sequential human-driven workflow. |
| **Claude Code (interactive)** | Single session degrades after ~100K tokens. No persistent memory across sessions. No structured planning. No verification beyond "it compiles." One task at a time, human-driven. |

**Dex fuses all three:**

- **Ralph's loop** provides context isolation and self-improvement — fresh `query()` per phase, learnings persist via filesystem
- **Spec-kit** provides structured planning — specs with acceptance criteria before any code is written, constitution-governed consistency
- **Claude Agent SDK** provides programmatic control — typed hooks, abort, MCP browser verification, session management

The result: describe a project, clarify it into a complete plan, then walk away. Dex autonomously specs, plans, implements, verifies (including browser E2E), and recovers from failures — feature by feature, phase by phase.
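
The context-isolation idea can be illustrated with a small loop. This is a minimal sketch, not Dex's actual orchestrator: `PhaseSpec`, `PhaseResult`, `RunAgent`, and `runPhases` are hypothetical names, and the injected `runAgent` function stands in for whatever starts one fresh agent session (for example a single SDK `query()` call). Learnings are kept in an array here only to keep the sketch self-contained; the README says they persist via the filesystem.

```typescript
// Hypothetical types for illustration; none of these names come from the Dex source.
interface PhaseSpec {
  name: string;
  prompt: string;
}

interface PhaseResult {
  ok: boolean;
  learnings: string[];
}

// Stands in for one fresh agent session with no memory of earlier phases.
type RunAgent = (prompt: string) => Promise<PhaseResult>;

async function runPhases(
  phases: PhaseSpec[],
  runAgent: RunAgent,
): Promise<{ completed: string[]; learnings: string[] }> {
  const completed: string[] = [];
  const learnings: string[] = []; // assumption: a real loop would store these on disk

  for (const phase of phases) {
    // Each session sees only its own prompt plus the accumulated learnings,
    // never the conversation history of earlier phases.
    const prompt =
      learnings.length > 0
        ? `${phase.prompt}\n\nLearnings so far:\n- ${learnings.join("\n- ")}`
        : phase.prompt;

    const result = await runAgent(prompt);
    if (!result.ok) break; // stop at the first failed phase
    completed.push(phase.name);
    learnings.push(...result.learnings);
  }

  return { completed, learnings };
}
```

Injecting the runner keeps the loop testable without an API key, since a stub can replace the real agent call.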

## Features

- **Phase-level orchestration** — parses `tasks.md` specs into phases and spawns one agent per phase, keeping context focused
- **Real-time agent trace** — streams tool calls, thinking blocks, subagent activity, and results as they happen, rendered on a GSAP-animated timeline
- **Spec-kit integration** — discovers specs in `specs/` or `.specify/specs/`, uses `/speckit-plan` and `/speckit-implement` skills
- **Git automation** — creates branches, commits per phase, and opens PRs with cost/duration metrics
- **Execution history** — persists all runs, phases, steps, and subagent metadata to per-project JSON for replay and analysis
- **Checkpoints — time-travel over the pipeline** — every completed stage auto-captures a named checkpoint; go back, try again, fan out into N parallel variants, and keep the ones you want. Git under the hood, invisible to the user.
- **Frameless desktop UI** — custom title bar, 3-column layout (sidebar → task board → agent trace), Catppuccin-inspired dark theme
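
As an illustration of phase-level orchestration, the sketch below turns a `tasks.md` file into phases with tasks. The file layout is an assumption, not something this README specifies: phases are taken to be `## Phase N: Name` headings and tasks to be `- [ ] T001 description` checkbox lines. The `Phase` and `Task` shapes and the `parseTasks` name are likewise hypothetical and may differ from what `parser.ts` does.

```typescript
// Hypothetical shapes; the real types in the Dex source may differ.
interface Task {
  id: string;
  description: string;
  done: boolean;
}

interface Phase {
  number: number;
  name: string;
  tasks: Task[];
}

// Assumed format: "## Phase 1: Setup" headings followed by "- [ ] T001 ..." lines.
function parseTasks(markdown: string): Phase[] {
  const phases: Phase[] = [];
  const phaseHeader = /^##\s+Phase\s+(\d+)\s*:\s*(.+)$/;
  const taskLine = /^\s*-\s+\[([ xX])\]\s+(\S+)\s+(.+)$/;

  for (const line of markdown.split(/\r?\n/)) {
    const header = phaseHeader.exec(line);
    if (header) {
      phases.push({ number: Number(header[1]), name: header[2].trim(), tasks: [] });
      continue;
    }

    const task = taskLine.exec(line);
    const current = phases[phases.length - 1];
    // Task lines that appear before any phase heading are ignored.
    if (task && current) {
      current.tasks.push({
        id: task[2],
        description: task[3].trim(),
        done: task[1] !== " ",
      });
    }
  }

  return phases;
}
```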

## Checkpoints (008)

Dex runs a project as a sequence of discrete stages (clarification → constitution → gap-analysis → specify → plan → tasks → implement → verify → learnings). Every completed stage automatically captures a **checkpoint** — a named save point you can rewind to later.
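
The stage sequence lends itself to a small ordered type. The sketch below uses the stage names listed above; the `STAGES`, `Stage`, and `stagesFrom` names are hypothetical, and the assumption that rewinding re-runs the chosen stage plus everything after it is an illustration rather than confirmed Dex behavior.

```typescript
// Stage names are taken from the pipeline description; identifiers are hypothetical.
const STAGES = [
  "clarification",
  "constitution",
  "gap-analysis",
  "specify",
  "plan",
  "tasks",
  "implement",
  "verify",
  "learnings",
] as const;

type Stage = (typeof STAGES)[number];

// Returns the given stage and every stage after it, in pipeline order.
function stagesFrom(from: Stage): Stage[] {
  return STAGES.slice(STAGES.indexOf(from));
}
```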

Four user-facing verbs, no git knowledge required:

- **Go back** to any past checkpoint.
- **Try again** — re-run the current stage (or later stages) differently.
- **Try N ways** — fork N parallel variants of the next stage, compare, pick one. Spec-only stages run in parallel worktrees, so 3 variants of `plan` take ≈ 1× the wall time, not 3×.
- **Keep this** — accept a stage's output as the new canonical state.

There is also a **Record mode** (top-bar REC badge), which auto-promotes every completed stage. It is intended for team-shared baselines and CI fixtures.

All checkpoints travel via standard `git push --tags`; a collaborator who clones a Record-mode-produced project sees the same checkpoint tree. Power users can query the tree from the terminal:

```sh
git log --all --grep='^\[checkpoint:' # every auto-capture commit
git tag --list 'checkpoint/*' # every named save point
```
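
To make the "Try N ways" idea concrete, the sketch below builds the `git worktree add` invocations that could fork N variants from one checkpoint tag. It only constructs argument lists and runs nothing. The `variant/<stage>-<n>` branch names, the directory layout, and the `variantWorktreeCommands` function are assumptions for illustration; only the `checkpoint/*` tag prefix comes from the commands above.

```typescript
interface VariantCommand {
  branch: string;
  path: string;
  args: string[]; // arguments to pass to the `git` executable
}

// Builds one `git worktree add -b <branch> <path> <commit-ish>` call per variant.
// Branch and path naming here is hypothetical, not Dex's actual scheme.
function variantWorktreeCommands(
  checkpointTag: string,
  stage: string,
  count: number,
  baseDir: string,
): VariantCommand[] {
  const commands: VariantCommand[] = [];
  for (let i = 1; i <= count; i++) {
    const branch = `variant/${stage}-${i}`;
    const path = `${baseDir}/${stage}-${i}`;
    commands.push({
      branch,
      path,
      args: ["worktree", "add", "-b", branch, path, checkpointTag],
    });
  }
  return commands;
}
```

Because each variant gets its own working directory and branch, the variants can run side by side without touching each other's files.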

## Architecture

```mermaid
graph TD
    subgraph Renderer["Renderer (React 18)"]
        UI[3-Column UI]
        Hooks[useOrchestrator / useProject]
    end

    subgraph Main["Main Process (Electron)"]
        IPC[IPC Handlers]
        Preload[contextBridge → dexAPI]
    end

    subgraph Core["Core Engine (Pure Node.js)"]
        Orch[Orchestrator]
        Parser[tasks.md Parser]
        Git[Git Operations]
        DB[(SQLite)]
    end

    SDK[Claude Agent SDK]

    style Renderer fill:#1E1E2E,stroke:#7C3AED,color:#CDD6F4
    style Main fill:#1E1E2E,stroke:#45475A,color:#CDD6F4
    style Core fill:#1E1E2E,stroke:#45475A,color:#CDD6F4
    style SDK fill:#7C3AED,stroke:#7C3AED,color:#FFF
```

**Data flow:** User action → React UI → IPC → Core orchestrator → Claude Agent SDK → hooks capture steps → IPC events → React hooks → UI updates (real-time streaming).
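
The renderer end of this flow can be pictured as a reducer that folds incoming events into trace state. This is a sketch under assumptions: the event names, the `AgentEvent` and `TraceState` shapes, and `reduceTrace` are all hypothetical and are not taken from the Dex IPC layer.

```typescript
// Hypothetical event shapes that the core engine might emit over IPC.
type AgentEvent =
  | { type: "phase_started"; phase: number }
  | { type: "tool_call"; phase: number; tool: string }
  | { type: "phase_completed"; phase: number; costUsd: number };

interface TraceState {
  activePhase: number | null;
  steps: string[];
  totalCostUsd: number;
}

const initialTrace: TraceState = { activePhase: null, steps: [], totalCostUsd: 0 };

// Pure function: returns a new state for each event, which suits React state updates.
function reduceTrace(state: TraceState, event: AgentEvent): TraceState {
  switch (event.type) {
    case "phase_started":
      return { ...state, activePhase: event.phase };
    case "tool_call":
      return { ...state, steps: [...state.steps, `phase ${event.phase}: ${event.tool}`] };
    case "phase_completed":
      return {
        ...state,
        activePhase: null,
        totalCostUsd: state.totalCostUsd + event.costUsd,
      };
  }
}
```

Keeping the reducer pure means a stored event log can be replayed through it to rebuild the same trace.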

## Quick Start

### Prerequisites

- **Node.js** >= 18
- **Claude Code CLI** installed and authenticated
- **GitHub CLI** (`gh`) for PR creation

### Installation

```bash
git clone https://github.com/lukaskellerstein/dex.git
cd dex
npm install
```

### Development

```bash
./dev-setup.sh
```

This starts the Vite dev server (port 5500) and launches Electron with hot reload. Logs go to `/tmp/dex-logs/`.

### Production Build

```bash
npm run build:start
```

## Usage

1. **Open a project** — click the folder icon to select a directory containing spec-kit specs
2. **Select a spec** — the overview shows all discovered specs with phase/task counts
3. **Start a run** — choose plan or build mode; the orchestrator begins phase-by-phase execution
4. **Watch the trace** — tool calls, thinking blocks, and subagent activity stream in real time
5. **Review the PR** — on completion, a PR is created with commit history and cost metrics
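
The PR step can be sketched as two small helpers: one that renders per-phase metrics as a Markdown table, and one that assembles arguments for `gh pr create`. The `PhaseMetric` shape, both function names, and the table layout are assumptions for illustration; only the `--title`, `--body`, and `--base` flags are standard GitHub CLI options.

```typescript
// Hypothetical metric shape; Dex's actual PR template is not shown in this README.
interface PhaseMetric {
  name: string;
  costUsd: number;
  durationMs: number;
}

// Renders a Markdown table of per-phase cost and duration plus a total line.
function buildPrBody(metrics: PhaseMetric[]): string {
  const rows = metrics.map(
    (m) => `| ${m.name} | $${m.costUsd.toFixed(2)} | ${(m.durationMs / 1000).toFixed(1)}s |`,
  );
  const total = metrics.reduce((sum, m) => sum + m.costUsd, 0);
  return [
    "| Phase | Cost | Duration |",
    "|---|---|---|",
    ...rows,
    "",
    `Total cost: $${total.toFixed(2)}`,
  ].join("\n");
}

// Arguments for the `gh` executable; nothing is executed here.
function ghPrCreateArgs(title: string, body: string, base: string): string[] {
  return ["pr", "create", "--title", title, "--body", body, "--base", base];
}
```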

## Configuration

| Setting | Description | Default |
|---------|-------------|---------|
| `mode` | `plan` (spec planning) or `build` (implementation) | `build` |
| `model` | Claude model to use | SDK default |
| `maxTurns` | Max agent turns per phase | `200` |
| `phases` | `"all"` or specific phase numbers `[1, 3]` | `"all"` |
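
The table above can be expressed as a typed config with defaults. The setting names and default values come from the table; the `RunConfig` type, `resolveConfig`, and `selectPhases` are hypothetical helpers, and the `maxTurns` validation is an added assumption rather than documented behavior.

```typescript
type Mode = "plan" | "build";

interface RunConfig {
  mode: Mode;
  model?: string; // left unset to fall back to the SDK default
  maxTurns: number;
  phases: "all" | number[];
}

const DEFAULTS: RunConfig = { mode: "build", maxTurns: 200, phases: "all" };

// Merges user overrides onto the defaults and rejects an invalid turn limit.
function resolveConfig(overrides: Partial<RunConfig> = {}): RunConfig {
  const config: RunConfig = { ...DEFAULTS, ...overrides };
  if (!Number.isInteger(config.maxTurns) || config.maxTurns < 1) {
    throw new Error("maxTurns must be a positive integer");
  }
  return config;
}

// Picks which phase numbers to run, preserving their original order.
function selectPhases(available: number[], config: RunConfig): number[] {
  const wanted = config.phases;
  if (wanted === "all") return available;
  return available.filter((n) => wanted.includes(n));
}
```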

## Project Structure

```
dex/
├── src/
│   ├── main/                     # Electron main process
│   │   ├── index.ts              # App lifecycle, BrowserWindow, IPC
│   │   ├── preload.ts            # contextBridge → window.dexAPI
│   │   └── ipc/                  # Handler modules (orchestrator, project, history)
│   ├── core/                     # Orchestration engine (pure Node.js, no Electron imports)
│   │   ├── orchestrator.ts       # Phase loop, agent spawning, event emission
│   │   ├── parser.ts             # tasks.md → Phase[] with Task[]
│   │   ├── git.ts                # Branch creation, PR generation
│   │   ├── database.ts           # SQLite schema & queries (runs, traces, steps)
│   │   └── types.ts              # Shared interfaces (Phase, Task, AgentStep, etc.)
│   └── renderer/                 # React 18 UI
│       ├── App.tsx               # Root component, view switching
│       ├── hooks/                # useOrchestrator, useProject
│       ├── components/
│       │   ├── layout/           # AppShell, Topbar, WindowControls
│       │   ├── project-overview/ # Spec cards grid
│       │   ├── task-board/       # Phase/task views, progress bar
│       │   └── agent-trace/      # Step timeline, tool cards, subagent pills
│       └── styles/               # Catppuccin-inspired CSS custom properties
├── tests/                        # Diagnostic scripts
├── docs/                         # Logo assets
├── dev-setup.sh                  # Development environment bootstrap
├── vite.config.ts                # Vite config (renderer build)
├── tsconfig.json                 # TypeScript (main + core)
└── package.json
```

## Tech Stack

| Layer | Technology |
|-------|-----------|
| Desktop | Electron 30 (frameless BrowserWindow) |
| UI | React 18, CSS Custom Properties, GSAP, Lucide React |
| Engine | Pure Node.js orchestrator, Claude Agent SDK |
| Data | better-sqlite3 (execution history) |
| Build | Vite, TypeScript (strict mode) |
| Git | GitHub CLI (`gh`) for automated PRs |

## Contributing

1. Fork the repository
2. Create your feature branch (`git checkout -b feature/amazing-feature`)
3. Commit your changes (`git commit -m 'Add amazing feature'`)
4. Push to the branch (`git push origin feature/amazing-feature`)
5. Open a Pull Request
