https://github.com/anton-abyzov/specweave
Spec-first AI development: describe a feature → AI creates spec + plan + tasks, builds autonomously, syncs to GitHub/JIRA. Domain-expert skills for PM, Architect, Frontend, QA learn your patterns permanently. Claude Code, Codex, Cursor, Copilot & more.
https://github.com/anton-abyzov/specweave
ai azure-devops claude cli developer-tools documentation jira testing typescript
Last synced: 12 days ago
JSON representation
Spec-first AI development: describe a feature → AI creates spec + plan + tasks, builds autonomously, syncs to GitHub/JIRA. Domain-expert skills for PM, Architect, Frontend, QA learn your patterns permanently. Claude Code, Codex, Cursor, Copilot & more.
- Host: GitHub
- URL: https://github.com/anton-abyzov/specweave
- Owner: anton-abyzov
- License: mit
- Created: 2025-10-25T21:31:34.000Z (8 months ago)
- Default Branch: develop
- Last Pushed: 2026-05-13T06:57:22.000Z (about 1 month ago)
- Last Synced: 2026-05-13T08:35:58.888Z (about 1 month ago)
- Topics: ai, azure-devops, claude, cli, developer-tools, documentation, jira, testing, typescript
- Language: TypeScript
- Homepage: https://spec-weave.com
- Size: 189 MB
- Stars: 140
- Watchers: 2
- Forks: 18
- Open Issues: 9
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- Contributing: .github/CONTRIBUTING.md
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
- Security: SECURITY.md
- Agents: AGENTS.md
Awesome Lists containing this project
README
SpecWeave
AI-assisted development, under control.
Stop prompting. Start specifying.
## The Problem
**36.82% of AI skills have security flaws** ([Snyk ToxicSkills](https://snyk.io/blog/toxicskills-malicious-ai-agent-skills-clawhub/)). In May 2025, 170 out of 1,645 vibe-coded apps had security vulnerabilities exposing personal data. No specs. No tests. No review. Just vibes.
Every alternative is an instruction layer — Cursor Rules, Copilot Instructions, Windsurf Rules, CLAUDE.md. They tell the AI *how* to write code but never *what* to build, never *how* to test it, and never *when* it's done.
SpecWeave is a spec-first development layer. Configuration, not prompting. Enforced, not hoped for.
## The Solution
```
You: "Build a checkout flow with Stripe"
↓
spec.md → plan.md → tasks.md ← you review the plan
↓
Autonomous execution for hours ← AI builds, tests, fixes
↓
Quality gates (Grill + Judge-LLM) ← code reviewed automatically
↓
Synced to GitHub/JIRA/ADO ← closed, documented, shipped
```
Every feature starts as a specification — user stories, acceptance criteria, architecture decisions — before a single line of code is written. TDD enforces correctness. Quality gates catch what tests miss.
## Built With SpecWeave
12 production projects shipped in 3 months. 5 in the App Store.
| App | Platform | What It Does |
|-----|----------|-------------|
| [**EasyChamp**](https://easychamp.com) | Web (GCP) | Enterprise sports league management. 20+ microservices, ML video analytics. 4 years in production. |
| [**SketchMate**](https://apps.apple.com/app/sketchmate-ai-draw-game/id6760250072) | App Store | AI drawing game — multi-model evaluation judges player art semantically. |
| [**Lulla**](https://apps.apple.com/app/lulla-calm-baby-anywhere/id6756977992) | App Store | Baby sleep app with Apple Watch. ML cry classification (tired/hungry/pain). |
| [**Football 2026**](https://apps.apple.com/app/football-2026-travel/id6757258711) | App Store + Web | World Cup 2026 companion. AI travel planner, live tickets, team stats. |
| [**SkillUp Football**](https://apps.apple.com/app/skillup-football/id6756978002) | App Store | Coaches monetize training via Stripe. Instagram-like feed, scheduling. |
| [**BizZone**](https://apps.apple.com/app/business-zone/id6756091030) | App Store | Student & business events with AI-powered news generation. |
| [**EduFeed**](https://edufeed-jet.vercel.app/) | Web | NotebookLM meets Zoom. Upload videos, get quizzes, flashcards, live rooms. |
| [**JobWeave**](https://jobweave.ai) | Web | AI-powered job search. Smart matching, resume optimization. |
| [**SpecWeave**](https://github.com/anton-abyzov/specweave) | npm | The framework itself. 600+ increments, 538+ releases. |
| [**SpecWeave Umbrella**](https://github.com/anton-abyzov/specweave-umb) | GitHub | Multi-repo orchestration workspace for all repositories. |
| [**vskill**](https://github.com/anton-abyzov/vskill) | npm | Package manager for AI skills. Security scanning, 49 platforms. |
| [**verified-skill.com**](https://verified-skill.com) | Web | Skill marketplace & studio. 105K+ verified skills, eval system. |
[Browse increments on GitHub](https://github.com/anton-abyzov/specweave/tree/develop/.specweave/increments) — full transparency.
## Quick Start
```bash
npm install -g specweave # Node.js 20.12.0+
cd your-project
specweave init .
# Then in Claude Code, Cursor, Copilot, or any AI tool:
# "Build me a user authentication system"
```
## How It Compares
| Capability | Cursor Rules | Copilot Instructions | Windsurf | Cline | Vibe Coding | **SpecWeave** |
|------------|:---:|:---:|:---:|:---:|:---:|:---:|
| Structured specs (spec + plan + tasks) | — | — | — | — | — | **Yes** |
| Quality gates (Grill + Judge-LLM + 130 rules) | — | — | — | — | — | **Yes** |
| Autonomous execution (hours, unattended) | — | — | — | — | — | **Yes** |
| Multi-agent teams (parallel, contract-first) | — | — | — | — | — | **Yes** |
| External sync (GitHub / JIRA / ADO) | — | — | — | — | — | **Yes** |
| TDD enforcement (strict red-green-refactor) | — | — | — | — | — | **Yes** |
| LSP code intelligence (198x faster) | — | — | — | — | — | **Yes** |
| Self-improving skills (learns from corrections) | — | — | — | — | — | **Yes** |
Cursor tells AI "use Tailwind." SpecWeave tells AI "build a checkout flow with 5 acceptance criteria, test it, review it, sync to JIRA, and close."
## Key Features
**Spec-First Planning** — Every feature starts as spec.md + plan.md + tasks.md. Configuration, not prompting.
**TDD Enforcement** — Strict red-green-refactor. Tasks cannot close without passing tests. Coverage targets enforced.
**Agent Swarms** — Run parallel agents across iTerm/tmux panes. Team lead splits work, each agent owns an increment.
```
┌──────────────────┬──────────────────┬──────────────────┐
│ Agent 1 (auth) │ Agent 2 (payments)│ Agent 3 (catalog)│
│ sw:auto │ sw:auto │ sw:auto │
│ ████████░░ 80% │ ██████░░░░ 60% │ ████░░░░░░ 40% │
└──────────────────┴──────────────────┴──────────────────┘
```
**LSP Code Intelligence** — 198x faster than grep, 0 false positives. Semantic references, definitions, and types.
**100+ Skills** — PM, Architect, QA, Security, DevOps, Frontend, Backend, Mobile, ML. Every skill is customizable via skill-memories without forking.
**External Sync** — GitHub Issues, JIRA, Azure DevOps — bidirectional, real-time. Close an increment, external tools update automatically.
**Enterprise Ready** — Compliance audit trails (SOC 2, HIPAA, FDA). Brownfield analysis. Multi-repo coordination. Multi-environment deployment.
**Dashboard** — Built-in web dashboard for increment progress, analytics, cost tracking, and multi-project monitoring.
## Skills Ecosystem
SpecWeave skills are published and verified at **[verified-skill.com](https://verified-skill.com)**. The [vskill](https://www.npmjs.com/package/vskill) package manager provides:
- **Security scanning** — 52 attack patterns, SHA-256 pinning, blocklist API
- **49 agent platforms** — one install deploys to Claude Code, Cursor, Copilot, Windsurf, and 45 more
- **Skill evals** — unit tests, A/B comparisons, cross-model testing. Skills tested like programs.
- **Visual Skill Studio** — `vskill eval serve` for benchmarks, comparisons, and history
```bash
npx vskill install remotion-best-practices # Install from registry
npx vskill eval run my-skill # Run eval suite
```
## Core Commands
| You say | SpecWeave runs |
|---------|---------------|
| "Build me X" | `sw:increment` → spec + plan + tasks |
| "Go ahead" | `sw:auto` → autonomous execution |
| "Ship it" | `sw:done` → quality gates + close |
| "Split into teams" | `sw:team-lead` → parallel agents |
| "Review the code" | `sw:code-reviewer` → 6 parallel reviewers |
[Full command reference](https://spec-weave.com/docs/commands/overview)
## Upgrading to v1.1.0 (Opus 4.7 Alignment)
SpecWeave v1.1.0 aligns the framework with Claude Opus 4.7 best practices.
### P0 Fixes (required if using judge-llm or team-lead)
1. **Remove fixed extended thinking**: `sw:judge-llm` no longer passes `thinking.budget_tokens`. Adaptive prompt hints replace it. No user action needed — the CLI handles this automatically.
2. **Team-lead spawn heuristic**: `sw:team-lead` now only spawns agents when `domains ≥ 3` OR `tasks ≥ 15` OR `--parallel` is set. Previously it always spawned agents.
3. **State markers removed**: `skill-chain-*.json` and `interview-*.json` state files are no longer written. `pretooluse-guard.ts` uses TaskGet-based state with filesystem fallback.
### Config Migration
Run the migration script to add new config keys with defaults:
```bash
specweave migrate-config-0669
```
### Deprecations
See full list in [CHANGELOG.md](./CHANGELOG.md). Key migrations:
- `/sw:tdd-red` → `/sw:tdd-cycle --phase red`
- `/sw:github-sync` → `/sw-github:sync-spec`
- `/sw:team-build` → `/sw:team-lead --preset `
Full migration guide: `.specweave/docs/internal/specs/opus-47-migration.md`
## Documentation
**[spec-weave.com](https://spec-weave.com)** — guides, reference, and enterprise docs.
## Community
[Discord](https://discord.gg/UYg4BGJ65V) · [YouTube](https://www.youtube.com/@antonabyzov) · [GitHub Issues](https://github.com/anton-abyzov/specweave/issues)
## License
MIT — [github.com/anton-abyzov/specweave](https://github.com/anton-abyzov/specweave)