https://github.com/shadowdevcode/ai-product-os

A simulated, end-to-end product development organization where specialized AI agents collaborate to take an idea from raw hypothesis to deployed, instrumented product — following the same rigor as a real product team.
https://github.com/shadowdevcode/ai-product-os
ai-agents claude-ai claude-code productivity productmanagement
Last synced: about 2 months ago
JSON representation
Host: GitHub
URL: https://github.com/shadowdevcode/ai-product-os
Owner: shadowdevcode
Created: 2026-03-08T10:19:41.000Z (3 months ago)
Default Branch: main
Last Pushed: 2026-04-30T18:03:31.000Z (about 2 months ago)
Last Synced: 2026-04-30T19:15:32.816Z (about 2 months ago)
Topics: ai-agents, claude-ai, claude-code, productivity, productmanagement
Language: TypeScript
Homepage: https://ai-product-os-493e.vercel.app/
Size: 2.18 MB
Stars: 1
Watchers: 0
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- Agents: AGENTS.md
Awesome Lists containing this project

README

          # AI Product Operating System

A simulated, end-to-end product development organization where specialized AI agents collaborate to take an idea from raw hypothesis to deployed, instrumented product — following the same rigor as a real product team.

**Who this is for:** Product Managers, indie founders, and ICPs who want to ship AI-assisted products faster without skipping the parts that matter — research, architecture review, QA, metrics, and learning.

> **Live:** [ai-product-os-493e.vercel.app](https://ai-product-os-493e.vercel.app/) | Built and operated with [Claude Code](https://claude.ai/code). You need Claude Code to run the slash commands.

>

> Deploy note: `ai-product-os-493e` is the current landing Vercel project. MoneyMirror was a separate Vercel project in earlier cycles; its legacy app code has been removed from this monorepo and its archived Vercel project is no longer linked to this GitHub repo.

---

## Quick Navigation

| What you're looking for                  | Where to find it                                                         |

| ---------------------------------------- | ------------------------------------------------------------------------ |

| Active project status, stage, blockers   | [`project-state.md`](project-state.md)                                   |

| Product ideas and issue definitions      | [`experiments/ideas/issue-NNN.md`](experiments/ideas/)                   |

| Market research and problem exploration  | [`experiments/exploration/exploration-NNN.md`](experiments/exploration/) |

| PRDs, UX specs, architecture plans       | [`experiments/plans/plan-NNN.md`](experiments/plans/)                    |

| QA, code review, metrics, deploy results | [`experiments/results/`](experiments/results/)                           |

| Demo scripts and presentations           | [`experiments/demos/`](experiments/demos/)                               |

| Built app codebases                      | [`apps/[project-name]/`](apps/)                                          |

| Pipeline command instructions            | [`commands/`](commands/)                                                 |

| Agent role definitions                   | [`agents/`](agents/)                                                     |

| Engineering and product knowledge base   | [`knowledge/`](knowledge/)                                               |

| Quality gate rules and stage progression | [`system-orchestrator.md`](system-orchestrator.md)                       |

| Command execution framework              | [`command-protocol.md`](command-protocol.md)                             |

---

## Projects Built

| Issue | App                                                  | What It Does                                                                                                                                      | Stack                                                    | Status   |

| ----- | ---------------------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------- | -------------------------------------------------------- | -------- |

| 002   | —                                                    | Gmail → WhatsApp daily digest summarizer                                                                                                          | Next.js, Supabase, Gemini, Twilio                        | Archived |

| 003   | [finance-advisor](apps/finance-advisor/)             | AI personal finance advisor                                                                                                                       | Next.js, Supabase, Gemini                                | Complete |

| 004   | [clarity](apps/clarity/)                             | PM to-do list with AI task categorization                                                                                                         | Next.js, Neon, Gemini                                    | Complete |

| 005   | [smb-bundler](apps/smb-bundler/)                     | Feature bundle + value-based pricing engine for B2B SaaS PMs                                                                                      | Next.js, Neon, Gemini                                    | Complete |

| 006   | [ozi-reorder](apps/ozi-reorder/)                     | Reorder reminder experiment for dark-store baby essentials (50/50 test vs. control, 7 PostHog events)                                             | Next.js, Neon, PostHog                                   | Complete |

| 007   | [ozi-insights](apps/ozi-insights/)                   | Synthetic Freshdesk support data for order reliability research (30 tickets, grounded in Play Store)                                              | Data workspace                                           | Explored |

| 008   | [nykaa-personalisation](apps/nykaa-personalisation/) | Hyper-personalized discovery feed (affinity + intent scoring, 10 PostHog events, 5 API routes)                                                    | Next.js, Neon, PostHog                                   | Archived |

| 009   | money-mirror                                         | AI personal finance coach — parses Indian bank + credit card PDFs, reveals perception gap, delivers consequence-first nudges. Weekly recap email. | Next.js 16, Neon Auth, Neon DB, Gemini 2.5 Flash, Resend | Archived |

| 013   | [research-copilot](apps/research-copilot/)           | Chat-first PM research workspace with plan approval, visible tool orchestration, cited findings, and Markdown export.                             | Next.js 16, Neon Postgres, OpenRouter, PostHog           | T0       |

| —     | [landing](apps/landing/)                             | Framework landing page showcasing pipeline, agents, and shipped products                                                                          | Next.js, Tailwind CSS                                    | Live     |

Each issue number maps directly across all folders: `experiments/ideas/issue-NNN.md`, `experiments/exploration/exploration-NNN.md`, `experiments/plans/plan-NNN.md`, and `experiments/results/*-NNN.md`.

---

## The 12-Step Pipeline

The OS enforces a sequential pipeline with quality gates. A stage cannot start until the previous stage passes.

| #   | Command         | Agent                                    | Output                                               |

| --- | --------------- | ---------------------------------------- | ---------------------------------------------------- |

| 1   | `/create-issue` | Research Agent                           | Structured opportunity brief                         |

| 2   | `/explore`      | Research Agent                           | Market validation, recommendation                    |

| 3   | `/create-plan`  | Product + Design + Backend/DB Architects | PRD, UX, architecture, DB schema                     |

| 4   | `/execute-plan` | Frontend + Backend Engineers             | Working app codebase                                 |

| 5   | `/deslop`       | Deslop Agent                             | Clean, comment-free code                             |

| 6   | `/review`       | Code Review Agent                        | Critical issues list (blocks until fixed)            |

| 7   | `/peer-review`  | Peer Review Agent                        | Adversarial architecture review                      |

| 8   | `/qa-test`      | QA Agent                                 | Reliability and edge-case test results               |

| 9   | `/metric-plan`  | Analytics Agent                          | North Star, funnels, ground-truth queries            |

| 10  | `/deploy-check` | Deploy Agent                             | Production readiness sign-off                        |

| 11  | `/postmortem`   | Learning Agent                           | Root cause analysis of pipeline failures             |

| 12  | `/learning`     | Learning Agent                           | Engineering rules extracted → knowledge base updated |

**Utility commands** (run anytime):

- `/docs` — Generate `CODEBASE-CONTEXT.md` for the active app

- `/explain` — Deep-dive on a concept, pattern, or error

- `/linear-bind` — Bind the active repo issue to a Linear team/project (auto-runs at end of `/create-issue`)

- `/linear-sync [mode]` — Mirror repo artifacts into Linear; modes: `issue`, `plan`, `status`, `release`

- `/linear-brief` — Read-only summary of the current Linear state; use before reviews or standups

- `/linear-close` — Finalize and archive the Linear project after `/learning`

---

## Knowledge Base

The system gets smarter with every cycle. After each `/learning` run, insights from postmortems are extracted into durable rules:

- [`knowledge/engineering-lessons.md`](knowledge/engineering-lessons.md) — Technical rules (e.g., fan-out cron, pagination bounds, telemetry resilience)

- [`knowledge/product-lessons.md`](knowledge/product-lessons.md) — Product patterns and anti-patterns

- [`knowledge/prompt-library.md`](knowledge/prompt-library.md) — Refined agent prompts extracted from what worked

- [`knowledge/coding-standards.md`](knowledge/coding-standards.md) — TypeScript, Next.js, Supabase/Neon standards

- [`knowledge/architecture-guide.md`](knowledge/architecture-guide.md) — Default system architecture patterns

- [`knowledge/analytics-framework.md`](knowledge/analytics-framework.md) — PostHog event schema and funnel design

Every agent reads the knowledge base before executing — preventing the same class of mistake from appearing twice.

---

## Linear PM Layer

Linear is an optional PM-facing layer on top of the repo workflow.

The source of truth remains in this repository:

- `project-state.md` is the canonical workflow state

- `experiments/` contains the canonical issue, exploration, plan, and result artifacts

- `experiments/linear-sync/` stores durable Linear sync identities per issue

- `commands/` defines the execution contracts

Linear exists to improve:

- prioritization

- roadmap visibility

- blocker communication

- task tracking from execution manifests

- release and closeout visibility

Recommended usage:

| Pipeline stage                        | Linear command             | What it does                               |

| ------------------------------------- | -------------------------- | ------------------------------------------ |

| `/create-issue`                       | (auto) bind + `sync issue` | Creates Linear project; syncs issue brief  |

| `/create-plan`                        | `/linear-sync plan`        | Publishes PRD summary + child tasks        |

| `/review`, `/peer-review`, `/qa-test` | `/linear-sync status`      | Reflects gate pass/fail, surfaces blockers |

| `/deploy-check`                       | `/linear-sync release`     | Attaches PR link and release notes         |

| `/learning`                           | `/linear-close`            | Marks project complete, archives it        |

If Linear is unavailable, the Linear utility command should fail explicitly. The 12-step pipeline remains usable because Linear is not the workflow engine.

---

## Getting Started (Forking This Repo)

1. **Check the current state** — read [`project-state.md`](project-state.md) to see what stage the system is at and which issue is active

2. **Pick an idea** — browse [`experiments/ideas/`](experiments/ideas/) for context on past issues, or create a new one with `/create-issue`

3. **Run commands sequentially** — pass the command file from [`commands/`](commands/) to Claude Code (e.g., paste `commands/create-issue.md` content and follow it)

4. **Read the knowledge base first** — every command in the pipeline reads all files in [`knowledge/`](knowledge/) before generating output to avoid repeating past mistakes

5. **Track gates, not just progress** — check `project-state.md` after each command; blocked = do not proceed

6. **Validate repo changes** — run `bun install` and `bun run validate` from the repo root before accepting agent work

**Default tech stack** (used across all apps):

- Frontend: Next.js 16+ (App Router), TypeScript strict, Tailwind CSS 4+

- Backend: Next.js API Routes, Neon DB (`@neondatabase/serverless`) or Supabase

- AI: Google Gemini 2.5 Flash/Pro via `@google/genai` with structured outputs

- Analytics: PostHog (`posthog-js` + `posthog-node`)

- Hosting: Vercel

**Environment setup per app:**

```bash

bun install

cd apps/[project-name]

cp .env.local.example .env.local   # fill in your keys

bun run dev

```

Each app includes a `schema.sql` (idempotent) that must be applied in your database editor before first run.

---

## The Human PM Role

Agents execute but do not replace judgment. The human PM is responsible for:

- Deciding which ideas to pursue

- Evaluating agent outputs at each stage

- Overriding blocked quality gates when the tradeoff is justified

- Making final product and architectural decisions

- Approving releases

---

## Repository Structure

```

/agents                    # Agent role definitions (one file per role)

/commands                  # Pipeline command instructions (one file per command)

/knowledge                 # Shared intelligence: standards, lessons, prompts

/experiments

  /ideas                   # Issue briefs (issue-NNN.md)

  /exploration             # Market validation outputs (exploration-NNN.md)

  /plans                   # PRDs, UX, architecture, DB schema (plan-NNN.md)

  /results                 # QA, reviews, metrics, deploy artifacts (*-NNN.md)

  /demos                   # Demo scripts and presentations

/apps                      # Built codebases (one folder per project)

  /[project-name]

    src/app/               # Next.js App Router pages and API routes

    src/components/        # UI components

    src/lib/               # Utilities, DB clients, AI helpers

    schema.sql             # Idempotent DB schema

    CODEBASE-CONTEXT.md    # Auto-generated docs (via /docs command)

    README.md              # Setup and run instructions

    .env.local.example     # Required env vars (no secrets)

project-state.md           # Live runtime memory — always check this first

system-orchestrator.md     # Quality gate rules and stage progression

command-protocol.md        # How commands load context and update state

```

---

## System Evolution

This OS isn't static — it improves with every cycle. Full history in [`CHANGELOG.md`](CHANGELOG.md).

| Phase  | What Changed                                                                                                                                                                                                                                                                                                | PM Decision                                                                                                                                       |

| ------ | ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------- |

| **v0** | Manual pipeline, no enforcement                                                                                                                                                                                                                                                                             | Proved the 12-step concept works end-to-end                                                                                                       |

| **v1** | Quality gates, knowledge base, 5 shipped products                                                                                                                                                                                                                                                           | Each postmortem generated rules that prevented the same class of failure in the next cycle                                                        |

| **v2** | Developer tooling (husky, prettier, enforcement scripts), test infrastructure (Vitest + shared mocks), progressive disclosure (.claude/rules/), landing page                                                                                                                                                | Finalized with Nykaa Hyper-Personalization (issue-008). AI Product OS v2 is now feature-complete.                                                 |

| **v3** | MoneyMirror (issue-009): most complex pipeline cycle — Neon Auth, multi-bank PDF parsing, credit card flows, perception gap dashboard, weekly recap email. Phase 2 adds 3-tab dashboard, statement library, month picker, upload labels, multi-account. Linear PM layer + real-time feedback capture added. | First cycle requiring live DB migration, production smoke, and Vercel runtime fix before passing deploy-check. 7 new engineering rules extracted. |

---

_Build faster. Learn systematically. Fail safely._
ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/shadowdevcode/ai-product-os

Awesome Lists containing this project

README