{"id":47178599,"url":"https://github.com/tim101010101/arena","last_synced_at":"2026-05-09T11:25:35.596Z","repository":{"id":343478434,"uuid":"1177666624","full_name":"tim101010101/arena","owner":"tim101010101","description":"Multi-agent AI arena for debates, code reviews, and red-team challenges via Model Context Protocol (MCP)","archived":false,"fork":false,"pushed_at":"2026-03-10T13:51:00.000Z","size":60,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":0,"default_branch":"main","last_synced_at":"2026-03-10T20:17:16.409Z","etag":null,"topics":["adversarial-ai","ai-agents","ai-arena","ai-debate","claude","code-review","codex","collective-intelligence","gemini","mcp","mcp-server","model-context-protocol","multi-agent","openai","red-team"],"latest_commit_sha":null,"homepage":null,"language":"TypeScript","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/tim101010101.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":"CONTRIBUTING.md","funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":"SECURITY.md","support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null,"notice":null,"maintainers":null,"copyright":null,"agents":null,"dco":null,"cla":null}},"created_at":"2026-03-10T08:50:33.000Z","updated_at":"2026-03-10T13:51:04.000Z","dependencies_parsed_at":null,"dependency_job_id":null,"html_url":"https://github.com/tim101010101/arena","commit_stats":null,"previous_names":["tim101010101/arena"],"tags_count":2,"template":false,"template_full_name":null,"purl":"pkg:github/tim101010101/arena","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/tim101010101%2Farena","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/tim101010101%2Farena/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/tim101010101%2Farena/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/tim101010101%2Farena/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/tim101010101","download_url":"https://codeload.github.com/tim101010101/arena/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/tim101010101%2Farena/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":30457986,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-03-13T03:55:51.346Z","status":"ssl_error","status_checked_at":"2026-03-13T03:55:33.055Z","response_time":60,"last_error":"SSL_read: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["adversarial-ai","ai-agents","ai-arena","ai-debate","claude","code-review","codex","collective-intelligence","gemini","mcp","mcp-server","model-context-protocol","multi-agent","openai","red-team"],"created_at":"2026-03-13T07:02:32.298Z","updated_at":"2026-05-09T11:25:35.589Z","avatar_url":"https://github.com/tim101010101.png","language":"TypeScript","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Arena\n\n```\n                         █████╗ ██████╗ ███████╗███╗   ██╗ █████╗\n                        ██╔══██╗██╔══██╗██╔════╝████╗  ██║██╔══██╗\n                        ███████║██████╔╝█████╗  ██╔██╗ ██║███████║\n                        ██╔══██║██╔══██╗██╔══╝  ██║╚██╗██║██╔══██║\n                        ██║  ██║██║  ██║███████╗██║ ╚████║██║  ██║\n                        ╚═╝  ╚═╝╚═╝  ╚═╝╚══════╝╚═╝  ╚═══╝╚═╝  ╚═╝\n```\n\n[![CI](https://github.com/tim101010101/arena/actions/workflows/ci.yml/badge.svg)](https://github.com/tim101010101/arena/actions/workflows/ci.yml)\n[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)\n[![Version](https://img.shields.io/github/v/release/tim101010101/arena)](https://github.com/tim101010101/arena/releases)\n\n**A position-driven adversarial arena for AI agents.** Host provides context and 2+ opposing positions; arena dispatches local CLI models (Claude, Codex, Gemini, OpenAI, Kimi) to argue each position over multiple rounds and returns the transcript.\n\nA standalone CLI — invoke it from your shell, scripts, or any agent that can run shell commands.\n\n## Mental model\n\n- **Host doesn't fight.** The caller (Claude Code, Codex CLI, scripts) just supplies what should be argued and which positions to argue.\n- **Position is the unit, not the model.** Adversarial value comes from clashing stances, not from \"which model wins\". Same model with two different system prompts is a valid pair if no other CLI is available.\n- **Arena owns model dispatch.** It picks distinct models when multiple CLIs are healthy, falls back to reusing one when not.\n\n## Subcommands\n\n| Subcommand | Purpose |\n|---|---|\n| `arena challenge` | Core. Run N positions over R rounds against the supplied context. |\n| `arena review` | Code-review preset over `arena challenge`. Spawns attacker positions (default: bug-hunter + security-auditor) on the supplied code/diff. |\n| `arena health` | List agent CLIs and their availability. |\n| `arena mcp` | Start arena as a stdio MCP server — exposes each scenario as a tool callable from any MCP client. |\n\n## Install\n\n```bash\n# Required: at least one of these CLIs in $PATH\nnpm install -g @anthropic-ai/claude-cli   # for \"claude\"\nnpm install -g @codex-ai/cli              # for \"codex\" / \"openai\" / \"gemini\"\nuv tool install kimi-cli                  # for \"kimi\" (or: pipx install kimi-cli)\n```\n\n### Shell (no npm/node required)\n\nDownloads a self-contained native binary from the latest GitHub release. Supports macOS (arm64/x64) and Linux (arm64/x64).\n\n```bash\ncurl -fsSL https://raw.githubusercontent.com/tim101010101/arena/main/install.sh | bash\n```\n\nInstalls to `~/.local/bin/arena`. Override the directory with `ARENA_INSTALL_DIR`, or pin a version with `ARENA_VERSION`:\n\n```bash\nARENA_INSTALL_DIR=/usr/local/bin ARENA_VERSION=v0.1.3 \\\n  curl -fsSL https://raw.githubusercontent.com/tim101010101/arena/main/install.sh | bash\n```\n\n### npm\n\n```bash\nnpm install -g arena-debate     # or: npx arena-debate\n```\n\n## CLI usage\n\n```bash\n# Adversarial debate — supply your own positions\narena challenge \\\n  --context \"Should we use microservices or a monolith for a 10k-user product with 5 devs?\" \\\n  --position \"Pro-microservices: team boundaries justify the split\" \\\n  --position \"Pro-monolith: a 5-person team should not carry the ops burden\" \\\n  --rounds 3\n\n# Adversarial code review (positions auto-derived from --focus)\narena review --git-ref feature/auth --focus bugs,security\n\narena review --files src/login.ts,src/session.ts --focus security\n\n# Override which models to use (must already be healthy)\narena challenge --context \"...\" --position a --position b --models claude,codex\n\n# Diagnostics\narena health\narena --version\narena --help\n```\n\n## MCP server\n\n`arena mcp` starts a stdio MCP server. Each loaded scenario (`challenge`, `review`, and any user-defined ones) is exposed as an MCP tool; a `health` tool is also included.\n\nAdd it to your MCP client config (e.g. Claude Desktop or Claude Code `.mcp.json`):\n\n```json\n{\n  \"mcpServers\": {\n    \"arena\": {\n      \"command\": \"arena\",\n      \"args\": [\"mcp\"]\n    }\n  }\n}\n```\n\nOnce connected, your AI client can call:\n\n- **`challenge`** — supply `context` (string) and `positions` (array of ≥2 strings); optional `rounds` and `models`.\n- **`review`** — supply `sources` (array of source objects: `raw`, `git_ref`, `git_range`, `file_list`, or `patch_file`); optional `focus`, `rounds`, and `models`.\n- **`health`** — returns availability of all local agent CLIs.\n\n## Configuration (env vars)\n\n| Variable | Default | Notes |\n|---|---|---|\n| `ARENA_TIMEOUT_MS` | `120000` | Per-fighter execution timeout |\n| `ARENA_DEFAULT_ROUNDS` | `3` | Default rounds when not specified |\n| `ARENA_DEFAULT_MODE` | `parallel` | Reserved (challenge runs sequentially) |\n| `ARENA_MAX_CONTEXT_SIZE` | `1000000` | Max bytes from `sources` |\n| `ARENA_CLAUDE_MODEL` / `ARENA_CODEX_MODEL` / `ARENA_GEMINI_MODEL` / `ARENA_OPENAI_MODEL` / `ARENA_KIMI_MODEL` | CLI default | Per-adapter model override |\n\n## Dispatch behavior\n\n```\npositions = [\"A\", \"B\"]\navailable = healthCheckAll().filter(ok)\noverride  = caller-supplied --models / models[]\n\npool = override ?? available\nfighter[i].model = pool[i % pool.length]\n```\n\n- Prefers distinct models when `len(positions) ≤ len(pool)`.\n- Cycles when positions outnumber the pool — same model, different prompts.\n- Each fighter gets a unique id (`\u003cmodel\u003e#\u003ci\u003e`) so transcripts stay disambiguated.\n\n## Development\n\n```bash\nbun install\nbun test          # full suite\nbun run build     # produces dist/index.js\n```\n\n## License\n\nMIT\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ftim101010101%2Farena","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Ftim101010101%2Farena","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ftim101010101%2Farena/lists"}