https://github.com/saferl-lab/cheetahclaws

CheetahClaws (Nano Claude Code): A Fast, Easy-to-Use, Python-Native Personal AI Assistant for Any Model, Inspired by OpenClaw and Claude Code, Built to Work for You Autonomously 24/7.
https://github.com/saferl-lab/cheetahclaws
agentic-ai claude claude-code memory openclaw python skills
Last synced: 3 months ago
JSON representation
CheetahClaws (Nano Claude Code): A Fast, Easy-to-Use, Python-Native Personal AI Assistant for Any Model, Inspired by OpenClaw and Claude Code, Built to Work for You Autonomously 24/7.
Host: GitHub
URL: https://github.com/saferl-lab/cheetahclaws
Owner: SafeRL-Lab
License: apache-2.0
Created: 2026-04-01T15:23:36.000Z (3 months ago)
Default Branch: main
Last Pushed: 2026-04-07T22:22:57.000Z (3 months ago)
Last Synced: 2026-04-08T00:02:45.894Z (3 months ago)
Topics: agentic-ai, claude, claude-code, memory, openclaw, python, skills
Language: Python
Homepage: https://deepwiki.com/SafeRL-Lab/cheetahclaws
Size: 7.85 MB
Stars: 438
Watchers: 6
Forks: 174
Open Issues: 4
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project

README

          English | [中文](https://github.com/SafeRL-Lab/clawspring/blob/main/docs/README.CN.MD) | [Français](https://github.com/SafeRL-Lab/clawspring/blob/main/docs/README.FR.MD) | [한국어](https://github.com/SafeRL-Lab/clawspring/blob/main/docs/README.KO.MD) | [日本語](https://github.com/SafeRL-Lab/clawspring/blob/main/docs/README.JP.MD) | [Deutsch](https://github.com/SafeRL-Lab/clawspring/blob/main/docs/README.DE.MD) | [Português](https://github.com/SafeRL-Lab/clawspring/blob/main/docs/README.ES.MD)






  

     

  

  



CheetahClaws (Nano Claude Code): A Fast, Easy-to-Use, Python-Native Personal AI Assistant for Any Model, Inspired by OpenClaw and Claude Code, Built to Work for You Autonomously 24/7



    The newest source of Claude Code

    ·

    Issue

  ·

    Brief Intro

  

  



 


  

 



Task Excution

 

 

 

---

  


  

 



Brainstorm Mode: Multi-Agent Brainstorm

 


---

  


  

 



Proactive Mode: Autonomous Agent

 


---

  


  

 



SSJ Developer Mode: Power Menu Workflow

 


---

  


  

 



Telegram Bridge: Control cheetahclaws from Your Phone

 


---

 

## 🔥🔥🔥 News (Pacific Time)

- Apr 06, 2026 (**v3.05.52**): **Checkpoint system, plan mode, compact, and utility commands, support MiniMax Models** 

  - **Checkpoint system** (`checkpoint/` package): auto-snapshots conversation state and file changes after every turn. `/checkpoint` lists all snapshots; `/checkpoint ` rewinds both files and conversation history to any previous state; `/checkpoint clear` removes all snapshots for the session. `/rewind` is an alias. 100-snapshot sliding window; initial snapshot captured at session start. Throttling: skips when nothing changed. File backups use copy-on-write; snapshots capture post-edit state.

  - **Plan mode**: `/plan ` enters a read-only analysis mode — Claude may only read the codebase and write to a dedicated plan file (`.nano_claude/plans/.md`). All other writes are silently blocked with a helpful message. `/plan` shows the current plan; `/plan done` exits plan mode and restores original permissions; `/plan status` reports whether plan mode is active. Two new agent tools — `EnterPlanMode` and `ExitPlanMode` — let Claude autonomously enter and exit plan mode for complex multi-file tasks; both are auto-approved in all permission modes.

  - **`/compact [focus]`**: manually trigger conversation compaction at any time. An optional focus string guides the LLM summarizer on what context to preserve. Auto-compact and manual compact both restore plan file context after compaction.

  - **Utility commands**: `/init` creates a `CLAUDE.md` template in the current directory; `/export [filename]` exports the conversation as Markdown (default) or JSON; `/copy` copies the last assistant response to the clipboard (Windows/macOS/Linux); `/status` shows version, model, provider, permissions, session ID, token usage, and context %; `/doctor` diagnoses installation health (Python version, git, API key + live connectivity test, optional deps, CLAUDE.md presence, checkpoint disk usage, permission mode).

- Apr 06, 2026 (**v3.05.51**): **Project renamed from Nano Claude Code to CheetahClaws**

  - The project has been rebranded from **Nano Claude Code** to **CheetahClaws** — a more distinctive name that captures the spirit of the tool: a sharp, agile coding assistant. The `Cl` in CheetahClaws is a subtle nod to Claude.

  - CLI command: `nano_claude` → `cheetahclaws`

  - PyPI package: `nano-claude-code` → `cheetahclaws`

  - Config directory: `~/.nano_claude/` → `~/.clawnest/` → `~/.cheetahclaws/`

  - Main entry point: `nano_claude.py` → `cheetahclaws.py`

  - All documentation, GitHub URLs, and internal references updated accordingly.

  - Added **CheetahClaws vs OpenClaw** comparison section to README.

- 00.29 PM, Apr 06, 2026 (**v3.05.5**): **SSJ Developer Mode, Telegram Bridge, Worker Command, and UX improvements** 

  - **`/ssj` — SSJ Developer Mode**: Interactive power menu with 10 workflow options: Brainstorm, TODO viewer, Worker, Expert Debate, Propose Improvements, Code Review, README generator, Commit helper, Git Diff Scan, and Idea-to-Tasks Promotion. Menu stays open between actions and supports `/command` passthrough (e.g. `/exit` works from inside SSJ).

  - **`/worker` command**: Auto-implements pending tasks from `brainstorm_outputs/todo_list.txt` one by one. Supports selecting specific tasks with comma-separated numbers (e.g. `1,4,6`), a custom todo file path (`--path /other/todo.md`), and a worker count limit (`--workers 3`). If you accidentally pass a brainstorm `.md` output file, Worker detects it and offers to redirect to `todo_list.txt` — or to generate it first from the brainstorm file and then run Worker automatically. Each task gets a dedicated prompt that reads code, implements the change, and marks it done.

  - **`/telegram` — Telegram Bot Bridge**: Receives messages via Telegram Bot API and routes them through the model, sending responses back to the chat. Auto-starts on launch if configured. Only responds to the authorized `chat_id`. Supports slash command passthrough (`/cost`, `/model`, etc.), shows a typing indicator while the model processes, and can be stopped remotely by sending `/stop` in Telegram.

  - **Brainstorm → TODO pipeline**: After brainstorm synthesis, automatically generates `brainstorm_outputs/todo_list.txt` with prioritized checkbox tasks. TODO viewer (SSJ option 2) shows only pending tasks as numbered (completed tasks shown with ✓ without a number).

  - **Expert Debate improvements**: SSJ option 4 now prompts for the number of debate agents (default 2, minimum 2); rounds are auto-calculated as `(agents × 2 − 1)`. The debate result is saved to the same directory as the debated file (`_debate_HHMMSS.md`). An animated per-round per-expert spinner (`⚔️ Round 2/3 — Expert 1 thinking...`) keeps the terminal lively throughout the debate.

  - **Brainstorm spinner**: Animated spinner with random phrases while brainstorm agents are thinking.

  - **Force quit**: 3× Ctrl+C within 2 seconds triggers `os._exit(1)` — kills the process immediately regardless of blocking I/O.

  - **Interactive Ollama Model Picker** — when a request fails with 404 (model not found), cheetahclaws queries the local Ollama API (`/api/tags`) and presents a numbered model selector to switch models and retry without restarting. Cancelling aborts gracefully without crashing the REPL.

  - **Windows file handling** — `_read`, `_write`, and `_edit` in `tools.py` now force UTF-8 encoding and `newline=""`. `_edit` detects pure-CRLF files (every `\n` is part of `\r\n`) and restores line endings after edit; mixed-line-ending files are left as-is to avoid corruption.

  - **/brainstorm command** — `/brainstorm [topic]` runs a multi-persona AI debate. The model first generates N expert personas tailored to the topic (geopolitics → analysts & diplomats; software → architects & engineers; etc.). Agent count is chosen interactively at runtime (2–100, default 5). Results are saved to `brainstorm_outputs/` and synthesized by the main agent. 

  - **Rich Live SSH fix** — Rich's in-place Live streaming is now automatically disabled in SSH sessions (`SSH_CLIENT`/`SSH_TTY` detected) where ANSI cursor-up breaks and causes repeated output lines. Override with `/config rich_live=true/false`.

  - **`threading.RLock`** — replaced `threading.Lock` with `RLock` to support re-entrant calls from brainstorm synthesis and Ollama retry paths.

- 05:39 PM, Apr 05, 2026 (**v3.05.4**): **Reasoning, Rendering, and Packaging Improvements, Enhanced Memory System, Native vision support for local Ollama models, Bracketed Paste Mode, Rich Tab Completion**

  - **Bracketed Paste Mode** — replaced the old timing-based multi-line paste detection with the standard terminal Bracketed Paste Mode protocol. Pasted text of any length (code blocks, long prompts, multi-paragraph instructions) is now collected as a single turn with zero latency and no blank-line artifacts. Falls back to a 60 ms timing window for terminals that don't support BPM. Bracketed paste mode is cleanly disabled on REPL exit.

  - **Rich Tab Completion with descriptions** — pressing Tab after `/` now shows every command with a one-line description and a hint of its subcommands. Typing `/plugin ` then Tab lists all subcommands (`install`, `uninstall`, `enable`, …). Auto-completes to the unique match when only one command matches the prefix. Subcommands supported for `/mcp`, `/plugin`, `/tasks`, `/cloudsave`, `/voice`, `/permissions`, `/proactive`, and `/memory`.

  - **Model name bug fix** — `--model ollama/qwen3.5:35b` no longer gets corrupted to `ollama/qwen3.5/35b`. The startup colon-to-slash conversion now only fires when the left side of `:` is a known provider name and no `/` is already present, preserving Ollama's `model:tag` format.

  - **Native vision support for local Ollama models** (`llava`, `gemma4`, `llama3.2-vision`): new `/image [prompt]` command captures the current clipboard image, encodes it to Base64, and attaches it to the next prompt. Install Pillow with `pip install cheetahclaws[vision]`; Linux users also need `xclip` (`sudo apt install xclip`).

  - **Enhanced Memory System** — added `confidence` / `source` / `last_used_at` / `conflict_group` metadata to every memory entry; conflict detection on `MemorySave` warns before overwriting; `MemorySearch` re-ranks results by `confidence × recency` (30-day decay) and updates `last_used_at` on hits; new `/memory consolidate` command runs a lightweight AI analysis of the current session and auto-saves up to 3 long-term insights (user preferences, feedback corrections, project decisions) at 0.8 confidence — never overwrites higher-confidence user memories.

  - **Post-merge fixes** — removed a debug `debug_payload.json` file write that was firing on every OpenAI-compatible API call (left over from PR #11 development). Also fixed ANSI dim color not being reset after the thinking block ends, which caused subsequent text to appear dim in non-Rich terminals. Bumped `pyproject.toml` version to `3.05.4`, and moved `sounddevice` to the optional `voice` extra (`pip install cheetahclaws[voice]`).

  - **Native Ollama reasoning + terminal rendering fix** — local reasoning models (`deepseek-r1`, `qwen3`, `gemma4`) now stream their `` blocks to the terminal. Ollama exposes thoughts in `msg["thinking"]`, but cheetahclaws was previously dropping them; this is now fixed by yielding `ThinkingChunk` from the Ollama adapter. Also fixed a Windows CMD/PowerShell rendering issue where token-by-token ANSI dim resets caused thoughts to print vertically, and corrected `flush_response()` so it runs once at the end instead of on every thinking token. Enable with `/verbose` and `/thinking`.

  - **uv support** — added `pyproject.toml`; install with `uv tool install .` to make the `cheetahclaws` command available globally from anywhere in an isolated environment, without manual PATH setup.

- 00:41 PM, Apr 05, 2026: **v3.05.3 add structured session history** — Structured session history: on every exit, sessions are saved to `daily/YYYY-MM-DD/` (capped at `session_daily_limit`, default 5 per day) and appended to a master `history.json` (capped at `session_history_limit`, default 100). Each session file now includes `session_id` and `saved_at` metadata. `/load` groups sessions by date with time, ID, and turn-count display; supports multi-select (`1,2,3`) to merge sessions and `H` to load the full history with token-count confirmation. Both limits are configurable via `/config`.

- 00:41 PM, Apr 05, 2026: **v3.05.3 fix session** — Structured session history: on every exit, sessions are saved to `daily/YYYY-MM-DD/` (capped at `session_daily_limit`, default 5 per day) and appended to a master `history.json` (capped at `session_history_limit`, default 100). Each session file now includes `session_id` and `saved_at` metadata. `/load` groups sessions by date with time, ID, and turn-count display; supports multi-select (`1,2,3`) to merge sessions and `H` to load the full history with token-count confirmation. Both limits are configurable via `/config`.

- 09:34 AM, Apr 05, 2026: **v3.05.3** — Added GitHub Gist cloud sync: `/cloudsave setup ` to configure, `/cloudsave` to upload the current session to a private Gist, `/cloudsave auto on` to sync automatically on `/exit`, `/cloudsave list` to browse cloud sessions, and `/cloudsave load ` to restore from the cloud. Uses stdlib `urllib` — no new dependencies. Also added version number (e.g., `v3.05.2`) in the startup banner: The startup banner now displays the current version number (v3.05.2) in green, making it easy to identify which version is running at a glance.

- 08:58 AM, Apr 05, 2026: **v3.05.2** — Introduced `/proactive [duration]` command: a background daemon thread watches for user inactivity and automatically wakes the agent up after the specified interval (e.g. `/proactive 5m`), enabling continuous monitoring loops without user intervention. `/proactive` with no args now shows current status; `/proactive off` disables it explicitly. Proactive polling state is stored in `config` (no module-level globals). Watcher exceptions are logged via `traceback` instead of silently swallowed. Also fixed duplicated output in Rich-enabled terminals by buffering text during streaming and rendering Markdown once via `rich.live.Live` — updates happen in-place for a true streaming Markdown experience. 

- 10:51 PM, Apr 04, 2026: **v3.05_fix04** — Fixed a crash on `/model` and config save commands caused by the newly introduced `_run_query_callback` being serialized to JSON; also added `SleepTimer` usage    

  guidance to the system prompt so the agent knows when to invoke background timers proactively.

- 10:28 PM, Apr 04, 2026: **v3.05_fix03** — Added a native `SleepTimer` tool that lets the agent schedule background timers and autonomously wake itself up after a delay — no user prompt required. Paired with a `threading.Lock` to prevent output collisions when background and foreground calls overlap. Also includes cross-platform fixes: Windows ANSI color support, CRLF-aware Edit tool matching, an interactive numbered menu for `/load`, native Ollama streaming via `/api/chat`, and auto-capping `max_tokens` per provider to prevent API errors. 

- 08:31 PM, Apr 04, 2026: **v3.05_fix** — Autosave + `/resume`: session is automatically saved to `mr_sessions/session_latest.json` on `/exit`, `/quit`, `Ctrl+C`, and `Ctrl+D`. Run `/resume` to restore the last session instantly, or `/resume ` to load a specific file from `mr_sessions/`, and better support for api and local Ollama models (specifically gemma4), along with Windows compatibility enhancements, session management UX improvements, and cross-platform reliability fixes for the Edit tool.

- 00:41 AM, Apr 04, 2026: **v3.05** — Voice input (`voice/` package): `sounddevice` → `arecord` → SoX recording backends, `faster-whisper` → `openai-whisper` → OpenAI API STT backends. Smart keyterm extraction from git branch + project name + recent files passed as Whisper `initial_prompt` for coding-domain accuracy. `/voice`, `/voice status`, `/voice lang ` REPL commands. Works fully offline with no API key. 29 new tests (**~11.6K** lines of Python).

- 10:29 PM, Apr 03, 2026: **v3.04** — Expanded tool coverage: `NotebookEdit` (edit Jupyter `.ipynb` cells — replace/insert/delete with full JSON round-trip) and `GetDiagnostics` (LSP-style diagnostics via pyright/mypy/flake8/tsc/shellcheck). Also fixed a pre-existing schema-index bug in `_register_builtins` by switching to name-based lookup (**~10.5K** lines of Python).

- 06:00 PM, Apr 03, 2026: **v3.03** — Task management system (`task/` package): `TaskCreate` / `TaskUpdate` / `TaskGet` / `TaskList` tools with sequential IDs, dependency edges (blocks/blocked_by), metadata, persistence to `.cheetahclaws/tasks.json`, thread-safe store, `/tasks` REPL command, 37 new tests (**~9500** lines of Python).

- 02:50 PM, Apr 03, 2026: **v3.02** — Plugin system (`plugin/` package): install/uninstall/enable/disable/update via `/plugin` CLI, recommendation engine (keyword+tag matching), multi-scope (user/project), git-based marketplace. `AskUserQuestion` tool: interactive mid-task user prompts with numbered options and free-text input (**~8500** lines of Python).

- 10:00 AM, Apr 03, 2026: **v3.01** — MCP (Model Context Protocol) support: `mcp/` package, stdio + SSE + HTTP transports, auto tool discovery, `/mcp` command, 34 new tests (**~7000** lines of Python).

- 12:20 PM, Apr 02, 2026: **v3.0** — Multi-agent packages (`multi_agent/`), memory package (`memory/`), skill package (`skill/`) with built-in skills, argument substitution, fork/inline execution, AI memory search, git worktree isolation, agent type definitions (**~5000** lines of Python), see [update](https://github.com/SafeRL-Lab/clawspring/blob/main/docs/update_readme_v3.0.md).

- 10:00 AM, Apr 02, 2026: **v2.0** — Context compression, memory, sub-agents, skills, diff view, tool plugin system (**~3400** lines of Python Code).

- 01:47 PM, Apr 01, 2026: Support VLLM inference (**~2000** lines of Python Code).

- 11:30 AM, Apr 01, 2026: Support more **closed-source** models and **open-source models**: Claude, GPT, Gemini, Kimi, Qwen, Zhipu, DeepSeek, and local open-source models via Ollama or any OpenAI-compatible endpoint. (**~1700** lines of Python Code).

- 09:50 AM, Apr 01, 2026: Support more **closed-source** models: Claude, GPT, Gemini. (**~1300** lines of Python Code).

- 08:23 AM, Apr 01, 2026: Release the initial version of CheetahClaws (**~900 lines** of Python Code).


---

# CheetahClaws

CheetahClaws: **A Lightweight** and **Easy-to-Use** Python Reimplementation of Claude Code **Supporting Any Model**, such as Claude, GPT, Gemini, Kimi, Qwen, Zhipu, DeepSeek, MiniMax, and local open-source models via Ollama or any OpenAI-compatible endpoint.

---

## Content

  * [Why CheetahClaws](#why-cheetahclaws)

  * [CheetahClaws vs OpenClaw](#cheetahclaws-vs-openclaw)

  * [Features](#features)

  * [Supported Models](#supported-models)

  * [Installation](#installation)

  * [Usage: Closed-Source API Models](#usage--closed-source-api-models)

  * [Usage: Open-Source Models (Local)](#usage--open-source-models--local-)

  * [Model Name Format](#model-name-format)

  * [CLI Reference](#cli-reference)

  * [Slash Commands (REPL)](#slash-commands--repl-)

  * [Configuring API Keys](#configuring-api-keys)

  * [Permission System](#permission-system)

  * [Built-in Tools](#built-in-tools)

  * [Memory](#memory)

  * [Skills](#skills)

  * [Sub-Agents](#sub-agents)

  * [MCP (Model Context Protocol)](#mcp-model-context-protocol)

  * [Plugin System](#plugin-system)

  * [AskUserQuestion Tool](#askuserquestion-tool)

  * [Task Management](#task-management)

  * [Voice Input](#voice-input)

  * [Brainstorm](#brainstorm)

  * [SSJ Developer Mode](#ssj-developer-mode)

  * [Telegram Bridge](#telegram-bridge)

  * [Proactive Background Monitoring](#proactive-background-monitoring)

  * [Checkpoint System](#checkpoint-system)

  * [Plan Mode](#plan-mode)

  * [Context Compression](#context-compression)

  * [Diff View](#diff-view)

  * [CLAUDE.md Support](#claudemd-support)

  * [Session Management](#session-management)

  * [Cloud Sync (GitHub Gist)](#cloud-sync-github-gist)

  * [Project Structure](#project-structure)

  * [FAQ](#faq)

## Why CheetahClaws

Claude Code is a powerful, production-grade AI coding assistant — but its source code is a compiled, 12 MB TypeScript/Node.js bundle (~1,300 files, ~283K lines). It is tightly coupled to the Anthropic API, hard to modify, and impossible to run against a local or alternative model.

**CheetahClaws** reimplements the same core loop in ~10K lines of readable Python, keeping everything you need and dropping what you don't. See here for more detailed analysis (CheetahClaws v3.03), [English version](https://github.com/SafeRL-Lab/clawspring/blob/main/docs/comparison_claude_code_vs_nano_v3.03_en.md) and [Chinese version](https://github.com/SafeRL-Lab/clawspring/blob/main/docs/comparison_claude_code_vs_nano_v3.03_cn.md)

### At a glance

| Dimension | Claude Code (TypeScript) | CheetahClaws (Python) |

|-----------|--------------------------|---------------------------|

| Language | TypeScript + React/Ink | Python 3.8+ |

| Source files | ~1,332 TS/TSX files | 51 Python files |

| Lines of code | ~283K | ~12K |

| Built-in tools | 44+ | 27 |

| Slash commands | 88 | 36 |

| Voice input | Proprietary Anthropic WebSocket (OAuth required) | Local Whisper / OpenAI API — works offline, no subscription |

| Model providers | Anthropic only | 8+ (Anthropic · OpenAI · Gemini · Kimi · Qwen · DeepSeek · MiniMax · Ollama · …) |

| Local models | No | Yes — Ollama, LM Studio, vLLM, any OpenAI-compatible endpoint |

| Build step required | Yes (Bun + esbuild) | No — run directly with `python cheetahclaws.py` (or install to use `cheetahclaws`) |

| Runtime extensibility | Closed (compile-time) | Open — `register_tool()` at runtime, Markdown skills, git plugins |

| Task dependency graph | No | Yes — `blocks` / `blocked_by` edges in `task/` package |

### Where Claude Code wins

- **UI quality** — React/Ink component tree with streaming rendering, fine-grained diff visualization, and dialog systems.

- **Tool breadth** — 44 tools including `RemoteTrigger`, `EnterWorktree`, and more UI-integrated tools.

- **Enterprise features** — MDM-managed config, team permission sync, OAuth, keychain storage, GrowthBook feature flags.

- **AI-driven memory extraction** — `extractMemories` service proactively extracts knowledge from conversations without explicit tool calls.

- **Production reliability** — single distributable `cli.js`, comprehensive test coverage, version-locked releases.

### Where CheetahClaws wins

- **Multi-provider** — switch between Claude, GPT-4o, Gemini 2.5 Pro, DeepSeek, Qwen, MiniMax, or a local Llama model with `--model` or `/model` — no recompile needed.

- **Local model support** — run entirely offline with Ollama, LM Studio, or any vLLM-hosted model.

- **Readable source** — the full agent loop is 174 lines (`agent.py`). Any Python developer can read, fork, and extend it in minutes.

- **Zero build** — `pip install -r requirements.txt` and you're running. Changes take effect immediately.

- **Dynamic extensibility** — register new tools at runtime with `register_tool(ToolDef(...))`, install skill packs from git URLs, or wire in any MCP server.

- **Task dependency graph** — `TaskCreate` / `TaskUpdate` support `blocks` / `blocked_by` edges for structured multi-step planning (not available in Claude Code).

- **Two-layer context compression** — rule-based snip + AI summarization, configurable via `preserve_last_n_turns`.

- **Notebook editing** — `NotebookEdit` directly manipulates `.ipynb` JSON (replace/insert/delete cells) with no kernel required.

- **Diagnostics without LSP server** — `GetDiagnostics` chains pyright → mypy → flake8 → py_compile for Python and tsc/shellcheck for other languages, with zero configuration.

- **Offline voice input** — `/voice` records via `sounddevice`/`arecord`/SoX, transcribes with local `faster-whisper` (no API key, no subscription), and auto-submits. Keyterms from your git branch and project files boost coding-term accuracy.

- **Cloud session sync** — `/cloudsave` backs up conversations to private GitHub Gists with zero extra dependencies; restore any past session on any machine with `/cloudsave load `.

- **SSJ Developer Mode** — `/ssj` opens a persistent power menu with 10 workflow shortcuts: Brainstorm → TODO → Worker pipeline, expert debate, code review, README generation, commit helper, and more. Stays open between actions; supports `/command` passthrough.

- **Telegram Bot Bridge** — `/telegram  ` turns cheetahclaws into a Telegram bot: receive user messages, run the model, and send back responses — all from your phone. Slash commands pass through, and a typing indicator keeps the chat feeling live.

- **Worker command** — `/worker` auto-implements pending tasks from `brainstorm_outputs/todo_list.txt`, marks each one done after completion, and supports task selection by number (e.g. `1,4,6`).

- **Force quit** — 3× Ctrl+C within 2 seconds triggers immediate `os._exit(1)`, unblocking any frozen I/O.

- **Proactive background monitoring** — `/proactive 5m` activates a sentinel daemon that wakes the agent automatically after a period of inactivity, enabling continuous monitoring loops, scheduled checks, or trading bots without user prompts.

- **Rich Live streaming rendering** — When `rich` is installed, responses stream as live-updating Markdown in place (no duplicate raw text), with clean tool-call interleaving.

- **Native Ollama reasoning** — Local reasoning models (deepseek-r1, qwen3, gemma4) stream their `` tokens directly to the terminal via `ThinkingChunk` events; enable with `/verbose` and `/thinking`.

- **Native Ollama vision** — `/image [prompt]` captures the clipboard and sends it to local vision models (llava, gemma4, llama3.2-vision) via Ollama's native image API. No cloud required.

- **Reliable multi-line paste** — Bracketed Paste Mode (`ESC[?2004h`) collects any pasted text — code blocks, multi-paragraph prompts, long diffs — as a single turn with zero latency and no blank-line artifacts.

- **Rich Tab completion** — Tab after `/` shows all commands with one-line descriptions and subcommand hints; subcommand Tab-complete works for `/mcp`, `/plugin`, `/tasks`, `/cloudsave`, and more.

- **Checkpoint & rewind** — `/checkpoint` lists all auto-snapshots of conversation + file state; `/checkpoint ` rewinds both files and history to any earlier point in the session.

- **Plan mode** — `/plan ` (or the `EnterPlanMode` tool) puts Claude into a structured read-only analysis phase; only the plan file is writable. Claude writes a detailed plan, then `/plan done` restores full write permissions for implementation.

---

## CheetahClaws vs OpenClaw

[OpenClaw](https://github.com/openclaw/openclaw) is another popular open-source AI assistant built on TypeScript/Node.js. The two projects have **different primary goals** — here is how they compare.

### At a glance

| Dimension | OpenClaw (TypeScript) | CheetahClaws (Python) |

|-----------|----------------------|---------------------|

| Language | TypeScript + Node.js | Python 3.8+ |

| Source files | ~10,349 TS/JS files | 51 Python files |

| Lines of code | ~245K | ~12K |

| Primary focus | Personal life assistant across messaging channels | AI **coding** assistant / developer tool |

| Architecture | Always-on Gateway daemon + companion apps | Zero-install terminal REPL |

| Messaging channels | 20+ (WhatsApp · Telegram · Slack · Discord · Signal · iMessage · Matrix · WeChat · …) | Terminal + optional Telegram bridge |

| Model providers | Multiple (cloud-first) | 7+ including full local support (Ollama · vLLM · LM Studio · …) |

| Local / offline models | Limited | Full — Ollama, vLLM, any OpenAI-compatible endpoint |

| Voice | Wake word · PTT · Talk Mode (macOS/iOS/Android) | Offline Whisper STT (local, no API key) |

| Code editing tools | Browser control, Canvas workspace | Read · Write · Edit · Bash · Glob · Grep · NotebookEdit · GetDiagnostics |

| Build step required | Yes (`pnpm install` + daemon setup) | No — `pip install` and run |

| Mobile companion | macOS menu bar + iOS/Android apps | — |

| Live Canvas / UI | Yes (A2UI agent-driven visual workspace) | — |

| MCP support | — | Yes (stdio/SSE/HTTP) |

| Runtime extensibility | Skills platform (bundled/managed/workspace) | `register_tool()` at runtime, MCP, git plugins, Markdown skills |

| Hackability | Large codebase (245K lines), harder to modify | ~12K lines — full agent loop visible in one file |

### Where OpenClaw wins

- **Omni-channel inbox** — connects to 20+ messaging platforms (WhatsApp, Signal, iMessage, Discord, Teams, Matrix, WeChat…); users interact from wherever they already are.

- **Always-on daemon** — Gateway runs as a background service (launchd/systemd); no terminal required for day-to-day use.

- **Mobile-first** — macOS menu bar, iOS Voice Wake / Talk Mode, Android camera/screen recording — feels like a native app, not a CLI tool.

- **Live Canvas** — agent-driven visual workspace rendered in the browser; supports A2UI push/eval/snapshot.

- **Browser automation** — dedicated Chrome/Chromium profile with snapshot, actions, and upload tools.

- **Production reliability** — versioned npm releases, comprehensive CI, onboarding wizard, `openclaw doctor` diagnostics.

### Where CheetahClaws wins

- **Coding toolset** — Read/Write/Edit/Bash/Glob/Grep/NotebookEdit/GetDiagnostics are purpose-built for software development; CheetahClaws understands diffs, file trees, and code structure.

- **True local model support** — full Ollama/vLLM/LM Studio integration with streaming, tool-calling, and vision — no cloud required.

- **8+ model providers** — switch between Claude, GPT-4o, Gemini, DeepSeek, Qwen, MiniMax, and local models with a single `--model` flag.

- **Hackable in minutes** — 12K lines of readable Python; the entire agent loop is in `agent.py`; extend with `register_tool()` at runtime without rebuilding.

- **Zero setup** — `pip install cheetahclaws` and run `cheetahclaws`; no daemon, no pairing, no onboarding wizard.

- **MCP support** — connect any MCP server (stdio/SSE/HTTP); tools auto-registered.

- **SSJ Developer Mode** — `/ssj` power menu chains Brainstorm → TODO → Worker → Debate in a persistent interactive session; automates entire dev workflows.

- **Offline voice** — `/voice` transcribes locally with `faster-whisper`; no subscription, no OAuth, works without internet.

- **Session cloud sync** — `/cloudsave` backs up full conversations to private GitHub Gists with zero extra dependencies.

### When to choose which

| If you want… | Use |

|---|---|

| A personal assistant you can message on WhatsApp/Signal/Discord | **OpenClaw** |

| An AI coding assistant in your terminal | **CheetahClaws** |

| Full offline / local model support | **CheetahClaws** |

| A mobile-friendly always-on experience | **OpenClaw** |

| To read and modify the source in an afternoon | **CheetahClaws** |

| Browser automation and a visual Canvas | **OpenClaw** |

| Multi-provider LLM switching without rebuilding | **CheetahClaws** |

---

### Key design differences

**Agent loop** — CheetahClaws uses a Python generator that `yield`s typed events (`TextChunk`, `ToolStart`, `ToolEnd`, `TurnDone`). The entire loop is visible in one file, making it easy to add hooks, custom renderers, or logging.

**Tool registration** — every tool is a `ToolDef(name, schema, func, read_only, concurrent_safe)` dataclass. Any module can call `register_tool()` at import time; MCP servers, plugins, and skills all use the same mechanism.

**Context compression**

| | Claude Code | CheetahClaws |

|-|-------------|-----------------|

| Trigger | Exact token count | `len / 3.5` estimate, fires at 70 % |

| Layer 1 | — | Snip: truncate old tool outputs (no API cost) |

| Layer 2 | AI summarization | AI summarization of older turns |

| Control | System-managed | `preserve_last_n_turns` parameter |

**Memory** — Claude Code's `extractMemories` service has the model proactively surface facts. CheetahClaws's `memory/` package is tool-driven: the model calls `MemorySave` explicitly, which is more predictable and auditable. Each memory now carries `confidence`, `source`, `last_used_at`, and `conflict_group` metadata; search re-ranks by confidence × recency; and `/memory consolidate` offers a manual consolidation pass without silently modifying memories in the background.

### Who should use CheetahClaws

- Developers who want to **use a local or non-Anthropic model** as their coding assistant.

- Researchers studying **how agentic coding assistants work** — the entire system fits in one screen.

- Teams who need a **hackable baseline** to add proprietary tools, custom permission policies, or specialised agent types.

- Anyone who wants Claude Code-style productivity **without a Node.js build chain**.

---

## Features

| Feature | Details |

|---|---|

| Multi-provider | Anthropic · OpenAI · Gemini · Kimi · Qwen · Zhipu · DeepSeek · MiniMax · Ollama · LM Studio · Custom endpoint |

| Interactive REPL | readline history, Tab-complete slash commands with descriptions + subcommand hints; Bracketed Paste Mode for reliable multi-line paste |

| Agent loop | Streaming API + automatic tool-use loop |

| 27 built-in tools | Read · Write · Edit · Bash · Glob · Grep · WebFetch · WebSearch · **NotebookEdit** · **GetDiagnostics** · MemorySave · MemoryDelete · MemorySearch · MemoryList · Agent · SendMessage · CheckAgentResult · ListAgentTasks · ListAgentTypes · Skill · SkillList · AskUserQuestion · TaskCreate/Update/Get/List · **SleepTimer** · **EnterPlanMode** · **ExitPlanMode** · *(MCP + plugin tools auto-added at startup)* |

| MCP integration | Connect any MCP server (stdio/SSE/HTTP), tools auto-registered and callable by Claude |

| Plugin system | Install/uninstall/enable/disable/update plugins from git URLs or local paths; multi-scope (user/project); recommendation engine |

| AskUserQuestion | Claude can pause and ask the user a clarifying question mid-task, with optional numbered choices |

| Task management | TaskCreate/Update/Get/List tools; sequential IDs; dependency edges; metadata; persisted to `.cheetahclaws/tasks.json`; `/tasks` REPL command |

| Diff view | Git-style red/green diff display for Edit and Write |

| Context compression | Auto-compact long conversations to stay within model limits |

| Persistent memory | Dual-scope memory (user + project) with 4 types, confidence/source metadata, conflict detection, recency-weighted search, `last_used_at` tracking, and `/memory consolidate` for auto-extraction |

| Multi-agent | Spawn typed sub-agents (coder/reviewer/researcher/…), git worktree isolation, background mode |

| Skills | Built-in `/commit` · `/review` + custom markdown skills with argument substitution and fork/inline execution |

| Plugin tools | Register custom tools via `tool_registry.py` |

| Permission system | `auto` / `accept-all` / `manual` / `plan` modes |

| Checkpoints | Auto-snapshot conversation + file state after each turn; `/checkpoint` to list, `/checkpoint ` to rewind; `/rewind` alias; 100-snapshot sliding window |

| Plan mode | `/plan ` enters read-only analysis mode; Claude writes only to the plan file; `EnterPlanMode` / `ExitPlanMode` agent tools for autonomous planning |

| 36 slash commands | `/model` · `/config` · `/save` · `/cost` · `/memory` · `/skills` · `/agents` · `/voice` · `/proactive` · `/checkpoint` · `/plan` · `/compact` · `/status` · `/doctor` · … |

| Voice input | Record → transcribe → auto-submit. Backends: `sounddevice` / `arecord` / SoX + `faster-whisper` / `openai-whisper` / OpenAI API. Works fully offline. |

| Brainstorm | `/brainstorm [topic]` generates N expert personas suited to the topic (2–100, default 5, chosen interactively), runs an iterative debate, saves results to `brainstorm_outputs/`, and synthesizes a Master Plan + auto-generates `brainstorm_outputs/todo_list.txt`. |

| SSJ Developer Mode | `/ssj` opens a persistent interactive power menu with 10 shortcuts: Brainstorm, TODO viewer, Worker, Expert Debate, Propose, Review, Readme, Commit, Scan, Promote. Stays open between actions; `/command` passthrough supported. Debate shows animated per-round spinner and saves result next to the debated file. |

| Worker | `/worker [task#s]` reads `brainstorm_outputs/todo_list.txt`, implements each pending task with a dedicated model prompt, and marks it done (`- [x]`). Supports task selection (`/worker 1,4,6`), custom path (`--path`), and worker count limit (`--workers`). Detects and redirects accidental brainstorm `.md` paths. |

| Telegram bridge | `/telegram  ` starts a bot bridge: receive messages from Telegram, run the model, and reply — all from your phone. Typing indicator, slash command passthrough, and auto-start on launch if configured. |

| Vision input | `/image [prompt]` captures the clipboard image and sends it to a local vision model (Ollama `llava`, `gemma4`, `llama3.2-vision`). Requires `pip install cheetahclaws[vision]`; Linux also needs `xclip`. |

| Proactive monitoring | `/proactive [duration]` starts a background sentinel daemon; agent wakes automatically after inactivity, enabling continuous monitoring loops without user prompts |

| Force quit | 3× Ctrl+C within 2 seconds triggers `os._exit(1)` — kills the process immediately regardless of blocking I/O |

| Rich Live streaming | When `rich` is installed, responses render as live-updating Markdown in place. Auto-disabled in SSH sessions to prevent repeated output; override with `/config rich_live=false`. |

| Context injection | Auto-loads `CLAUDE.md`, git status, cwd, persistent memory |

| Session persistence | Autosave on exit to `daily/YYYY-MM-DD/` (per-day limit) + `history.json` (master, all sessions) + `session_latest.json` (/resume); sessions include `session_id` and `saved_at` metadata; `/load` grouped by date |

| Cloud sync | `/cloudsave` syncs sessions to private GitHub Gists; auto-sync on exit; load from cloud by Gist ID. No new dependencies (stdlib `urllib`). |

| Extended Thinking | Toggle on/off for Claude models; native `` block streaming for local Ollama reasoning models (deepseek-r1, qwen3, gemma4) |

| Cost tracking | Token usage + estimated USD cost |

| Non-interactive mode | `--print` flag for scripting / CI |

---

## Supported Models

### Closed-Source (API)

| Provider | Model | Context | Strengths | API Key Env |

|---|---|---|---|---|

| **Anthropic** | `claude-opus-4-6` | 200k | Most capable, best for complex reasoning | `ANTHROPIC_API_KEY` |

| **Anthropic** | `claude-sonnet-4-6` | 200k | Balanced speed & quality | `ANTHROPIC_API_KEY` |

| **Anthropic** | `claude-haiku-4-5-20251001` | 200k | Fast, cost-efficient | `ANTHROPIC_API_KEY` |

| **OpenAI** | `gpt-4o` | 128k | Strong multimodal & coding | `OPENAI_API_KEY` |

| **OpenAI** | `gpt-4o-mini` | 128k | Fast, cheap | `OPENAI_API_KEY` |

| **OpenAI** | `o3-mini` | 200k | Strong reasoning | `OPENAI_API_KEY` |

| **OpenAI** | `o1` | 200k | Advanced reasoning | `OPENAI_API_KEY` |

| **Google** | `gemini-2.5-pro-preview-03-25` | 1M | Long context, multimodal | `GEMINI_API_KEY` |

| **Google** | `gemini-2.0-flash` | 1M | Fast, large context | `GEMINI_API_KEY` |

| **Google** | `gemini-1.5-pro` | 2M | Largest context window | `GEMINI_API_KEY` |

| **Moonshot (Kimi)** | `moonshot-v1-8k` | 8k | Chinese & English | `MOONSHOT_API_KEY` |

| **Moonshot (Kimi)** | `moonshot-v1-32k` | 32k | Chinese & English | `MOONSHOT_API_KEY` |

| **Moonshot (Kimi)** | `moonshot-v1-128k` | 128k | Long context | `MOONSHOT_API_KEY` |

| **Alibaba (Qwen)** | `qwen-max` | 32k | Best Qwen quality | `DASHSCOPE_API_KEY` |

| **Alibaba (Qwen)** | `qwen-plus` | 128k | Balanced | `DASHSCOPE_API_KEY` |

| **Alibaba (Qwen)** | `qwen-turbo` | 1M | Fast, cheap | `DASHSCOPE_API_KEY` |

| **Alibaba (Qwen)** | `qwq-32b` | 32k | Strong reasoning | `DASHSCOPE_API_KEY` |

| **Zhipu (GLM)** | `glm-4-plus` | 128k | Best GLM quality | `ZHIPU_API_KEY` |

| **Zhipu (GLM)** | `glm-4` | 128k | General purpose | `ZHIPU_API_KEY` |

| **Zhipu (GLM)** | `glm-4-flash` | 128k | Free tier available | `ZHIPU_API_KEY` |

| **DeepSeek** | `deepseek-chat` | 64k | Strong coding | `DEEPSEEK_API_KEY` |

| **DeepSeek** | `deepseek-reasoner` | 64k | Chain-of-thought reasoning | `DEEPSEEK_API_KEY` |

| **MiniMax** | `MiniMax-Text-01` | 1M | Long context, strong reasoning | `MINIMAX_API_KEY` |

| **MiniMax** | `MiniMax-VL-01` | 1M | Vision + language | `MINIMAX_API_KEY` |

| **MiniMax** | `abab6.5s-chat` | 256k | Fast, cost-efficient | `MINIMAX_API_KEY` |

| **MiniMax** | `abab6.5-chat` | 256k | Balanced quality | `MINIMAX_API_KEY` |

### Open-Source (Local via Ollama)

| Model | Size | Strengths | Pull Command |

|---|---|---|---|

| `llama3.3` | 70B | General purpose, strong reasoning | `ollama pull llama3.3` |

| `llama3.2` | 3B / 11B | Lightweight | `ollama pull llama3.2` |

| `qwen2.5-coder` | 7B / 32B | **Best for coding tasks** | `ollama pull qwen2.5-coder` |

| `qwen2.5` | 7B / 72B | Chinese & English | `ollama pull qwen2.5` |

| `deepseek-r1` | 7B–70B | Reasoning, math | `ollama pull deepseek-r1` |

| `deepseek-coder-v2` | 16B | Coding | `ollama pull deepseek-coder-v2` |

| `mistral` | 7B | Fast, efficient | `ollama pull mistral` |

| `mixtral` | 8x7B | Strong MoE model | `ollama pull mixtral` |

| `phi4` | 14B | Microsoft, strong reasoning | `ollama pull phi4` |

| `gemma3` | 4B / 12B / 27B | Google open model | `ollama pull gemma3` |

| `codellama` | 7B / 34B | Code generation | `ollama pull codellama` |

| `llava` | 7B / 13B | **Vision** — image understanding | `ollama pull llava` |

| `llama3.2-vision` | 11B | **Vision** — multimodal reasoning | `ollama pull llama3.2-vision` |

> **Note:** Tool calling requires a model that supports function calling. Recommended local models: `qwen2.5-coder`, `llama3.3`, `mistral`, `phi4`.

> **Reasoning models:** `deepseek-r1`, `qwen3`, and `gemma4` stream native `` blocks. Enable with `/verbose` and `/thinking` to see thoughts in the terminal. Note: models fed a large system prompt (like cheetahclaws's 25 tool schemas) may suppress their thinking phase to avoid breaking the expected JSON format — this is model behavior, not a bug.

---

## Installation

### Recommended: install as a global command with `uv`

[uv](https://docs.astral.sh/uv/) installs `cheetahclaws` into an isolated environment and puts it on your PATH so you can run it from anywhere:

```bash

# Install uv (if not already installed)

curl -LsSf https://astral.sh/uv/install.sh | sh

# Clone and install

git clone https://github.com/SafeRL-Lab/clawspring

cd cheetahclaws

uv tool install .

```

After that, `cheetahclaws` is available as a global command:

```bash

cheetahclaws                        # start REPL

cheetahclaws --model gpt-4o         # choose a model

cheetahclaws -p "explain this"      # non-interactive

```

To update after pulling new code:

```bash

uv tool install . --reinstall

```

To uninstall:

```bash

uv tool uninstall cheetahclaws

```

### Alternative: run directly from the repo

```bash

git clone https://github.com/SafeRL-Lab/clawspring

cd cheetahclaws

pip install -r requirements.txt

# or manually (sounddevice is optional — only needed for /voice):

pip install anthropic openai httpx rich

pip install sounddevice  # optional: voice input

python cheetahclaws.py

```

---

## Usage: Closed-Source API Models

### Anthropic Claude

Get your API key at [console.anthropic.com](https://console.anthropic.com).

```bash

export ANTHROPIC_API_KEY=sk-ant-api03-...

# Default model (claude-opus-4-6)

cheetahclaws

# Choose a specific model

cheetahclaws --model claude-sonnet-4-6

cheetahclaws --model claude-haiku-4-5-20251001

# Enable Extended Thinking

cheetahclaws --model claude-opus-4-6 --thinking --verbose

```

### OpenAI GPT

Get your API key at [platform.openai.com](https://platform.openai.com).

```bash

export OPENAI_API_KEY=sk-...

cheetahclaws --model gpt-4o

cheetahclaws --model gpt-4o-mini

cheetahclaws --model gpt-4.1-mini

cheetahclaws --model o3-mini

```

### Google Gemini

Get your API key at [aistudio.google.com](https://aistudio.google.com).

```bash

export GEMINI_API_KEY=AIza...

cheetahclaws --model gemini/gemini-2.0-flash

cheetahclaws --model gemini/gemini-1.5-pro

cheetahclaws --model gemini/gemini-2.5-pro-preview-03-25

```

### Kimi (Moonshot AI)

Get your API key at [platform.moonshot.cn](https://platform.moonshot.cn).

```bash

export MOONSHOT_API_KEY=sk-...

cheetahclaws --model kimi/moonshot-v1-32k

cheetahclaws --model kimi/moonshot-v1-128k

```

### Qwen (Alibaba DashScope)

Get your API key at [dashscope.aliyun.com](https://dashscope.aliyun.com).

```bash

export DASHSCOPE_API_KEY=sk-...

cheetahclaws --model qwen/Qwen3.5-Plus

cheetahclaws --model qwen/Qwen3-MAX

cheetahclaws --model qwen/Qwen3.5-Flash

```

### Zhipu GLM

Get your API key at [open.bigmodel.cn](https://open.bigmodel.cn).

```bash

export ZHIPU_API_KEY=...

cheetahclaws --model zhipu/glm-4-plus

cheetahclaws --model zhipu/glm-4-flash   # free tier

```

### DeepSeek

Get your API key at [platform.deepseek.com](https://platform.deepseek.com).

```bash

export DEEPSEEK_API_KEY=sk-...

cheetahclaws --model deepseek/deepseek-chat

cheetahclaws --model deepseek/deepseek-reasoner

```

### MiniMax

Get your API key at [platform.minimaxi.chat](https://platform.minimaxi.chat).

```bash

export MINIMAX_API_KEY=...

cheetahclaws --model minimax/MiniMax-Text-01

cheetahclaws --model minimax/MiniMax-VL-01

cheetahclaws --model minimax/abab6.5s-chat

```

---

## Usage: Open-Source Models (Local)

### Option A — Ollama (Recommended)

Ollama runs models locally with zero configuration. No API key required.

**Step 1: Install Ollama**

```bash

# macOS / Linux

curl -fsSL https://ollama.com/install.sh | sh

# Or download from https://ollama.com/download

```

**Step 2: Pull a model**

```bash

# Best for coding (recommended)

ollama pull qwen2.5-coder          # 4.7 GB (7B)

ollama pull qwen2.5-coder:32b      # 19 GB (32B)

# General purpose

ollama pull llama3.3               # 42 GB (70B)

ollama pull llama3.2               # 2.0 GB (3B)

# Reasoning

ollama pull deepseek-r1            # 4.7 GB (7B)

ollama pull deepseek-r1:32b        # 19 GB (32B)

# Other

ollama pull phi4                   # 9.1 GB (14B)

ollama pull mistral                # 4.1 GB (7B)

```

**Step 3: Start Ollama server** (runs automatically on macOS; on Linux run manually)

```bash

ollama serve     # starts on http://localhost:11434

```

**Step 4: Run cheetahclaws**

```bash

cheetahclaws --model ollama/qwen2.5-coder

cheetahclaws --model ollama/llama3.3

cheetahclaws --model ollama/deepseek-r1

```

Or

```bash

python cheetahclaws.py --model ollama/qwen2.5-coder

python cheetahclaws.py --model ollama/llama3.3

python cheetahclaws.py --model ollama/deepseek-r1

python cheetahclaws.py --model ollama/qwen3.5:35b

```

**List your locally available models:**

```bash

ollama list

```

Then use any model from the list:

```bash

cheetahclaws --model ollama/

```

---

### Option B — LM Studio

LM Studio provides a GUI to download and run models, with a built-in OpenAI-compatible server.

**Step 1:** Download [LM Studio](https://lmstudio.ai) and install it.

**Step 2:** Search and download a model inside LM Studio (GGUF format).

**Step 3:** Go to **Local Server** tab → click **Start Server** (default port: 1234).

**Step 4:**

```bash

cheetahclaws --model lmstudio/

# e.g.:

cheetahclaws --model lmstudio/phi-4-GGUF

cheetahclaws --model lmstudio/qwen2.5-coder-7b

```

The model name should match what LM Studio shows in the server status bar.

---

### Option C — vLLM / Self-Hosted OpenAI-Compatible Server

For self-hosted inference servers (vLLM, TGI, llama.cpp server, etc.) that expose an OpenAI-compatible API:

Quick Start for option C:

Step 1: Start vllm:

 ```

CUDA_VISIBLE_DEVICES=7 python -m vllm.entrypoints.openai.api_server \

      --model Qwen/Qwen2.5-Coder-7B-Instruct \

      --host 0.0.0.0 \

      --port 8000 \

      --enable-auto-tool-choice \

      --tool-call-parser hermes

```

 Step 2: Start cheetahclaws：

```

  export CUSTOM_BASE_URL=http://localhost:8000/v1

  export CUSTOM_API_KEY=none

  cheetahclaws --model custom/Qwen/Qwen2.5-Coder-7B-Instruct

```

```bash

# Example: vLLM serving Qwen2.5-Coder-32B

python -m vllm.entrypoints.openai.api_server \

    --model Qwen/Qwen2.5-Coder-32B-Instruct \

    --port 8000

# Then run cheetahclaws pointing to your server:

cheetahclaws

```

Inside the REPL:

```

/config custom_base_url=http://localhost:8000/v1

/config custom_api_key=token-abc123    # skip if no auth

/model custom/Qwen2.5-Coder-32B-Instruct

```

Or set via environment:

```bash

export CUSTOM_BASE_URL=http://localhost:8000/v1

export CUSTOM_API_KEY=token-abc123

cheetahclaws --model custom/Qwen2.5-Coder-32B-Instruct

```

For a remote GPU server:

```bash

/config custom_base_url=http://192.168.1.100:8000/v1

/model custom/your-model-name

```

---

## Model Name Format

Three equivalent formats are supported:

```bash

# 1. Auto-detect by prefix (works for well-known models)

cheetahclaws --model gpt-4o

cheetahclaws --model gemini-2.0-flash

cheetahclaws --model deepseek-chat

# 2. Explicit provider prefix with slash

cheetahclaws --model ollama/qwen2.5-coder

cheetahclaws --model kimi/moonshot-v1-128k

# 3. Explicit provider prefix with colon (also works)

cheetahclaws --model kimi:moonshot-v1-32k

cheetahclaws --model qwen:qwen-max

```

**Auto-detection rules:**

| Model prefix | Detected provider |

|---|---|

| `claude-` | anthropic |

| `gpt-`, `o1`, `o3` | openai |

| `gemini-` | gemini |

| `moonshot-`, `kimi-` | kimi |

| `qwen`, `qwq-` | qwen |

| `glm-` | zhipu |

| `deepseek-` | deepseek |

| `MiniMax-`, `minimax-`, `abab` | minimax |

| `llama`, `mistral`, `phi`, `gemma`, `mixtral`, `codellama` | ollama |

---

## CLI Reference

```

cheetahclaws [OPTIONS] [PROMPT]

# or: python cheetahclaws.py [OPTIONS] [PROMPT]

Options:

  -p, --print          Non-interactive: run prompt and exit

  -m, --model MODEL    Override model (e.g. gpt-4o, ollama/llama3.3)

  --accept-all         Auto-approve all operations (no permission prompts)

  --verbose            Show thinking blocks and per-turn token counts

  --thinking           Enable Extended Thinking (Claude only)

  --version            Print version and exit

  -h, --help           Show help

```

**Examples:**

```bash

# Interactive REPL with default model

cheetahclaws

# Switch model at startup

cheetahclaws --model gpt-4o

cheetahclaws -m ollama/deepseek-r1:32b

# Non-interactive / scripting

cheetahclaws --print "Write a Python fibonacci function"

cheetahclaws -p "Explain the Rust borrow checker in 3 sentences" -m gemini/gemini-2.0-flash

# CI / automation (no permission prompts)

cheetahclaws --accept-all --print "Initialize a Python project with pyproject.toml"

# Debug mode (see tokens + thinking)

cheetahclaws --thinking --verbose

```

---

## Slash Commands (REPL)

Type `/` and press **Tab** to see all commands with descriptions. Continue typing to filter, then Tab again to auto-complete. After a command name, press **Tab** again to see its subcommands (e.g. `/plugin ` → `install`, `uninstall`, `enable`, …).

| Command | Description |

|---|---|

| `/help` | Show all commands |

| `/clear` | Clear conversation history |

| `/model` | Show current model + list all available models |

| `/model ` | Switch model (takes effect immediately) |

| `/config` | Show all current config values |

| `/config key=value` | Set a config value (persisted to disk) |

| `/save` | Save session (auto-named by timestamp) |

| `/save ` | Save session to named file |

| `/load` | Interactive list grouped by date; enter number, `1,2,3` to merge, or `H` for full history |

| `/load ` | Load a saved session by filename |

| `/resume` | Restore the last auto-saved session (`mr_sessions/session_latest.json`) |

| `/resume ` | Load a specific file from `mr_sessions/` (or absolute path) |

| `/history` | Print full conversation history |

| `/context` | Show message count and token estimate |

| `/cost` | Show token usage and estimated USD cost |

| `/verbose` | Toggle verbose mode (tokens + thinking) |

| `/thinking` | Toggle Extended Thinking (Claude only) |

| `/permissions` | Show current permission mode |

| `/permissions ` | Set permission mode: `auto` / `accept-all` / `manual` |

| `/cwd` | Show current working directory |

| `/cwd ` | Change working directory |

| `/memory` | List all persistent memories |

| `/memory ` | Search memories by keyword (ranked by confidence × recency) |

| `/memory consolidate` | AI-extract up to 3 long-term insights from the current session |

| `/skills` | List available skills |

| `/agents` | Show sub-agent task status |

| `/mcp` | List configured MCP servers and their tools |

| `/mcp reload` | Reconnect all MCP servers and refresh tools |

| `/mcp reload ` | Reconnect a single MCP server |

| `/mcp add   [args]` | Add a stdio MCP server to user config |

| `/mcp remove ` | Remove a server from user config |

| `/voice` | Record voice, transcribe with Whisper, auto-submit as prompt |

| `/image [prompt]` | Capture clipboard image and send to vision model with optional prompt |

| `/voice status` | Show recording and STT backend availability |

| `/voice lang ` | Set STT language (e.g. `zh`, `en`, `ja`; `auto` to detect) |

| `/proactive` | Show current proactive polling status (ON/OFF and interval) |

| `/proactive ` | Enable background sentinel polling (e.g. `5m`, `30s`, `1h`) |

| `/proactive off` | Disable background polling |

| `/cloudsave setup ` | Configure GitHub Personal Access Token for Gist sync |

| `/cloudsave` | Upload current session to a private GitHub Gist |

| `/cloudsave push [desc]` | Upload with an optional description |

| `/cloudsave auto on\|off` | Toggle auto-upload on `/exit` |

| `/cloudsave list` | List your cheetahclaws Gists |

| `/cloudsave load ` | Download and restore a session from Gist |

| `/brainstorm` | Run a multi-persona AI brainstorm; prompts for agent count (2–100, default 5) |

| `/brainstorm ` | Focus the brainstorm on a specific topic; prompts for agent count |

| `/ssj` | Open SSJ Developer Mode — interactive power menu with 10 workflow shortcuts |

| `/worker` | Auto-implement all pending tasks from `brainstorm_outputs/todo_list.txt` |

| `/worker ` | Implement specific pending tasks by number (e.g. `/worker 1,4,6`) |

| `/worker --path ` | Use a custom todo file path instead of the default |

| `/worker --workers ` | Limit the batch to N tasks per run (e.g. `/worker --workers 3`) |

| `/telegram  ` | Configure and start the Telegram bot bridge |

| `/telegram` | Start the bridge using previously saved token + chat_id |

| `/telegram stop` | Stop the Telegram bridge |

| `/telegram status` | Show whether the bridge is running and the configured chat_id |

| `/checkpoint` | List all checkpoints (snapshots) for the current session |

| `/checkpoint ` | Rewind to checkpoint — restore files and conversation to that snapshot |

| `/checkpoint clear` | Delete all checkpoints for the current session |

| `/rewind` | Alias for `/checkpoint` |

| `/plan ` | Enter plan mode: read-only analysis, writes only to the plan file |

| `/plan` | Show current plan file contents |

| `/plan done` | Exit plan mode and restore original permissions |

| `/plan status` | Show whether plan mode is active |

| `/compact` | Manually compact the conversation (same as auto-compact but user-triggered) |

| `/compact ` | Compact with focus instructions (e.g. `/compact keep the auth refactor context`) |

| `/init` | Create a `CLAUDE.md` template in the current working directory |

| `/export` | Export the conversation as a Markdown file to `.nano_claude/exports/` |

| `/export ` | Export as Markdown or JSON (detected by `.json` extension) |

| `/copy` | Copy the last assistant response to the clipboard |

| `/status` | Show version, model, provider, permissions, session ID, token usage, and context % |

| `/doctor` | Diagnose installation health: Python, git, API key, optional deps, CLAUDE.md, checkpoint disk usage |

| `/exit` / `/quit` | Exit |


**Switching models inside a session:**

```

[myproject] ❯ /model

  Current model: claude-opus-4-6  (provider: anthropic)

  Available models by provider:

    anthropic     claude-opus-4-6, claude-sonnet-4-6, ...

    openai        gpt-4o, gpt-4o-mini, o3-mini, ...

    ollama        llama3.3, llama3.2, phi4, mistral, ...

    ...

[myproject] ❯ /model gpt-4o

  Model set to gpt-4o  (provider: openai)

[myproject] ❯ /model ollama/qwen2.5-coder

  Model set to ollama/qwen2.5-coder  (provider: ollama)

```

---

## Configuring API Keys

### Method 1: Environment Variables (recommended)

```bash

# Add to ~/.bashrc or ~/.zshrc

export ANTHROPIC_API_KEY=sk-ant-...

export OPENAI_API_KEY=sk-...

export GEMINI_API_KEY=AIza...

export MOONSHOT_API_KEY=sk-...       # Kimi

export DASHSCOPE_API_KEY=sk-...      # Qwen

export ZHIPU_API_KEY=...             # Zhipu GLM

export DEEPSEEK_API_KEY=sk-...       # DeepSeek

export MINIMAX_API_KEY=...           # MiniMax

```

### Method 2: Set Inside the REPL (persisted)

```

/config anthropic_api_key=sk-ant-...

/config openai_api_key=sk-...

/config gemini_api_key=AIza...

/config kimi_api_key=sk-...

/config qwen_api_key=sk-...

/config zhipu_api_key=...

/config deepseek_api_key=sk-...

/config minimax_api_key=...

```

Keys are saved to `~/.cheetahclaws/config.json` and loaded automatically on next launch.

### Method 3: Edit the Config File Directly

```json

// ~/.cheetahclaws/config.json

{

  "model": "qwen/qwen-max",

  "max_tokens": 8192,

  "permission_mode": "auto",

  "verbose": false,

  "thinking": false,

  "qwen_api_key": "sk-...",

  "kimi_api_key": "sk-...",

  "deepseek_api_key": "sk-...",

  "minimax_api_key": "..."

}

```

---

## Permission System

| Mode | Behavior |

|---|---|

| `auto` (default) | Read-only operations always allowed. Prompts before Bash commands and file writes. |

| `accept-all` | Never prompts. All operations proceed automatically. |

| `manual` | Prompts before every single operation, including reads. |

| `plan` | Read-only analysis mode. Only the plan file (`.nano_claude/plans/`) is writable. Entered via `/plan ` or the `EnterPlanMode` tool. |

**When prompted:**

```

  Allow: Run: git commit -am "fix bug"  [y/N/a(ccept-all)]

```

- `y` — approve this one action

- `n` or Enter — deny

- `a` — approve and switch to `accept-all` for the rest of the session

**Commands always auto-approved in `auto` mode:**

`ls`, `cat`, `head`, `tail`, `wc`, `pwd`, `echo`, `git status`, `git log`, `git diff`, `git show`, `find`, `grep`, `rg`, `python`, `node`, `pip show`, `npm list`, and other read-only shell commands.

---

## Built-in Tools

### Core Tools

| Tool | Description | Key Parameters |

|---|---|---|

| `Read` | Read file with line numbers | `file_path`, `limit`, `offset` |

| `Write` | Create or overwrite file (shows diff) | `file_path`, `content` |

| `Edit` | Exact string replacement (shows diff) | `file_path`, `old_string`, `new_string`, `replace_all` |

| `Bash` | Execute shell command | `command`, `timeout` (default 30s) |

| `Glob` | Find files by glob pattern | `pattern` (e.g. `**/*.py`), `path` |

| `Grep` | Regex search in files (uses ripgrep if available) | `pattern`, `path`, `glob`, `output_mode` |

| `WebFetch` | Fetch and extract text from URL | `url`, `prompt` |

| `WebSearch` | Search the web via DuckDuckGo | `query` |

### Notebook & Diagnostics Tools

| Tool | Description | Key Parameters |

|---|---|---|

| `NotebookEdit` | Edit a Jupyter notebook (`.ipynb`) cell | `notebook_path`, `new_source`, `cell_id`, `cell_type`, `edit_mode` (`replace`/`insert`/`delete`) |

| `GetDiagnostics` | Get LSP-style diagnostics for a source file (pyright/mypy/flake8 for Python; tsc/eslint for JS/TS; shellcheck for shell) | `file_path`, `language` (optional override) |

### Memory Tools

| Tool | Description | Key Parameters |

|---|---|---|

| `MemorySave` | Save or update a persistent memory | `name`, `type`, `description`, `content`, `scope` |

| `MemoryDelete` | Delete a memory by name | `name`, `scope` |

| `MemorySearch` | Search memories by keyword (or AI ranking) | `query`, `scope`, `use_ai`, `max_results` |

| `MemoryList` | List all memories with age and metadata | `scope` |

### Sub-Agent Tools

| Tool | Description | Key Parameters |

|---|---|---|

| `Agent` | Spawn a sub-agent for a task | `prompt`, `subagent_type`, `isolation`, `name`, `model`, `wait` |

| `SendMessage` | Send a message to a named background agent | `name`, `message` |

| `CheckAgentResult` | Check status/result of a background agent | `task_id` |

| `ListAgentTasks` | List all active and finished agent tasks | — |

| `ListAgentTypes` | List available agent type definitions | — |

### Background & Autonomy Tools

| Tool | Description | Key Parameters |

|---|---|---|

| `SleepTimer` | Schedule a silent background timer; injects an automated wake-up prompt when it fires so the agent can resume monitoring or deferred tasks | `seconds` |

### Skill Tools

| Tool | Description | Key Parameters |

|---|---|---|

| `Skill` | Invoke a skill by name from within the conversation | `name`, `args` |

| `SkillList` | List all available skills with triggers and metadata | — |

### MCP Tools

MCP tools are discovered automatically from configured servers and registered under the name `mcp____`. Claude can use them exactly like built-in tools.

| Example tool name | Where it comes from |

|---|---|

| `mcp__git__git_status` | `git` server, `git_status` tool |

| `mcp__filesystem__read_file` | `filesystem` server, `read_file` tool |

| `mcp__myserver__my_action` | custom server you configured |

> **Adding custom tools:** See [Architecture Guide](docs/architecture.md#tool-registry) for how to register your own tools.

---

## Memory

The model can remember things across conversations using the built-in memory system.

### Storage

Memories are stored as individual markdown files in two scopes:

| Scope | Path | Visibility |

|---|---|---|

| **User** (default) | `~/.cheetahclaws/memory/` | Shared across all projects |

| **Project** | `.cheetahclaws/memory/` in cwd | Local to the current repo |

A `MEMORY.md` index (≤ 200 lines / 25 KB) is auto-rebuilt on every save or delete and injected into the system prompt so the model always has an overview of what's been remembered.

### Memory types

| Type | Use for |

|---|---|

| `user` | Your role, preferences, background |

| `feedback` | How you want the model to behave (corrections AND confirmations) |

| `project` | Ongoing work, deadlines, decisions not in git history |

| `reference` | Links to external systems (Linear, Grafana, Slack, etc.) |

### Memory file format

Each memory is a markdown file with YAML frontmatter:

```markdown

---

name: coding_style

description: Python formatting preferences

type: feedback

created: 2026-04-02

confidence: 0.95

source: user

last_used_at: 2026-04-05

conflict_group: coding_style

---

Prefer 4-space indentation and full type hints in all Python code.

**Why:** user explicitly stated this preference.

**How to apply:** apply to every Python file written or edited.

```

**Metadata fields** (new — auto-managed):

| Field | Default | Description |

|---|---|---|

| `confidence` | `1.0` | Reliability score 0–1. Explicit user statements = 1.0; inferred preferences ≈ 0.8; auto-consolidated ≈ 0.8 |

| `source` | `user` | Origin: `user` / `model` / `tool` / `consolidator` |

| `last_used_at` | — | Updated automatically each time this memory is returned by MemorySearch |

| `conflict_group` | — | Groups related memories (e.g. `writing_style`) for conflict tracking |

### Conflict detection

When `MemorySave` is called with a name that already exists but different content, the system reports the conflict before overwriting:

```

Memory saved: 'writing_style' [feedback/user]

⚠ Replaced conflicting memory (was user-sourced, 100% confidence, written 2026-04-01).

  Old content: Prefer formal, academic style...

```

### Ranked retrieval

`MemorySearch` ranks results by **confidence × recency** (30-day exponential decay) rather than plain keyword order. Memories that haven't been used recently fade in priority. Each search hit also updates `last_used_at` so frequently-accessed memories stay prominent.

```

You: /memory python

  [feedback/user] coding_style [conf:95% src:user]

    Python formatting preferences

    Prefer 4-space indentation and full type hints...

```

### `/memory consolidate` — auto-extract long-term insights

After a meaningful session, run:

```

[myproject] ❯ /memory consolidate

  Analyzing session for long-term memories…

  ✓ Consolidated 2 memory/memories: user_prefers_direct_answers, avoid_trailing_summaries

```

The command sends a condensed session transcript to the model and asks it to identify up to **3** insights worth keeping long-term (user preferences, feedback corrections, project decisions). Extracted memories are saved with `confidence: 0.80` and `source: consolidator` — they **never overwrite** an existing memory that already has higher confidence.

Good times to run `/memory consolidate`:

- After correcting the model's behavior several times in a row

- After a session where you shared project background or decisions

- After completing a task with clear planning choices

### Example interaction

```

You: Remember that I prefer 4-space indentation and type hints.

AI: [calls MemorySave] Memory saved: 'coding_style' [feedback/user]

You: /memory

  1 memory/memories:

  [feedback  |user   ] coding_style.md

    Python formatting preferences

You: /memory python

  Found 1 relevant memory for 'python':

  [feedback/user] coding_style

    Prefer 4-space indentation and full type hints in all Python code.

You: /memory consolidate

  ✓ Consolidated 1 memory: user_prefers_verbose_commit_messages

```

**Staleness warnings:** Memories older than 1 day show a `⚠ stale` caveat — claims about file:line citations or code state may be outdated; verify before acting.

**AI-ranked search:** `MemorySearch(query="...", use_ai=true)` uses the model to rank candidates by relevance before applying the confidence × recency re-ranking.

---

## Skills

Skills are reusable prompt templates that give the model specialized capabilities. Two built-in skills ship out of the box — no setup required.

**Built-in skills:**

| Trigger | Description |

|---|---|

| `/commit` | Review staged changes and create a well-structured git commit |

| `/review [PR]` | Review code or PR diff with structured feedback |

**Quick start — custom skill:**

```bash

mkdir -p ~/.cheetahclaws/skills

```

Create `~/.cheetahclaws/skills/deploy.md`:

```markdown

---

name: deploy

description: Deploy to an environment

triggers: [/deploy]

allowed-tools: [Bash, Read]

when_to_use: Use when the user wants to deploy a version to an environment.

argument-hint: [env] [version]

arguments: [env, version]

context: inline

---

Deploy $VERSION to the $ENV environment.

Full args: $ARGUMENTS

```

Now use it:

```

You: /deploy staging 2.1.0

AI: [deploys version 2.1.0 to staging]

```

**Argument substitution:**

- `$ARGUMENTS` — the full raw argument string

- `$ARG_NAME` — positional substitution by named argument (first word → first name)

- Missing args become empty strings

**Execution modes:**

- `context: inline` (default) — runs inside current conversation history

- `context: fork` — runs as an isolated sub-agent with fresh history; supports `model` override

**Priority** (highest wins): project-level > user-level > built-in

**List skills:** `/skills` — shows triggers, argument hint, source, and `when_to_use`

**Skill search paths:**

```

./.cheetahclaws/skills/     # project-level (overrides user-level)

~/.cheetahclaws/skills/     # user-level

```

---

## Sub-Agents

The model can spawn independent sub-agents to handle tasks in parallel.

**Specialized agent types** — built-in:

| Type | Optimized for |

|---|---|

| `general-purpose` | Research, exploration, multi-step tasks |

| `coder` | Writing, reading, and modifying code |

| `reviewer` | Security, correctness, and code quality analysis |

| `researcher` | Web search and documentation lookup |

| `tester` | Writing and running tests |

**Basic usage:**

```

You: Search this codebase for all TODO comments and summarize them.

AI: [calls Agent(prompt="...", subagent_type="researcher")]

    Sub-agent reads files, greps for TODOs...

    Result: Found 12 TODOs across 5 files...

```

**Background mode** — spawn without waiting, collect result later:

```

AI: [calls Agent(prompt="run all tests", name="test-runner", wait=false)]

AI: [continues other work...]

AI: [calls CheckAgentResult / SendMessage to follow up]

```

**Git worktree isolation** — agents work on an isolated branch with no conflicts:

```

Agent(prompt="refactor auth module", isolation="worktree")

```

The worktree is auto-cleaned up if no changes were made; otherwise the branch name is reported.

**Custom agent types** — create `~/.cheetahclaws/agents/myagent.md`:

```markdown

---

name: myagent

description: Specialized for X

model: claude-haiku-4-5-20251001

tools: [Read, Grep, Bash]

---

Extra system prompt for this agent type.

```

**List running agents:** `/agents`

Sub-agents have independent conversation history, share the file system, and are limited to 3 levels of nesting.

---

## MCP (Model Context Protocol)

MCP lets you connect any external tool server — local subprocess or remote HTTP — and Claude can use its tools automatically. This is the same protocol Claude Code uses to extend its capabilities.

### Supported transports

| Transport | Config `type` | Description |

|---|---|---|

| **stdio** | `"stdio"` | Spawn a local subprocess (most common) |

| **SSE** | `"sse"` | HTTP Server-Sent Events stream |

| **HTTP** | `"http"` | Streamable HTTP POST (newer servers) |

### Configuration

Place a `.mcp.json` file in your project directory **or** edit `~/.cheetahclaws/mcp.json` for user-wide servers.

```json

{

  "mcpServers": {

    "git": {

      "type": "stdio",

      "command": "uvx",

      "args": ["mcp-server-git"]

    },

    "filesystem": {

      "type": "stdio",

      "command": "uvx",

      "args": ["mcp-server-filesystem", "/tmp"]

    },

    "my-remote": {

      "type": "sse",

      "url": "http://localhost:8080/sse",

      "headers": {"Authorization": "Bearer my-token"}

    }

  }

}

```

Config priority: `.mcp.json` (project) overrides `~/.cheetahclaws/mcp.json` (user) by server name.

### Quick start

```bash

# Install a popular MCP server

pip install uv        # uv includes uvx

uvx mcp-server-git --help   # verify it works

# Add to user config via REPL

/mcp add git uvx mcp-server-git

# Or create .mcp.json in your project dir, then:

/mcp reload

```

### REPL commands

```

/mcp                          # list servers + their tools + connection status

/mcp reload                   # reconnect all servers, refresh tool list

/mcp reload git               # reconnect a single server

/mcp add myserver uvx mcp-server-x   # add stdio server

/mcp remove myserver          # remove from user config

```

### How Claude uses MCP tools

Once connected, Claude can call MCP tools directly:

```

You: What files changed in the last git commit?

AI: [calls mcp__git__git_diff_staged()]

    → shows diff output from the git MCP server

```

Tool names follow the pattern `mcp____`. All characters

that are not alphanumeric or `_` are automatically replaced with `_`.

### Popular MCP servers

| Server | Install | Provides |

|---|---|---|

| `mcp-server-git` | `uvx mcp-server-git` | git operations (status, diff, log, commit) |

| `mcp-server-filesystem` | `uvx mcp-server-filesystem ` | file read/write/list |

| `mcp-server-fetch` | `uvx mcp-server-fetch` | HTTP fetch tool |

| `mcp-server-postgres` | `uvx mcp-server-postgres ` | PostgreSQL queries |

| `mcp-server-sqlite` | `uvx mcp-server-sqlite --db-path x.db` | SQLite queries |

| `mcp-server-brave-search` | `uvx mcp-server-brave-search` | Brave web search |

> Browse the full registry at [modelcontextprotocol.io/servers](https://modelcontextprotocol.io/servers)

---

## Plugin System

The `plugin/` package lets you extend cheetahclaws with additional tools, skills, and MCP servers from git repositories or local directories.

### Install a plugin

```bash

/plugin install my-plugin@https://github.com/user/my-plugin

/plugin install local-plugin@/path/to/local/plugin

```

### Manage plugins

```bash

/plugin                   # list installed plugins

/plugin enable my-plugin  # enable a disabled plugin

/plugin disable my-plugin # disable without uninstalling

/plugin disable-all       # disable all plugins

/plugin update my-plugin  # pull latest from git

/plugin uninstall my-plugin

/plugin info my-plugin    # show manifest details

```

### Plugin recommendation engine

```bash

/plugin recommend                    # auto-detect from project files

/plugin recommend "docker database"  # recommend by keyword context

```

The engine matches your context against a curated marketplace (git-tools, python-linter, docker-tools, sql-tools, test-runner, diagram-tools, aws-tools, web-scraper) using tag and keyword scoring.

### Plugin manifest (plugin.json)

```json

{

  "name": "my-plugin",

  "version": "0.1.0",

  "description": "Does something useful",

  "author": "you",

  "tags": ["git", "python"],

  "tools": ["tools"],        // Python module(s) that export TOOL_DEFS

  "skills": ["skills/my.md"],

  "mcp_servers": {},

  "dependencies": ["httpx"]  // pip packages

}

```

Alternatively use YAML frontmatter in `PLUGIN.md`.

### Scopes

| Scope | Location | Config |

|-------|----------|--------|

| user (default) | `~/.cheetahclaws/plugins/` | `~/.cheetahclaws/plugins.json` |

| project | `.cheetahclaws/plugins/` | `.cheetahclaws/plugins.json` |

Use `--project` flag: `/plugin install name@url --project`

---

## AskUserQuestion Tool

Claude can pause mid-task and interactively ask you a question before proceeding.

**Example invocation by Claude:**

```json

{

  "tool": "AskUserQuestion",

  "question": "Which database should I use?",

  "options": [

    {"label": "SQLite", "description": "Simple, file-based"},

    {"label": "PostgreSQL", "description": "Full-featured, requires server"}

  ],

  "allow_freetext": true

}

```

**What you see in the terminal:**

```

❓ Question from assistant:

   Which database should I use?

  [1] SQLite — Simple, file-based

  [2] PostgreSQL — Full-featured, requires server

  [0] Type a custom answer

Your choice (number or text):

```

- Select by number or type free text directly

- Claude receives your answer and continues the task

- 5-minute timeout (returns "(no answer — timeout)" if unanswered)

---

## Task Management

The `task/` package gives Claude (and you) a structured task list for tracking multi-step work within a session.

### Tools available to Claude

| Tool | Parameters | What it does |

|------|-----------|--------------|

| `TaskCreate` | `subject`, `description`, `active_form?`, `metadata?` | Create a task; returns `#id created: subject` |

| `TaskUpdate` | `task_id`, `subject?`, `description?`, `status?`, `owner?`, `add_blocks?`, `add_blocked_by?`, `metadata?` | Update any field; `status='deleted'` removes the task |

| `TaskGet` | `task_id` | Return full details of one task |

| `TaskList` | _(none)_ | List all tasks with status icons and pending blockers |

**Valid statuses:** `pending` → `in_progress` → `completed` / `cancelled` / `deleted`

### Dependency edges

```

TaskUpdate(task_id="3", add_blocked_by=["1","2"])

# Task 3 is now blocked by tasks 1 and 2.

# Reverse edges are set automatically: tasks 1 and 2 get task 3 in their "blocks" list.

```

Completed tasks are treated as resolved — `TaskList` hides their blocking effect on dependents.

### Persistence

Tasks are saved to `.cheetahclaws/tasks.json` in the current working directory after every mutation and reloaded on first access.

### REPL commands

```

/tasks                    list all tasks

/tasks create    quick-create a task

/tasks start          mark in_progress

/tasks done           mark completed

/tasks cancel         mark cancelled

/tasks delete         remove a task

/tasks get            show full details

/tasks clear              delete all tasks

```

### Typical Claude workflow

```

User: implement the login feature

Claude:

  TaskCreate(subject="Design auth schema", description="JWT vs session")  → #1

  TaskCreate(subject="Write login endpoint", description="POST /auth/login") → #2

  TaskCreate(subject="Write tests", description="Unit + integration") → #3

  TaskUpdate(task_id="2", add_blocked_by=["1"])

  TaskUpdate(task_id="3", add_blocked_by=["2"])

  TaskUpdate(task_id="1", status="in_progress", active_form="Designing schema")

  ... (does the work) ...

  TaskUpdate(task_id="1", status="completed")

  TaskList()  → task 2 is now unblocked

  ...

```

---

## Voice Input

CheetahClaws v3.05 adds a fully offline voice-to-prompt pipeline. Speak your request — it is transcribed and submitted as if you had typed it.

### Quick start

```bash

# 1. Install a recording backend (choose one)

pip install sounddevice        # recommended: cross-platform, no extra binary

# sudo apt install alsa-utils  # Linux arecord fallback

# sudo apt install sox         # SoX rec fallback

# 2. Install a local STT backend (recommended — works offline, no API key)

pip install faster-whisper numpy

# 3. Start CheetahClaws and speak

cheetahclaws

[myproject] ❯ /voice

  🎙  Listening… (speak now, auto-stops on silence, Ctrl+C to cancel)

  🎙  ████

✓  Transcribed: "fix the authentication bug in user.py"

[auto-submitting…]

```

### STT backends (tried in order)

| Backend | Install | Notes |

|---|---|---|

| `faster-whisper` | `pip install faster-whisper` | **Recommended** — local, offline, fastest, GPU optional |

| `openai-whisper` | `pip install openai-whisper` | Local, offline, original OpenAI model |

| OpenAI Whisper API | set `OPENAI_API_KEY` | Cloud, requires internet + API key |

Override the Whisper model size with `NANO_CLAUDE_WHISPER_MODEL` (default: `base`):

```bash

export NANO_CLAUDE_WHISPER_MODEL=small   # better accuracy, slower

export NANO_CLAUDE_WHISPER_MODEL=tiny    # fastest, lightest

```

### Recording backends (tried in order)

| Backend | Install | Notes |

|---|---|---|

| `sounddevice` | `pip install sounddevice` | **Recommended** — cross-platform, Python-native |

| `arecord` | `sudo apt install alsa-utils` | Linux ALSA, no pip needed |

| `sox rec` | `sudo apt install sox` / `brew install sox` | Built-in silence detection |

### Keyterm boosting

Before each recording, CheetahClaws extracts coding vocabulary from:

- **Git branch** (e.g. `feat/voice-input` → "feat", "voice", "input")

- **Project root name** (e.g. "cheetahclaws")

- **Recent source file stems** (e.g. `authentication_handler.py` → "authentication", "handler")

- **Global coding terms**: `MCP`, `grep`, `TypeScript`, `OAuth`, `regex`, `gRPC`, …

These are passed as Whisper's `initial_prompt` so the STT engine prefers correct spellings of coding terms.

### Commands

| Command | Description |

|---|---|

| `/voice` | Record voice and auto-submit the transcript as your next prompt |

| `/voice status` | Show which recording and STT backends are available |

| `/voice lang ` | Set transcription language (`en`, `zh`, `ja`, `de`, `fr`, … default: `auto`) |


### How it compares to Claude Code

| | Claude Code | CheetahClaws v3.05 |

|---|---|---|

| STT service | Anthropic private WebSocket (`voice_stream`) | `faster-whisper` / `openai-whisper` / OpenAI API |

| Requires Anthropic OAuth | Yes | **No** |

| Works offline | No | **Yes** (with local Whisper) |

| Keyterm hints | Deepgram `keyterms` param | Whisper `initial_prompt` (git + files + vocab) |

| Language support | Server-allowlisted codes | Any language Whisper supports |

---

## Brainstorm

`/brainstorm` runs a structured multi-persona AI debate over your project, then synthesizes all perspectives into an actionable plan.

### How it works

1. **Context snapshot** — reads `README.md`, `CLAUDE.md`, and root file listing from the current working directory.

2. **Agent count** — you are prompted to choose how many agents (2–100, default 5). Press Enter to use the default.

3. **Dynamic persona generation** — the model generates N expert roles tailored to your topic. Software topics get architects and engineers; geopolitics gets analysts, diplomats, and economists; business gets strategists and market experts. Falls back to built-in tech personas if generation fails.

4. **Agents debate sequentially**, each building on the previous responses.

5. **Output saved** to `brainstorm_outputs/brainstorm_YYYYMMDD_HHMMSS.md` in the current directory.

6. **Synthesis** — the main agent reads the saved file and produces a prioritized Master Plan.

**Example personas by topic:**

| Topic | Example Generated Personas |

|---|---|

| Software architecture | 🏗️ Architect · 💡 Product Innovator · 🛡️ Security Engineer · 🔧 Code Quality Lead · ⚡ Performance Specialist |

| US-Iran geopolitics | 🌍 Geopolitical Analyst · ⚖️ International Law Expert · 💰 Energy Economist · 🎖️ Military Strategist · 🕊️ Conflict Mediator |

| Business strategy | 📈 Market Strategist · 💼 Operations Lead · 🔍 Competitive Intelligence · 💡 Innovation Director · 📊 Financial Analyst |

### Usage

```

[myproject] ❯ /brainstorm

  How many agents? (2-100, default 5) > 5

[myproject] ❯ /brainstorm improve plugin architecture

  How many agents? (2-100, default 5) > 3

[myproject] ❯ /brainstorm US-Iran geopolitics

  How many agents? (2-100, default 5) > 7

```

### Example output

```

[myproject] ❯ /brainstorm medical research funding

  How many agents? (2-100, default 5) > 3

Generating 3 topic-appropriate expert personas...

Starting 3-Agent Brainstorming Session on: medical research funding

Generating diverse perspectives...

🩺 Clinical Trials Director is thinking...

  └─ Perspective captured.

⚖️ Medical Ethics Committee Member is thinking...

  └─ Perspective captured.

💰 Health Economics Policy Analyst is thinking...

  └─ Perspective captured.

✓  Brainstorming complete! Results saved to brainstorm_outputs/brainstorm_20260405_224117.md

   ── Analysis from Main Agent ──

[synthesized Master Plan streams here…]

```

### Notes

- Brainstorm uses the **currently selected model** (`/model` to check). A capable model (Claude Sonnet/Opus, GPT-4o, or a large local model) gives the best results.

- With many agents (20+) the session can take several minutes depending on model speed.

- Install `faker` (`pip install faker`) for randomized persona names; falls back to built-in names otherwise.

- Output files accumulate in `brainstorm_outputs/` — already added to `.gitignore` by v3.05.5.

- If output looks garbled in SSH (repeated lines), run `/config rich_live=false` to disable Rich Live streaming.

---

## SSJ Developer Mode

`/ssj` opens a persistent interactive power menu — a single entry point for the most common development workflows, so you never have to remember command names.







### Menu options

| # | Name | What it does |

|---|------|--------------|

| 1 | 💡 Brainstorm | Multi-persona AI debate → Master Plan → auto-generates `brainstorm_outputs/todo_list.txt` |

| 2 | 📋 Show TODO | View `brainstorm_outputs/todo_list.txt` with ✓/○ indicators and pending task numbers |

| 3 | 👷 Worker | Auto-implement pending tasks (all, or select by number) |

| 4 | 🧠 Debate | Pick a file and choose agent count — expert panel debates design round-by-round; result saved next to the file |

| 5 | ✨ Propose | Pick a file — AI proposes specific improvements with code |

| 6 | 🔎 Review | Pick a file — structured code review with 1–10 ratings per dimension |

| 7 | 📘 Readme | Pick a file — auto-generate a professional README for it |

| 8 | 💬 Commit | Analyse git diff and suggest a conventional commit message |

| 9 | 🧪 Scan | Summarise all staged/unstaged changes and suggest next steps |

| 10 | 📝 Promote | Read the latest brainstorm output → convert ideas to `todo_list.txt` tasks |

| 0 | 🚪 Exit | Return to the main REPL |

### Usage

```

[myproject] ❯ /ssj

╭─ SSJ Developer Mode ⚡ ─────────────────────────

│

│   1.  💡  Brainstorm — Multi-persona AI debate

│   2.  📋  Show TODO  — View todo_list.txt

│   3.  👷  Worker     — Auto-implement pending tasks

│   4.  🧠  Debate     — Expert debate on a file

│   5.  ✨  Propose    — AI improvement for a file

│   6.  🔎  Review     — Quick file analysis

│   7.  📘  Readme     — Auto-generate README.md

│   8.  💬  Commit     — AI-suggested commit message

│   9.  🧪  Scan       — Analyze git diff

│  10.  📝  Promote    — Idea to tasks

│   0.  🚪  Exit SSJ Mode

│

╰──────────────────────────────────────────────

  ⚡ SSJ » 1

  Topic (Enter for general): cheetahclaws plugin system

  # → Brainstorm spins up, saves to brainstorm_outputs/, generates todo_list.txt

  # → Menu re-opens automatically after each action

  ⚡ SSJ » 2

  # → Shows numbered pending tasks from brainstorm_outputs/todo_list.txt

  ⚡ SSJ » 3

  Task # (Enter for all, or e.g. 1,4,6): 2

  # → Worker implements task #2 and marks it done

```

### Slash command passthrough

Any `/command` typed at the `⚡ SSJ »` prompt is passed through to the REPL:

```

  ⚡ SSJ » /model gpt-4o

  # → switches model, then re-opens SSJ menu

  ⚡ SSJ » /exit

  # → exits cheetahclaws immediately

```

### Worker command

`/worker` (also accessible as SSJ option 3) reads `brainstorm_outputs/todo_list.txt` and auto-implements each pending task:

```

[myproject] ❯ /worker

  ✓ Worker starting — 3 task(s) to implement

    1. ○ Add animated brainstorm spinner

    2. ○ Implement Telegram typing indicator

    3. ○ Write SSJ demo GIF for README

  ── Worker (1/3): Add animated brainstorm spinner ──

  [model reads code, implements the change, marks task done]

[myproject] ❯ /worker 2,3

  # Implement only tasks 2 and 3

[myproject] ❯ /worker --path docs/tasks.md

  # Use a custom todo file

[myproject] ❯ /worker --workers 2

  # Process only the first 2 pending tasks this run

```

**Smart path detection** — if you pass a brainstorm output file (`.md`) by mistake, Worker detects it and offers to redirect to the matching `todo_list.txt` in the same folder. If that file does not yet exist, it offers to generate `todo_list.txt` from the brainstorm output first (SSJ Promote), then run Worker automatically.

### Debate command

SSJ option 4 runs a structured multi-round expert debate on any file:

```

  ⚡ SSJ » 4

  Files in brainstorm_outputs/:

    1. brainstorm_20260406_143022.md

    2. cheetahclaws.py

  File to debate #: 2

  Number of debate agents (Enter for 2): 3

  ℹ Debate result will be saved to: cheetahclaws_debate_143055.md

⚔️  Assembling expert panel...

  Expert 1: 🏗️ Architecture Lead — focus: system design & modularity

  Expert 2: 🔐 Security Engineer — focus: attack surface & input validation

  Expert 3: ⚡ Performance Specialist — focus: latency & memory usage

⚔️  Round 1/5 — Expert 1 thinking...

  [Architecture Lead gives opening argument...]

💬  Round 1/5 — Expert 2 formulating...

  [Security Engineer responds...]

  ...

📜  Drafting final consensus...

  [model writes consensus + saves transcript]

✓ Debate complete. Saved to cheetahclaws_debate_143055.md

```

- Agent count is configurable (minimum 2, default 2). Rounds are set to `agents × 2 − 1` for a full open-close structure.

- An animated spinner shows the current round and expert (`⚔️ Round 2/3 — Expert 1 thinking...`), stopping the moment that expert starts outputting.

- The full debate transcript and ranked consensus are saved to `_debate_HHMMSS.md` **in the same directory as the debated file**.

---

## Telegram Bridge

`/telegram` turns cheetahclaws into a Telegram bot — receive messages from your phone, run the model with full tool access, and reply automatically.







### Setup (one-time)

1. Open [@BotFather](https://t.me/BotFather) in Telegram → `/newbot` → copy the token.

2. Send any message to your new bot, then open `https://api.telegram.org/bot/getUpdates` and note your `chat.id`.

3. Configure cheetahclaws:

```

[myproject] ❯ /telegram  

  ✓ Telegram config saved.

  ✓ Connected to @your_bot_name. Starting bridge...

  ✓ Telegram bridge active. Chat ID: 123456789

  ℹ Send messages to your bot — they'll be processed here.

  ℹ Stop with /telegram stop or send /stop in Telegram.

```

Token and chat_id are saved to `~/.cheetahclaws/config.json`. On next launch the bridge **auto-starts** if configured — the startup banner shows `flags: [telegram]`.

### How it works

```

Phone (Telegram)                  cheetahclaws terminal

──────────────────                ──────────────────────────

"List Python files"      →        📩 Telegram: List Python files

                                  [typing indicator sent...]

                                  ⚙ Glob(**/*.py) → 5 files

                                  ⚙ response assembled

                          ←       "agent.py, tools.py, ..."

```

- **Typing indicator** is sent every 4 seconds while the model processes, so the chat feels responsive.

- **Unauthorized senders** receive `⛔ Unauthorized.` and their messages are dropped.

- **Slash command passthrough**: send `/cost`, `/model gpt-4o`, `/clear`, etc. from Telegram and they execute in cheetahclaws.

- **`/stop` or `/off`** sent from Telegram stops the bridge gracefully.

### Commands

| Command | Description |

|---|---|

| `/telegram  ` | Configure token + chat_id, then start the bridge |

| `/telegram` | Start the bridge using saved config |

| `/telegram status` | Show running state and chat_id |

| `/telegram stop` | Stop the bridge |

### Auto-start

If both `telegram_token` and `telegram_chat_id` are set in `~/.cheetahclaws/config.json`, the bridge starts automatically on every cheetahclaws launch:

```

╭─ CheetahClaws ────────────────────────────────╮

│  Model:       claude-opus-4-6

│  Permissions: auto   flags: [telegram]

│  Type /help for commands, Ctrl+C to cancel        │

╰───────────────────────────────────────────────────╯

✓ Telegram bridge started (auto). Bot: @your_bot_name

```

---

## Proactive Background Monitoring

CheetahClaws v3.05.2 adds a **sentinel daemon** that automatically wakes the agent after a configurable period of inactivity — no user prompt required. This enables use cases like continuous log monitoring, market script polling, or scheduled code checks.

### Quick start

```

[myproject] ❯ /proactive 5m

Proactive background polling: ON  (triggering every 300s of inactivity)

[myproject] ❯ keep monitoring the build log and alert me if errors appear

╭─ Claude ● ─────────────────────────

│ Understood. I'll check the build log each time I wake up.

[Background Event Triggered]

╭─ Claude ● ─────────────────────────

│ ⚙ Bash(tail -50 build.log)

│ ✓ → Build failed: ImportError in auth.py line 42

│ **Action needed:** fix the import before the next CI run.

```

### Commands

| Command | Description |

|---|---|

| `/proactive` | Show current status (ON/OFF and interval) |

| `/proactive 5m` | Enable — trigger every 5 minutes of inactivity |

| `/proactive 30s` | Enable — trigger every 30 seconds |

| `/proactive 1h` | Enable — trigger every hour |

| `/proactive off` | Disable sentinel polling |

Duration suffix: `s` = seconds, `m` = minutes, `h` = hours. Plain integer = seconds.

### How it works

- A background daemon thread starts when the REPL launches (paused by default).

- The daemon checks elapsed time since the last user or agent interaction every second.

- When the inactivity threshold is reached, it calls the agent with a wake-up prompt.

- The `threading.Lock` used by the main agent loop ensures wake-ups never interrupt an active session — they queue and fire after the current turn completes.

- Watcher exceptions are logged via `traceback` so failures are visible and debuggable.

### Complements SleepTimer

| | `SleepTimer` | `/proactive` |

|---|---|---|

| Who initiates | The agent | The user |

| Trigger | After a fixed delay from now | After N seconds of inactivity |

| Use case | "Check back in 10 minutes" | "Keep watching until I stop typing" |

---

## Checkpoint System

CheetahClaws automatically snapshots your conversation and any edited files after every turn, so you can always rewind to an earlier state.

### How it works

- **Auto-snapshot** — after each turn, the checkpoint system saves the current conversation messages, token counts, and a copy-on-write backup of every file that was written or edited that turn.

- **100-snapshot sliding window** — older snapshots are automatically evicted when the limit is reached.

- **Throttling** — if nothing changed (no new messages, no file edits) since the last snapshot, the snapshot is skipped.

- **Initial snapshot** — captured at session start, so you can always rewind to a clean slate.

- **Storage** — `~/.nano_claude/checkpoints//` (snapshots metadata + backup files).

### Commands

| Command | Description |

|---|---|

| `/checkpoint` | List all snapshots for the current session |

| `/checkpoint ` | Rewind: restore files to their state at snapshot `` and trim conversation to that point |

| `/checkpoint clear` | Delete all snapshots for the current session |

| `/rewind` | Alias for `/checkpoint` |

### Example

```

[myproject] ❯ /checkpoint

  Checkpoints (4 total):

  #1  [turn 0] 14:02:11  "(initial state)"           0 files

  #2  [turn 1] 14:03:45  "Create app.py"              1 file

  #3  [turn 2] 14:05:12  "Add error handling"         1 file

  #4  [turn 3] 14:06:30  "Explain the code"           1 file

[myproject] ❯ /checkpoint 2

  Rewound to checkpoint #2 (turn 1)

  Restored: app.py

  Conversation trimmed to 2 messages.

```

---

## Plan Mode

Plan mode is a structured workflow for tackling complex, multi-file tasks: Claude first analyses the codebase in a read-only phase and writes an explicit plan, then the user approves before implementation begins.

### How it works

In plan mode:

- **Only reads** are permitted (`Read`, `Glob`, `Grep`, `WebFetch`, `WebSearch`, safe `Bash` commands).

- **Writes are blocked** everywhere **except** the dedicated plan file (`.nano_claude/plans/.md`).

- Blocked write attempts produce a helpful message rather than prompting the user.

- The system prompt is augmented with plan mode instructions.

- After compaction, the plan file context is automatically restored.

### Slash command workflow

```

[myproject] ❯ /plan add WebSocket support

  Plan mode activated.

  Plan file: .nano_claude/plans/a3f9c1b2.md

  Reads allowed. All other writes blocked (except plan file).

[myproject] ❯ 

  [Claude reads files, builds understanding, writes plan to plan file]

[myproject] ❯ /plan

  # Plan: Add WebSocket support

  ## Phase 1: Create ws_handler.py

  ## Phase 2: Modify server.py to mount the handler

  ## Phase 3: Add tests

[myproject] ❯ /plan done

  Plan mode exited. Permission mode restored to: auto

  Review the plan above and start implementing when ready.

[myproject] ❯ /plan status

  Plan mode: INACTIVE  (permission mode: auto)

```

### Agent tool workflow (autonomous)

Claude can autonomously enter and exit plan mode using the `EnterPlanMode` and `ExitPlanMode` tools — both are auto-approved in all permission modes:

```

User: Refactor the authentication module

Claude: [calls EnterPlanMode(task_description="Refactor auth module")]

  → reads auth.py, users.py, tests/test_auth.py ...

  → writes plan to .nano_claude/plans/...

  [calls ExitPlanMode()]

  → "Here is my plan. Please review and approve before I begin."

User: Looks good, go ahead.

Claude: [implements the plan]

```

### Commands

| Command | Description |

|---|---|

| `/plan ` | Enter plan mode with a task description |

| `/plan` | Print the current plan file contents |

| `/plan done` | Exit plan mode, restore previous permissions |

| `/plan status` | Show whether plan mode is active |

---

## Context Compression

Long conversations are automatically compressed to stay within the model's context window.

**Two layers:**

1. **Snip** — Old tool outputs (file reads, bash results) are truncated after a few turns. Fast, no API cost.

2. **Auto-compact** — When token usage exceeds 70% of the context limit, older messages are summarized by the model into a concise recap.

This happens transparently. You don't need to do anything.

**Manual compaction** — You can also trigger compaction at any time with `/compact`. An optional focus string tells the summarizer what context to prioritize:

```

[myproject] ❯ /compact

  Compacted: ~12400 → ~3200 tokens (~9200 saved)

[myproject] ❯ /compact keep the WebSocket implementation details

  Compacted: ~11800 → ~3100 tokens (~8700 saved)

```

If plan mode is active, the plan file context is automatically restored after any compaction.

---

## Diff View

When the model edits or overwrites a file, you see a git-style diff:

```diff

  Changes applied to config.py:

--- a/config.py

+++ b/config.py

@@ -12,7 +12,7 @@

     "model": "claude-opus-4-6",