An open API service indexing awesome lists of open source software.

https://github.com/openakita/openakita

An open-source AI assistant framework with skills and agent architecture
https://github.com/openakita/openakita

agent ai assistant automation clawd clawdbot openclaw python

Last synced: about 2 months ago
JSON representation

An open-source AI assistant framework with skills and agent architecture

Awesome Lists containing this project

README

          


OpenAkita Logo

OpenAkita


Open-Source Multi-Agent AI Assistant β€” Not Just Chat, an AI Team That Gets Things Done


Official Website
Β 
Download
Β 
Discord


License
Python Version
Version
PyPI
Build Status
Stars


Multi-Agent Collaboration Β· Organization Orchestration Β· Plugin System Β· Sandbox Security Β· 30+ LLMs Β· 6 IM Platforms Β· 89+ Tools Β· Desktop / Web / Mobile


Quick-Start β€’
Org Orchestration β€’
Scan-to-Bind β€’
Plugins β€’
Security β€’
Docs


English | δΈ­ζ–‡

---

## What is OpenAkita?

**Other AIs just chat. OpenAkita gets things done.**

OpenAkita is an open-source, all-in-one AI assistant β€” multiple AI Agents work together, build an **AI company** that runs autonomously, search the web, operate your computer, manage files, run scheduled tasks, and respond instantly across Telegram / Feishu / WeCom / DingTalk / QQ. **Scan a QR code to bind your chat app in 30 seconds.** It remembers your preferences, teaches itself new skills, and never gives up on a task. Extend anything through the **plugin system**, protected by **6-layer sandbox security**.

**Fully GUI-based setup. Ready in 5 minutes. Zero command line required.**


🌐 Website openakita.ai  | 
πŸ“₯ Download Desktop App Β |Β 
πŸ“– Documentation Β |Β 
πŸ’¬ Discord Community

---

## Quick-Start Guide

### πŸš€ For First-Time Users (3 Minutes)

**No installation required** β€” download the desktop app and start chatting:

1. **Download** the installer from [GitHub Releases](https://openakita.ai/download)
2. **Install** and follow the onboarding wizard
3. **Enter your API Key** (get one from [Anthropic](https://console.anthropic.com/) or [DeepSeek](https://platform.deepseek.com/))
4. **Try your first task**: Type "Create a calculator" and watch it work

### πŸ’» For Developers (5 Minutes)

```bash
# Install
pip install openakita[all]

# Quick setup (interactive wizard)
openakita init

# Run your first task
openakita run "Build a weather scraper"
```

### ✨ What You Can Do Right Away

| Category | Examples |
|----------|----------|
| **πŸ’¬ Chat** | Text + images + files, voice messages, stickers |
| **πŸ€– Multi-Agent** | "Create a competitive analysis" β†’ research + analysis + writing agents work together |
| **🏒 Organization** | Build an AI company β€” CEO, CTO, marketing, finance β€” agents run it autonomously |
| **🌐 Web** | Search news, scrape websites, automate browser tasks |
| **πŸ“ Files** | Read/write/edit files, batch rename, search content |
| **πŸ–₯️ Desktop** | Click buttons, type text, take screenshots, automate apps |
| **⏰ Scheduled** | "Remind me every Monday at 9am" β€” cron-based reminders |

### ➑️ Next Steps

- **Configure LLM**: Add multiple providers for automatic failover
- **Set up IM channels**: Scan QR code to bind WeChat/Feishu/WeCom in 30 seconds
- **Explore skills**: Install from marketplace or create your own
- **Try org mode**: Build an AI company and watch it run
- **Join community**: [Discord](https://discord.gg/vFwxNVNH) | [WeChat Group](docs/assets/wechat_group.jpg)

---

## Core Capabilities

### 🀝 Multi-Agent Collaboration
Multiple AI agents with specialized skills work in parallel.
Say one thing β€” a coding Agent writes code, a writing Agent drafts docs, a testing Agent verifies β€” all at the same time.

### 🏒 Organization Orchestration
Not just multi-Agent β€” build an **AI company**. CEO, CTO, CFO, marketing director… each role operates independently. Blackboard sharing, message routing, deadlock detection.

### πŸ“‹ Plan Mode
Complex tasks auto-decomposed into step-by-step plans with real-time progress tracking and automatic rollback on failure.

### 🧠 ReAct Reasoning Engine
Think β†’ Act β†’ Observe. Explicit three-phase reasoning with checkpoint/rollback. Fails? Tries a different strategy.

### πŸ”Œ Plugin System
8 plugin types, 3-tier permission model, 10 lifecycle hooks. Tools, channels, RAG, memory, LLM β€” everything is extensible.

### πŸ›‘οΈ 6-Layer Sandbox Security
Path zoning Β· Confirmation gates Β· Command interception Β· File snapshots Β· Self-protection Β· OS-level sandbox. High-risk commands run in isolation automatically.

### πŸ“± IM Scan-to-Bind
WeChat, Feishu, WeCom β€” scan a QR code, 30 seconds to bind, use AI directly in your chat app.

### πŸ’Ύ Dual-Mode Memory β€” Smarter Over Time
Fragment memory + MDRM relational graph (causal chains Β· timelines Β· entity graph Β· 3D visualization), auto smart switching.

---

## Full Feature List

| | Feature | Description |
|:---:|---------|-------------|
| 🀝 | **Multi-Agent** | Specialized agents, parallel delegation, automatic handoff, failover, real-time visual dashboard |
| 🏒 | **Org Orchestration** | Hierarchical org structure, CEO/CTO/CFO roles, blackboard memory, message routing, deadlock detection, heartbeat, auto-scaling |
| πŸ“‹ | **Plan Mode** | Auto task decomposition, per-step tracking, floating progress bar in UI |
| 🧠 | **ReAct Reasoning** | Explicit 3-phase loop, checkpoint/rollback, loop detection, strategy switching |
| πŸš€ | **Zero-Barrier Setup** | Full GUI config, onboarding wizard, 5 minutes from install to chat, zero CLI |
| πŸ”§ | **89+ Built-in Tools** | 16 categories: Shell / Files / Browser / Desktop / Search / Scheduler / MCP … |
| πŸ”Œ | **Plugin System** | 8 types (tool/channel/RAG/memory/LLM/hook/skill/MCP), 3-tier permissions, 10 lifecycle hooks, fault isolation |
| πŸ›‘οΈ | **6-Layer Security** | Path zoning, confirmation gates, command blocklist, file snapshots, self-protection, OS sandbox (Linux bwrap / macOS seatbelt / Windows MIC) |
| πŸ“± | **IM Scan-to-Bind** | WeChat/Feishu/WeCom scan-to-bind, 30-second setup, no developer configuration needed |
| πŸ›’ | **Skill Marketplace** | Search & one-click install, GitHub direct install, AI-generated skills on the fly |
| 🌐 | **30+ LLM Providers** | Anthropic / OpenAI / DeepSeek / Qwen / Kimi / MiniMax / Gemini … smart failover |
| πŸ’¬ | **6 IM Platforms** | Telegram / Feishu / WeCom / DingTalk / QQ / OneBot, voice recognition, smart group chat |
| πŸ”— | **MCP Integration** | Standard MCP client, stdio / HTTP / SSE transports, multi-directory scan, dynamic server management |
| πŸ’Ύ | **Dual-Mode Memory** | Mode 1 fragments (3 layers + 7 types + multi-path recall) + Mode 2 MDRM relational graph (5 dimensions + multi-hop traversal + 3D visualization), auto smart switching |
| 🎭 | **8 Personas** | Default / Tech Expert / Boyfriend / Girlfriend / Jarvis / Butler / Business / Family |
| πŸ€– | **Proactive Engine** | Greetings, task follow-ups, idle chat, goodnight β€” adapts frequency to your feedback |
| 🧬 | **Self-Evolution** | Daily self-check & repair, failure root cause analysis, auto skill generation |
| πŸ” | **Deep Thinking** | Controllable thinking mode, real-time chain-of-thought display, IM streaming |
| πŸ–₯️ | **Multi-Platform** | Desktop (Win/Mac/Linux) Β· Web (PC & mobile browser) Β· Mobile App (Android/iOS), 11 panels, dark theme |
| πŸ“Š | **Observability** | 12 trace span types, full-chain token statistics panel |
| πŸ˜„ | **Stickers** | 5700+ stickers, mood-aware, persona-matched |

---

## 5-Minute Setup

### Option 1: Desktop App (Recommended)

**Fully GUI-based, no command line** β€” this is what sets OpenAkita apart from other open-source AI assistants:


OpenAkita Quick Setup

| Step | What You Do | Time |
|:----:|-------------|:----:|
| 1 | Download installer, double-click to install | 1 min |
| 2 | Follow the onboarding wizard, enter API Key | 2 min |
| 3 | Start chatting | Now |

- No Python installation, no git clone, no config file editing
- Isolated runtime β€” won't mess with your existing system
- Chinese users get automatic mirror switching
- Models, IM channels, skills, schedules β€” all configured in the GUI

> **Download**: [GitHub Releases](https://openakita.ai/download) β€” Windows (.exe) / macOS (.dmg) / Linux (.deb)
>
> Learn more at **[openakita.ai](https://openakita.ai)**

### Option 2: pip Install

```bash
pip install openakita[all] # Install with all optional features
openakita init # Run setup wizard
openakita # Launch interactive CLI
```

### Option 3: Source Install

```bash
git clone https://github.com/openakita/openakita.git
cd openakita
python -m venv venv && source venv/bin/activate
pip install -e ".[all]"
openakita init
```

### Commands

```bash
openakita # Interactive chat
openakita run "Build a calculator" # Execute a single task
openakita serve # Service mode (IM channels)
openakita serve --dev # Dev mode with hot reload
openakita daemon start # Background daemon
openakita status # Check status
```

---

## Multi-Platform Access

OpenAkita supports **Desktop, Web, and Mobile** β€” use it anywhere, on any device:

| Platform | Details |
|----------|---------|
| πŸ–₯️ **Desktop App** | Windows / macOS / Linux β€” native app built with Tauri 2.x |
| 🌐 **Web Access** | PC & mobile browser β€” enable remote access, open in any browser |
| πŸ“± **Mobile App** | Android (APK) / iOS (TestFlight) β€” native wrapper via Capacitor |

### Desktop App


OpenAkita Desktop App

Cross-platform desktop app built with **Tauri 2.x + React + TypeScript**:

| Panel | Function |
|-------|----------|
| **Chat** | AI chat, streaming output, Thinking display, drag & drop upload, image lightbox |
| **Agent Dashboard** | Neural network visualization, real-time multi-Agent status tracking |
| **Agent Manager** | Create, manage, and configure multiple Agents |
| **IM Channels** | One-stop setup for all 6 platforms, scan-to-bind |
| **Skills** | Marketplace search, install, enable/disable |
| **MCP** | MCP server management |
| **Memory** | Memory management + LLM-powered review |
| **Scheduler** | Scheduled task management |
| **Token Stats** | Token usage statistics |
| **Config** | LLM endpoints, system settings, advanced options |
| **Feedback** | Bug reports + feature requests |

Dark/light theme Β· Onboarding wizard Β· Auto-update Β· Bilingual (EN/CN) Β· Start on boot

### Mobile App



β–Ά Watch Mobile App Demo



β–Ά Click to watch the Mobile App demo on Bilibili

- Connect your phone to the desktop backend over local network
- Full-featured: chat, multi-Agent collaboration, memory, skills, MCP β€” all on mobile
- Supports real-time streaming and Thinking chain display
- Preview mode available without connecting to a server

---

## Organization Orchestration



β–Ά Watch Organization Orchestration Demo



β–Ά Click to watch: Built a company on OpenAkita, and it runs autonomously (Bilibili)

Beyond multi-Agent collaboration β€” build an entire **AI company**. OpenAkita includes a full organization orchestration engine (AgentOrg) that lets you visually design company structures in the GUI, where AI agents operate like a real company:

```
β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚ CEO / Executive β”‚
β”‚ Sets company strategy, coordinates all β”‚
β””β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”˜
β–Ό β–Ό β–Ό β–Ό
CTO Product Marketing CFO
Tech arch Planning Strategy Budget ctrl
β”‚ β”‚ β”‚ β”‚
β–Ό β–Ό β–Ό β–Ό
Dev team Design Content Analytics
```

### Key Features

| Feature | Description |
|---------|-------------|
| **Visual Org Chart** | Drag-and-drop org builder in the GUI β€” nodes, edges, hierarchies |
| **Autonomous Roles** | Each node is an independent Agent with its own identity, skills, policies, and memory |
| **Blackboard Sharing** | 3-level blackboard memory (org / department / node) for secure cross-team info sharing |
| **Message Routing** | Priority message queues with edge bandwidth control and deadlock detection |
| **Heartbeat Checks** | Periodic health monitoring of all nodes, auto-remediation on anomalies |
| **Auto-Scaling** | Automatically recruits new Agents under heavy load, dismisses when idle |
| **External Tools** | Nodes can request research / browser / code tools on demand with approval workflow |
| **Org Templates** | Pre-built templates (tech company, content team, etc.) β€” deploy with one click |
| **Projects & Tasks** | Tree-structured task decomposition, timeline tracking, org-wide coordination |

---

## Multi-Agent Collaboration



β–Ά Watch Multi-Agent Collaboration Demo



β–Ά Click to watch the Multi-Agent collaboration demo on Bilibili

OpenAkita has a built-in multi-Agent orchestration system β€” not just one AI, but an **AI team**:

```
You: "Create a competitive analysis report"
β”‚
β–Ό
β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚ AgentOrchestrator (Director) β”‚
β”‚ Decomposes task β†’ Assigns to Agents β”‚
β””β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”˜
β–Ό β–Ό β–Ό
Search Agent Analysis Agent Writing Agent
(web research) (data crunching) (report drafting)
β”‚ β”‚ β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”΄β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
β–Ό
Results merged, delivered to you
```

- **Specialization**: Different Agents for different domains, auto-matched to tasks
- **Parallel Processing**: Multiple Agents work simultaneously
- **Auto Handoff**: If one Agent gets stuck, it hands off to a better-suited one
- **Failover**: Agent failure triggers automatic switch to backup
- **Depth Control**: Max 5 delegation levels to prevent runaway recursion
- **Visual Tracking**: Agent Dashboard shows real-time status of every Agent
- **Instance Pooling**: Agent instance pool with LRU eviction for efficient resource use

---

## IM Scan-to-Bind



β–Ά Watch Scan-to-Bind Tutorial



β–Ά Click to watch: OpenAkita scan-to-bind for WeChat, Feishu, WeCom (Bilibili)

**No developer account needed. No callback URL configuration. No technical knowledge required** β€” scan a QR code, 30 seconds to bind:

| Platform | How to Bind | Time |
|----------|-------------|:----:|
| **WeChat** | Open IM Channels β†’ Click WeChat β†’ Scan QR code | 30 sec |
| **Feishu** | Open IM Channels β†’ Click Feishu β†’ Scan to authorize | 30 sec |
| **WeCom** | Open IM Channels β†’ Click WeCom β†’ Scan to bind | 30 sec |

Once bound, just @AI in your chat app β€” send messages, images, files, voice β€” AI handles it all.

---

## 6 IM Platforms

Talk to your AI right inside the chat tools you already use:

| Platform | Connection | Highlights |
|----------|-----------|------------|
| **WeChat** | Scan-to-bind (iLink) | Personal account, no official account needed, 30-second setup |
| **Feishu** | WebSocket / Webhook | Card messages, event subscriptions, scan-to-bind |
| **WeCom** | Smart Robot callback / WebSocket | Streaming replies, proactive push, scan-to-bind |
| **DingTalk** | Stream WebSocket | No public IP needed |
| **Telegram** | Webhook / Long Polling | Pairing verification, Markdown, proxy support |
| **QQ Official** | WebSocket / Webhook | Groups, DMs, channels |
| **OneBot** | WebSocket | Compatible with NapCat / Lagrange / go-cqhttp |

- πŸ“· **Vision**: Send screenshots/photos β€” AI understands them
- 🎀 **Voice**: Send voice messages β€” auto-transcribed and processed
- πŸ“Ž **File Delivery**: AI-generated files pushed directly to chat
- πŸ‘₯ **Group Chat**: Replies when @mentioned, stays quiet otherwise
- πŸ’­ **Chain-of-Thought**: Real-time reasoning process streamed to IM
- πŸ”„ **Message Interrupts**: Insert new instructions between tool calls without waiting

---

## Plugin System

OpenAkita provides a complete plugin architecture with `plugin.json` manifest declarations, a 3-tier permission model for security, and 10 lifecycle hooks for deep integration:

### 8 Plugin Types

| Type | Description | Example |
|------|-------------|---------|
| πŸ”§ **Tool** | Register custom tools for LLM to call | Database queries, API calls |
| πŸ’¬ **Channel** | Add new IM channel adapters | Slack, Discord adapters |
| πŸ“š **RAG** | Add external knowledge retrieval sources | Notion, Confluence retrieval |
| 🧠 **Memory** | Extend memory storage backends | Redis, PostgreSQL storage |
| πŸ€– **LLM** | Connect new LLM providers | Private model deployments |
| πŸͺ **Hook** | Inject logic into the lifecycle | Message auditing, content filtering |
| ⚑ **Skill** | Wrap Skills as plugins | Package skills for distribution |
| πŸ”— **MCP** | Wrap MCP Servers as plugins | Simplify MCP deployment |

### 3-Tier Permission Model

| Tier | Description | Example |
|------|-------------|---------|
| **Basic** | Auto-granted on install | Read config, register tools |
| **Advanced** | Requires user confirmation on install | File I/O, network requests |
| **System** | Must be manually granted per-permission | Shell execution, system config |

### Lifecycle Hooks

`on_init` β†’ `on_message_received` β†’ `on_tool_result` β†’ `on_prompt_build` β†’ `on_retrieve` β†’ `on_session_start` β†’ `on_session_end` β†’ `on_schedule` β†’ `on_shutdown`

Plugins have **automatic fault isolation**: error count exceeding threshold triggers auto-disable, preventing a single plugin from crashing the system.

### Bundled AI-Media Plugins (20)

Built on the `openakita-plugin-sdk` (`contrib/` 6 modules: `verification`, `quality_gates`, `intent_verifier`, `provider_score`, `error_coach`, `slideshow_risk` / `source_review`). All ship with `SKILL.md` + `README.md` + tests.

| Plugin | Capability | Tests |
|--------|-----------|-------|
| `tongyi-image` | DashScope text-to-image / image-edit / outpaint / background-gen | 107 |
| `seedance-video` | Volcengine Ark Seedance text/image-to-video, long-form chaining | 58 |
| `tts-studio` | Multi-provider TTS dialogue rendering | 6 |
| `avatar-speaker` | Digital-human avatar + TTS | 7 |
| `image-edit` | Image editing dispatcher | 5 |
| `subtitle-maker` | Generate subtitles | 4 |
| `video-translator` | Translate video subtitles | 14 |
| `highlight-cutter` | Highlight reel generator | 5 |
| `poster-maker` | Static poster composition | 8 |
| `storyboard` | Scene plan generator (uses `slideshow_risk` + `intent_verifier`) | 39 |
| `bgm-suggester` | LLM-driven BGM brief (uses `verification`) | 45 |
| `bgm-mixer` | madmom beat-aware BGM mixing | 68 |
| `transcribe-archive` | Long-form transcription with `parallel_executor` + `checkpoint` | 95 |
| `video-bg-remove` | RVM background removal (uses `dep_gate` + `checkpoint`) | 72 |
| `video-color-grade` | ffmpeg auto color grading via `auto_color_grade_filter` | 49 |
| `smart-poster-grid` | 4-aspect poster batches (uses `verification`) | 50 |
| `ppt-to-video` | LibreOffice + tts-studio narrated slideshows | 79 |
| `local-sd-flux` | Local ComfyUI (SD 1.5 / SDXL / FLUX) + provider ranker | 99 |
| `shorts-batch` | Batch shorts orchestrator (uses `slideshow_risk`) | 51 |
| `dub-it` | Video dubbing pipeline (uses `source_review`) | 52 |
| **Total** | | **913** |

> Developer docs: [Plugin System Overview](docs/plugin-system-overview.md) Β· [SDK contrib catalog](openakita-plugin-sdk/docs/contrib.md) Β· [Sprint 18 Cleanup Assessment](docs/sprint18-cleanup-assessment.md)

---

## Sandbox Security

OpenAkita implements a **6-layer defense-in-depth** security model, from path management to OS-level isolation:

```
L1 Path Zoning workspace / controlled / protected / forbidden
L2 Confirmation Gate Dangerous ops (delete files, system commands) require user approval
L3 Command Intercept regedit, format, rm -rf β€” blocked outright
L4 File Snapshots Auto-checkpoint before writes, rollback available
L5 Self-Protection data/, src/, identity/ β€” core dirs locked from modification
L6 OS-Level Sandbox Linux bwrap / macOS seatbelt / Windows MIC
```

### Sandbox Execution

When the policy engine classifies a shell command as **HIGH risk**, it automatically runs in an OS-level sandbox:

| Platform | Sandbox Backend | Description |
|----------|----------------|-------------|
| **Linux** | bubblewrap (bwrap) | User-space container isolation, restricted filesystem and network |
| **macOS** | sandbox-exec (seatbelt) | System-level sandbox policies |
| **Windows** | Low Integrity (MIC) | Mandatory Integrity Control, low-privilege process isolation |

### Additional Security Mechanisms

- **Policy Engine**: `POLICIES.yaml` for tool permissions, shell command blocklist, path restrictions
- **Resource Budgets**: Token / cost / duration / iteration / tool call limits per task
- **Runtime Supervision**: Auto-detection of tool thrashing, reasoning loops, token anomalies
- **Local Data**: Memory, config, and chat history stored on your machine only
- **Open Source**: Apache 2.0, fully transparent codebase

---

## 30+ LLM Providers

**No vendor lock-in. Mix and match freely:**

| Category | Providers |
|----------|-----------|
| **International** | Anthropic Β· OpenAI Β· Google Gemini Β· xAI (Grok) Β· Mistral Β· OpenRouter Β· NVIDIA NIM Β· Groq Β· Together AI Β· Fireworks Β· Cohere |
| **China** | Alibaba DashScope Β· Kimi (Moonshot) Β· Xiaomi MiMo Β· MiniMax Β· DeepSeek Β· SiliconFlow Β· Volcengine Β· Zhipu AI Β· Baidu Qianfan Β· Tencent Hunyuan Β· Yunwu Β· Meituan LongCat Β· iFlow |
| **Local** | Ollama Β· LM Studio (⚠️ Small models have limited tool-calling ability β€” not recommended yet, pending optimization) |

**7 capability dimensions**: Text Β· Vision Β· Video Β· Tool use Β· Thinking Β· Audio Β· PDF

**Smart failover**: One model goes down, the next picks up seamlessly.

### Recommended Models

**International Models (in order of recommendation):**

| Model | Provider | Notes |
|-------|----------|-------|
| `claude-opus-4-6` | Anthropic | One of the best β€” top-tier coding & long-task capability, 1M context |
| `gpt-5.4` | OpenAI | Flagship β€” native computer-use, 1M context, strong reasoning |
| `claude-sonnet-4-6` | Anthropic | Best value β€” fully upgraded default model, 1M context |
| `gpt-5.3-instant` | OpenAI | Best for everyday chat β€” significantly fewer hallucinations, natural flow |
| `claude-opus-4-5` | Anthropic | Previous flagship, still extremely capable |
| `claude-sonnet-4-5` | Anthropic | Stable and reliable for everyday use |

**Chinese Models (recommended):**

| Model | Provider | Notes |
|-------|----------|-------|
| `kimi-k2.5` | Moonshot | 1T MoE, Agent Swarm with up to 100 parallel sub-agents, 256K context, open-source |
| `qwen3.5-plus` | Alibaba | 397B MoE, 1M context, 201 languages, extremely cost-effective |
| `mimo-v2-pro` | Xiaomi | 1T MoE, 1M context, global Top 8 ranking, affordable pricing |
| `deepseek-v3` | DeepSeek | Cost-effective benchmark, strong Chinese support |

> For complex reasoning, enable Thinking mode β€” add `-thinking` suffix to the model name.
>
> ⚠️ **Local small models not recommended** (e.g. 7B/14B quantized): Small models have limited tool-calling and agent collaboration capabilities, prone to hallucinations and format errors. Use API-hosted flagship models for the best experience.

---

## Memory System

Not just a "context window" β€” true long-term memory. Supports **dual modes** with automatic switching:

### Mode 1: Fragment Memory (Classic)

- **Three layers**: Working memory (current task) + Core memory (user profile) + Dynamic retrieval (past experience)
- **7 memory types**: Fact / Preference / Skill / Error / Rule / Persona trait / Experience
- **Multi-path recall**: Semantic + full-text + temporal + attachment search
- **Gets smarter over time**: Preferences you mentioned two months ago? Still remembered.

### Mode 2: MDRM Relational Graph Memory (New)

On top of fragment memory, builds **causal chains, timelines, and entity relationship graphs** β€” letting AI truly understand connections between events:

| Dimension | Description | Example |
|-----------|-------------|---------|
| **Temporal** | Event chronology and timelines | "What did I do last week?" β†’ auto-constructs timeline |
| **Causal** | Cause-and-effect chains | "What caused this bug?" β†’ traces causal chain |
| **Entity** | Relationships between people/projects/concepts | "Which projects did Alice work on?" β†’ entity graph |
| **Action** | Dependencies, prerequisites, compositions | "What else is needed to finish X?" β†’ dependency analysis |
| **Context** | Project/session attribution | "All discussions about this project" β†’ cross-session aggregation |

- **4 node types**: Event / Fact / Decision / Goal
- **Multi-hop graph traversal**: Starting from seed nodes, expands along relationship edges to find deep connections
- **3-layer encoding**: Fast rule-based encoding β†’ summary backfill β†’ session-end batch LLM encoding
- **3D visualization**: Frontend supports 3D visualization of the memory graph

### Smart Mode Switching

Set `memory_mode` to `auto` (default) and the system auto-routes based on query characteristics: causal/timeline/cross-session questions use **Mode 2 graph traversal**, preference/fact queries use **Mode 1 semantic retrieval**.

- **AI-driven extraction**: Automatically distills valuable information after each conversation, dual-track writes to both modes
- **3D memory graph**: Visualize memory nodes and relationships, intuitively understand AI's memory structure

---

## MCP Integration

OpenAkita includes a full [MCP (Model Context Protocol)](https://modelcontextprotocol.io/) client, enabling AI to connect with any external service:

| Feature | Description |
|---------|-------------|
| **3 Transports** | stdio (default), Streamable HTTP, SSE (legacy compatible) |
| **Multi-Dir Scan** | Auto-discovers MCP configs from built-in `mcps/`, `.mcp`, `data/mcp/servers/` directories |
| **Dynamic Management** | Add/remove MCP servers at runtime, no restart needed |
| **Tool Suite** | `call_mcp_tool`, `list_mcp_servers`, `add_mcp_server`, `connect_mcp_server`, and more |
| **Progressive Disclosure** | MCP tool catalog + prompt templates, shown on demand |
| **GUI Management** | Desktop MCP panel for one-stop configuration |

Connect to GitHub, databases, Playwright browser, filesystem, or any MCP Server.

---

## Self-Evolution

OpenAkita keeps getting stronger:

```
Daily 04:00 β†’ Self-check: analyze error logs β†’ AI diagnosis β†’ auto-fix β†’ push report
After failure β†’ Root cause analysis (context loss / tool limitation / loop / budget) β†’ suggestions
Missing skill β†’ Auto-search GitHub for skills, or AI generates one on the spot
Missing dep β†’ Auto pip install, auto mirror switching for China
Every chat β†’ Extract preferences and experience β†’ long-term memory
```

---

## Architecture

```
Desktop App (Tauri + React)
β”‚
Identity ─── SOUL.md Β· AGENT.md Β· POLICIES.yaml Β· 8 Persona Presets
β”‚
Core ─── ReasoningEngine(ReAct) Β· Brain(LLM) Β· ContextManager
β”‚ PromptAssembler Β· RuntimeSupervisor Β· ResourceBudget
β”‚
Agents ─── AgentOrchestrator(Coordination) Β· AgentInstancePool(Pooling)
β”‚ AgentFactory Β· FallbackResolver(Failover)
β”‚
Org ─── OrgRuntime(Runtime) Β· OrgManager(CRUD)
β”‚ OrgMessenger(Routing) Β· Blackboard(Shared Memory)
β”‚ OrgIdentity(Inheritance) Β· OrgPolicies(Policies)
β”‚
Plugins ─── PluginManager(Discovery/Loading) Β· PluginAPI(Host Interface)
β”‚ HookRegistry(10 Hooks) Β· PluginSandbox(Fault Isolation)
β”‚
Memory ─── Mode1: UnifiedStore(SQLite+Vector) Β· RetrievalEngine(Multi-path)
β”‚ Mode2: RelationalStore(MDRM Graph) Β· GraphEngine(Multi-hop)
β”‚ MemoryModeRouter(Auto Switch) Β· MemoryEncoder(3-Layer)
β”‚
Tools ─── Shell Β· File Β· Browser Β· Desktop Β· Web Β· MCP Β· Skills
β”‚ Plan Β· Scheduler Β· Sticker Β· Persona Β· Agent Delegation
β”‚
Security ─── PolicyEngine(6-Layer) Β· SandboxExecutor(OS Sandbox)
β”‚ ConfirmationGate Β· CommandFilter Β· Checkpoint
β”‚
Evolution ── SelfCheck Β· FailureAnalyzer Β· SkillGenerator Β· Installer
β”‚
Channels ─── CLI Β· Telegram Β· Feishu Β· WeCom Β· WeChat Β· DingTalk Β· QQ Β· OneBot
β”‚
Tracing ─── AgentTracer(12 SpanTypes) Β· DecisionTrace Β· TokenStats
```

---

## Documentation

| Document | Content |
|----------|---------|
| [Configuration Guide](docs/configuration-guide.md) | Desktop Quick Setup & Full Setup walkthrough |
| ⭐ [LLM Provider Setup](docs/llm-provider-setup-tutorial.md) | **API Key registration + endpoint config + Failover** |
| ⭐ [IM Channel Setup](docs/im-channel-setup-tutorial.md) | **Telegram / Feishu / DingTalk / WeCom / QQ / OneBot tutorial** |
| [Plugin System Overview](docs/plugin-system-overview.md) | Plugin types, permissions, developer guide |
| [Org Orchestration Design](docs/agent-org-technical-design.md) | AgentOrg technical architecture and design |
| [Org Orchestration Guide](docs/agent-org-user-guide.md) | Organization orchestration user guide |
| [Quick Start](docs/getting-started.md) | Installation and basics |
| [Architecture](docs/architecture.md) | System design and components |
| [Configuration](docs/configuration.md) | All config options |
| [Deployment](docs/deploy.md) | Production deployment |
| [MCP Integration](docs/mcp-integration.md) | Connecting external services |
| [Skill System](docs/skills.md) | Creating and using skills |

---

## Community



WeChat Official Account

WeChat Official

Follow for updates


Personal WeChat

WeChat (Personal)

Note "OpenAkita" to join group


WeChat Group

WeChat Group

Scan to join (⚠️ refreshed weekly)


QQ Group

QQ Group: 854429727

Scan or search to join


🌐 Website ·
πŸ’¬ Discord Β·
🐦 X (Twitter) ·
πŸ“§ Email


Issues Β·
Discussions Β·
⭐ Star

---

## Acknowledgments

- [Anthropic Claude](https://www.anthropic.com/claude) β€” Default recommended LLM, core development partner
- [Tauri](https://tauri.app/) β€” Cross-platform desktop framework
- [ChineseBQB](https://github.com/zhaoolee/ChineseBQB) β€” 5700+ stickers that give AI a soul
- [browser-use](https://github.com/browser-use/browser-use) β€” AI browser automation
- [AGENTS.md](https://agentsmd.io/) / [Agent Skills](https://agentskills.io/) β€” Open standards

### Community Contributors

- [@948324394](https://github.com/948324394) β€” Docker deployment support

## License

Apache License 2.0 β€” See [LICENSE](LICENSE)

Third-party licenses: [THIRD_PARTY_NOTICES.md](THIRD_PARTY_NOTICES.md)

## Star History





Star History Chart

---


OpenAkita β€” Open-Source Multi-Agent AI Assistant That Gets Things Done

openakita.ai