https://github.com/forge18/agent-router-mcp

Intelligent MCP router for AI subagents with config-driven rules and LLM semantic tagging
https://github.com/forge18/agent-router-mcp
agent-routing ai-agents ai-orchestration config-driven intelligent-router llm mcp model-context-protocol ollama rust semantic-routing subagent-routing subagents
Last synced: 6 months ago
JSON representation
Intelligent MCP router for AI subagents with config-driven rules and LLM semantic tagging
Host: GitHub
URL: https://github.com/forge18/agent-router-mcp
Owner: forge18
License: mit
Created: 2026-01-05T05:47:34.000Z (6 months ago)
Default Branch: main
Last Pushed: 2026-01-13T02:11:48.000Z (6 months ago)
Last Synced: 2026-01-13T19:49:43.392Z (6 months ago)
Topics: agent-routing, ai-agents, ai-orchestration, config-driven, intelligent-router, llm, mcp, model-context-protocol, ollama, rust, semantic-routing, subagent-routing, subagents
Language: Rust
Homepage: https://github.com/forge18/agent-router-mcp
Size: 108 KB
Stars: 0
Watchers: 0
Forks: 0
Open Issues: 1
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
- Security: SECURITY.md
Awesome Lists containing this project

README

          # Agent Router MCP

![Beta](https://img.shields.io/badge/status-beta-yellow)

![Build Status](https://github.com/forge18/agent-router-mcp/actions/workflows/ci.yml/badge.svg)

![License](https://img.shields.io/badge/license-MIT-blue)

> **Beta Software** - This project is functional and tested. Feedback and bug reports are welcome.

A **stateless, config-driven** Model Context Protocol (MCP) server that intelligently routes requests to specialized AI subagents using a hybrid rule-based + LLM approach.

## Table of Contents

**Getting Started**

- [Key Features](#key-features)

- [Requirements](#requirements)

- [Quick Start Installation](#quick-start-installation)

  - [1. Download Binary](#1-download-binary)

  - [2. Download Config Files](#2-download-config-files)

  - [3. Configure MCP Client](#3-configure-mcp-client)

**Using the Server**

- [MCP Tools Reference](#mcp-tools-reference)

  - [get_routing](#get_routing)

  - [start_ollama](#start_ollama)

  - [pull_model](#pull_model)

  - [load_model](#load_model)

- [How It Works](#how-it-works)

**Configuration**

- [Configuration Files](#configuration-files)

  - [agents.json](#configagentsjson)

  - [rules.json](#configrulesjson)

  - [llm-tags.json](#configllm-tagsjson)

- [Customization Examples](#customization-examples)

- [Model Switching](#model-switching)

**Advanced Topics**

- [Architecture](#architecture)

- [Creating Agents/Subagents](#creating-agentssubagents)

- [Cross-Platform Support](#cross-platform-support)

- [Compiling from Source](#compiling-from-source)

- [Development](#development)

---

## Key Features

- 🔧 **Fully Config-Driven**: All routing logic defined in JSON - no code changes needed

- 🚀 **Stateless Architecture**: No state between requests, configs loaded once on startup

- ⚡ **Fast Routing**: Rule-based matching handles 90%+ of cases locally

- 🧠 **LLM Semantic Tagging**: Uses any Ollama model for edge cases and semantic understanding

- 🔄 **Flexible Rules**: Boolean logic (any_of, all_of) with nesting support

- 📝 **User Customizable**: Define your own agents, tags, and routing rules

## Requirements

### Software

- Rust 1.70+

- Ollama installed and in PATH

### Hardware

- **RAM**: 8GB minimum (for 3B models like `smollm3:3b`)

  - 16GB recommended for better performance and multitasking

  - Larger models require more RAM (7B models need 16GB, 13B models need 32GB)

- **Disk Space**: ~2GB for default model

  - Varies by model size and quantization

## Quick Start Installation

> **Prerequisites:** Make sure [Ollama](https://ollama.com) is installed and running. See [Requirements](#requirements) above.

### 1. Download Binary

Download the latest release from [GitHub Releases](https://github.com/yourusername/agent-router-mcp/releases) for your platform:

**Choose Your Binary:**

- **Windows Intel/AMD**: `agent-router-mcp-windows-amd64.exe` (Most Windows PCs)

- **Windows ARM**: `agent-router-mcp-windows-arm64.exe` (Surface Pro X, Windows Dev Kit 2023)

- **macOS Intel**: `agent-router-mcp-macos-intel` (Intel Macs)

- **macOS Apple Silicon**: `agent-router-mcp-macos-silicon` (M1/M2/M3 Macs)

- **Linux Intel/AMD**: `agent-router-mcp-linux-amd64` (Most PCs/servers)

- **Linux ARM**: `agent-router-mcp-linux-arm64` (Raspberry Pi 4+, AWS Graviton)

**Not sure which binary?** On the command line:

- **Linux/macOS**: Run `uname -m`

  - Output `x86_64` → use `amd64`

  - Output `aarch64` or `arm64` → use `arm64`

- **Windows**: Run `echo %PROCESSOR_ARCHITECTURE%`

  - Output `AMD64` → use `windows-amd64.exe`

  - Output `ARM64` → use `windows-arm64.exe`

**macOS/Linux**: Make it executable

```bash

chmod +x agent-router-mcp-*

```

### 2. Download Config Files

Download the config archive from [GitHub Releases](https://github.com/yourusername/agent-router-mcp/releases):

- **Windows**: Download `agent-router-mcp-config.zip` and extract

- **macOS/Linux**: Download `agent-router-mcp-config.tar.gz` and extract with `tar -xzf agent-router-mcp-config.tar.gz`

Place the extracted files in a folder (e.g., `C:\agent-configs\` on Windows or `~/agent-configs/` on macOS/Linux).

Alternatively, download individual files:

- [agents.json](https://raw.githubusercontent.com/yourusername/agent-router-mcp/main/config/agents.json)

- [rules.json](https://raw.githubusercontent.com/yourusername/agent-router-mcp/main/config/rules.json)

- [llm-tags.json](https://raw.githubusercontent.com/yourusername/agent-router-mcp/main/config/llm-tags.json)

### 3. Configure MCP Client

Add to your MCP client's configuration file (location varies by client - check your client's documentation).

**Example (Windows)**:

```json

{

  "mcpServers": {

    "agent-router": {

      "type": "stdio",

      "command": "C:\\path\\to\\agent-router-mcp.exe",

      "env": {

        "OLLAMA_URL": "http://localhost:11434",

        "MODEL_SOURCE": "huggingface",

        "MODEL_NAME": "unsloth/SmolLM3-3B-128K-GGUF",

        "AGENTS_CONFIG_PATH": "C:\\agent-configs\\agents.json",

        "LLM_TAGS_CONFIG_PATH": "C:\\agent-configs\\llm-tags.json",

        "RULES_CONFIG_PATH": "C:\\agent-configs\\rules.json"

      }

    }

  }

}

```

**Example (macOS/Linux)**:

```json

{

  "mcpServers": {

    "agent-router": {

      "type": "stdio",

      "command": "/path/to/agent-router-mcp",

      "env": {

        "OLLAMA_URL": "http://localhost:11434",

        "MODEL_SOURCE": "huggingface",

        "MODEL_NAME": "unsloth/SmolLM3-3B-128K-GGUF",

        "AGENTS_CONFIG_PATH": "/Users/me/agent-configs/agents.json",

        "LLM_TAGS_CONFIG_PATH": "/Users/me/agent-configs/llm-tags.json",

        "RULES_CONFIG_PATH": "/Users/me/agent-configs/rules.json"

      }

    }

  }

}

```

Replace the paths with your actual file locations.

---

## MCP Tools Reference

The server exposes 2 tools for managing Ollama and getting routing instructions:

### `init_llm`

Initialize the LLM environment. This tool:

1. Checks if Ollama is installed

2. Starts Ollama if not running

3. Pulls the configured model if not downloaded

4. Loads the model into memory

Call this once before using `get_instructions`.

**Input:** None required

**Output (Success):**

```json

{

  "success": true,

  "message": "LLM ready for routing",

  "steps_performed": [

    "Ollama already running",

    "Model unsloth/SmolLM3-3B-128K-GGUF already installed",

    "Model unsloth/SmolLM3-3B-128K-GGUF already loaded"

  ]

}

```

### `get_instructions`

Get routing instructions for a user request. **This is the main tool** that performs intelligent routing.

**Input:**

```json

{

  "task": "Fix the authentication bug",

  "intent": "review code before commit",

  "original_prompt": "Can you fix the login issue in auth.ts?",

  "associated_files": ["src/auth.ts", "src/middleware/auth.ts"]

}

```

- `task` (required): What the agent is doing - the current task or action being performed

- `intent` (required): The agent's intent for this tool call (e.g., "review code before commit", "help debug an issue", "prepare for pull request")

- `original_prompt` (optional): The original user request, preserved for better LLM semantic tagging. Useful when `task` is a summary or derivative of the original request.

- `associated_files` (optional): List of file paths relevant to this task, used for file-based routing rules. If not provided, no file-based rules will match.

Note: Git context (branch only) is **auto-detected** from the current working directory for branch-based routing rules.

**Output (Success):**

```json

{

  "instructions": [

    {

      "trigger": {

        "name": "file_pattern",

        "description": "*auth*"

      },

      "context": {

        "instructions": "Review authentication code for security vulnerabilities",

        "files": ["src/auth.ts", "src/middleware/auth.ts"],

        "confidence": 100,

        "priority": 80

      },

      "route_to_agent": {

        "name": "security-auditor",

        "description": "Reviews code for security vulnerabilities, secrets, supply chain attacks"

      }

    },

    {

      "trigger": {

        "name": "file_pattern",

        "description": "*.ts"

      },

      "context": {

        "instructions": null,

        "files": ["src/auth.ts", "src/middleware/auth.ts"],

        "confidence": 100,

        "priority": 50

      },

      "route_to_agent": {

        "name": "language-reviewer-typescript",

        "description": "TypeScript-specific patterns and best practices"

      }

    }

  ]

}

```

**Response Fields:**

| Field | Description |

|-------|-------------|

| `instructions` | Array of routing instructions, one per agent to invoke |

| `trigger.name` | What triggered the routing: `file_pattern`, `file_regex`, `branch_regex`, `prompt_regex`, `llm_tag` |

| `trigger.description` | The specific pattern or tag that matched (e.g., `*.ts`, `security-concern`) |

| `context.instructions` | Optional agent-specific instructions from the agent definition |

| `context.files` | Files that triggered this routing (subset of input files) |

| `context.confidence` | 0-100 confidence level (100 = deterministic rule match, 85 = LLM tag match) |

| `context.priority` | 0-100 priority level from agent definition (higher = more important) |

| `route_to_agent.name` | Agent name to route to |

| `route_to_agent.description` | Agent description from config |

**Output (Prerequisites Not Met):**

The tool performs automatic prerequisite checks and returns helpful error messages:

```json

{"error": "Ollama is not running. Run init_llm first to start Ollama and load the model."}

```

```json

{"error": "Model not loaded into memory. Run init_llm to load it."}

```

When you receive these errors, call `init_llm` first.

---

## How It Works

1. **Stateless**: No state maintained between requests

2. **Config Loading**: Loads `agents.json`, `rules.json`, `llm-tags.json` on startup

3. **Git Context**: Auto-detects branch from current directory (for branch-based rules)

4. **LLM Tagging**: Analyzes **task, intent, and original_prompt** to identify semantic tags

5. **Rule Matching**: Evaluates ALL rules (file patterns, regex, branch patterns, AND tag-based)

6. **Return**: JSON result with routing instructions (empty result is valid if no rules match)

---

## Configuration Files

All routing logic lives in `config/*.json` - edit these to customize behavior:

### `config/agents.json`

Define available subagents with optional instructions and priority:

```json

{

  "agents": [

    {

      "name": "security-auditor",

      "description": "Reviews code for security vulnerabilities, secrets, supply chain attacks",

      "instructions": "Focus on OWASP Top 10 vulnerabilities and secret exposure",

      "priority": 80

    },

    {

      "name": "language-reviewer-typescript",

      "description": "TypeScript-specific patterns and best practices",

      "priority": 50

    }

  ]

}

```

**Agent Fields:**

| Field | Required | Default | Description |

|-------|----------|---------|-------------|

| `name` | Yes | - | Unique agent identifier |

| `description` | Yes | - | What this agent does (shown in routing response) |

| `instructions` | No | null | Agent-specific instructions included in routing response |

| `priority` | No | 50 | 0-100 priority level (higher = more important) |

### `config/rules.json`

Define routing rules with boolean logic:

```json

{

  "rules": [

    {

      "description": "Route TypeScript files to TS reviewer",

      "conditions": {

        "any_of": [

          {"file_pattern": "*.ts"},

          {"file_pattern": "*.tsx"}

        ]

      },

      "route_to_subagents": ["language-reviewer-typescript"]

    },

    {

      "description": "Security files AND security tag → security auditor",

      "conditions": {

        "all_of": [

          {"file_pattern": "*auth*"},

          {"llm_tag": "security-concern"}

        ]

      },

      "route_to_subagents": ["security-auditor", "code-reviewer"]

    },

    {

      "description": "Commit intent triggers code review",

      "conditions": {

        "llm_tag": "commit-review"

      },

      "route_to_subagents": ["code-reviewer"]

    },

    {

      "description": "PR intent triggers code review",

      "conditions": {

        "llm_tag": "pull-request"

      },

      "route_to_subagents": ["code-reviewer"]

    }

  ]

}

```

**Supported Conditions:**

- `file_pattern` - Glob match on file paths (e.g., `*.ts`, `*auth*`)

- `file_regex` - Regex match on file paths

- `prompt_regex` - Regex match on task, intent, or original_prompt

- `branch_regex` - Regex match on git branch name

- `llm_tag` - Match LLM-identified semantic tags (LLM analyzes task, intent, and original_prompt)

**Boolean Logic:**

- `any_of` - OR logic (match if ANY condition is true)

- `all_of` - AND logic (match if ALL conditions are true)

- Supports nesting for complex rules

### `config/llm-tags.json`

Define semantic tags for LLM to identify. The LLM analyzes **task, intent, and original_prompt** when identifying tags:

```json

{

  "tags": [

    {

      "name": "commit-review",

      "description": "Intent indicates preparing for a commit, pre-commit review, or finalizing changes",

      "examples": [

        "review before commit",

        "pre-commit check",

        "finalize changes"

      ]

    },

    {

      "name": "pull-request",

      "description": "Intent indicates preparing a pull request or code review for merge",

      "examples": [

        "create pull request",

        "prepare PR",

        "ready for review"

      ]

    },

    {

      "name": "security-concern",

      "description": "Code that handles authentication, authorization, encryption, secrets...",

      "examples": [

        "JWT token generation",

        "Password hashing",

        "API key handling"

      ]

    }

  ]

}

```

## Customization Examples

### Add a New Agent

Edit `config/agents.json`:

```json

{

  "agents": [

    {

      "name": "my-custom-agent",

      "description": "Does something special"

    }

  ]

}

```

### Add a Routing Rule

Edit `config/rules.json`:

```json

{

  "rules": [

    {

      "description": "Route GraphQL files to API specialist",

      "conditions": {

        "file_pattern": "*.graphql"

      },

      "route_to_subagents": ["api-specialist"]

    },

    {

      "description": "Performance-critical code on hotfix branch",

      "conditions": {

        "all_of": [

          {"llm_tag": "performance-critical"},

          {"branch_regex": "^hotfix/.*"}

        ]

      },

      "route_to_subagents": ["performance-optimizer", "code-reviewer"]

    }

  ]

}

```

### Add a Custom LLM Tag

Edit `config/llm-tags.json`:

```json

{

  "tags": [

    {

      "name": "error-handling",

      "description": "Code that handles errors, exceptions, or error states",

      "examples": [

        "try-catch blocks",

        "error boundaries",

        "Result types"

      ]

    }

  ]

}

```

## Model Switching

The router supports models from **two sources**:

### Model Sources

| Source | `MODEL_SOURCE` | Model Name Format | Example |

|--------|---------------|-------------------|---------|

| **HuggingFace** (default) | `huggingface` | `username/repo-name` | `unsloth/SmolLM3-3B-128K-GGUF` |

| **Ollama** | `ollama` | `model:tag` | `llama3.2:3b` |

### Using HuggingFace Models (Default)

HuggingFace offers thousands of GGUF models. The router automatically prefixes with `hf.co/` when pulling:

```bash

# Default: HuggingFace SmolLM3

export MODEL_SOURCE="huggingface"

export MODEL_NAME="unsloth/SmolLM3-3B-128K-GGUF"

# Other HuggingFace models

export MODEL_NAME="bartowski/Qwen2.5-3B-Instruct-GGUF"

export MODEL_NAME="TheBloke/Llama-2-7B-GGUF"

```

Browse HuggingFace GGUF models: https://huggingface.co/models?library=gguf

### Using Ollama Models

For models from Ollama's native library:

```bash

export MODEL_SOURCE="ollama"

export MODEL_NAME="llama3.2:3b"

# Try different models

ollama pull granite4-h-micro:3b

export MODEL_NAME="granite4-h-micro:3b"

ollama pull qwen2.5:3b

export MODEL_NAME="qwen2.5:3b"

```

**Popular Ollama models:**

| Model | Size | Best For |

|-------|------|----------|

| `smollm3:3b` | 3B | Balanced, fast |

| `granite4-h-micro:3b` | 3B | Instruction following |

| `llama3.2:3b` | 3B | General purpose |

| `qwen2.5:3b` | 3B | Code understanding |

| `phi3:3.8b` | 3.8B | Reasoning |

Browse Ollama models: https://ollama.com/library

### LM Studio Support (Coming Soon)

LM Studio backend support is planned, allowing LM Studio as an alternative to Ollama.

### Advanced LLM Configuration

| Environment Variable | Default | Description |

|---------------------|---------|-------------|

| `THINKING_MODE` | `true` | Enable thinking/reasoning mode for supported models |

| `TEMPERATURE` | `0.1` | LLM temperature (0.0-1.0). Lower = more deterministic |

**Thinking Mode**: When enabled and the model supports it, the LLM will reason through its decisions before answering. This can improve classification accuracy for ambiguous requests.

Supported thinking models:

- `deepseek-r1` - DeepSeek's reasoning model

- `qwen3`, `qwen2.5` - Alibaba's multilingual models

- `cogito` - Specialized thinking model

- `qwq` - QwQ reasoning model

```bash

# Disable thinking mode (if model doesn't support it well)

export THINKING_MODE=false

# Use lower temperature for more deterministic results

export TEMPERATURE=0.05

```

---

## Architecture

This MCP is a **pure router** - it doesn't execute agents, it just determines which subagents should handle a request.

### Flow Diagram

```

Agent Call

  ├─ Task: "Fix auth bug"

  ├─ Intent: "review code before commit"

  ├─ Original Prompt: "Can you fix the login issue?" (optional)

  └─ Associated Files: ["src/auth.ts"] (provided by caller)

       ↓

┌──────────────────────────────────────────────────┐

│  Agent Router MCP (Stateless Router)             │

│                                                  │

│  1. Load Configs                                 │

│     • agents.json (agent definitions)            │

│     • rules.json (routing rules)                 │

│     • llm-tags.json (semantic tags)              │

│                                                  │

│  2. Auto-Detect Git Context                      │

│     • Branch only (for branch-based rules)       │

│                                                  │

│  3. LLM Semantic Tagging                         │

│     • Analyzes task + intent + original_prompt   │

│     • Returns: ["security-concern",              │

│                 "commit-review"]                 │

│                                                  │

│  4. Apply ALL Rules                              │

│     • File patterns: *.ts → ts-reviewer          │

│     • File patterns: *auth* → security-auditor   │

│     • Branch regex: feature/* → code-reviewer    │

│     • LLM tags: security-concern → auditor       │

│     • LLM tags: commit-review → code-reviewer    │

│                                                  │

│  5. Return Results                               │

│     • Empty result is valid (no rules matched)   │

└──────────────────────────────────────────────────┘

       ↓

Routing Result

  ├─ Agents: [language-reviewer-typescript, security-auditor, code-reviewer]

  ├─ Method: "rules+llm-tags"

  └─ Reasoning: "Rules + LLM semantic tags"

```

**Example Agent Names (included in default config):**

- **Language Reviewers**: `language-reviewer-typescript`, `language-reviewer-rust`, `language-reviewer-python`, `language-reviewer-javascript`, `language-reviewer-csharp`, `language-reviewer-lua`, `language-reviewer-zig`, `language-reviewer-gdscript`

- **Security & Quality**: `security-auditor`, `code-reviewer`

- **Testing**: `test-engineer-junior`, `test-engineer-midlevel`, `test-engineer-senior`

- **DevOps**: `devops-engineer-junior`, `devops-engineer-midlevel`, `devops-engineer-senior`

- **Specialized**: `planning-architect`, `documentation-writer`, `performance-optimizer`, `accessibility-specialist`

> **Note:** These are just names for routing. Your actual subagents live elsewhere (e.g., as separate MCP tools, CLI commands, or API endpoints).

## Creating Agents/Subagents

This router determines which agents should handle requests. You need to create the actual agent implementations in your IDE.

### Natively Supported

These platforms have built-in agent support:

#### Claude Code (Sub-Agents)

- Agents are markdown files with YAML frontmatter in `.claude/agents/`

- Supports per-agent model selection (`model: sonnet`, `opus`, `haiku`)

- Full documentation: [Claude Code Agents](https://github.com/anthropics/claude-code)

#### GitHub Copilot (Custom Agents)

- Agents are markdown files in `.github/agents/` or `{org}/.github/agents/`

- Supports custom prompts, tool selection, and MCP servers

- Full documentation: [GitHub Copilot Custom Agents](https://github.blog/changelog/2025-10-28-custom-agents-for-github-copilot/)

#### OpenCode (Agents/Subagents)

- Agents are markdown files in `~/.config/opencode/agent/` or `.opencode/agent/`

- Supports per-agent model selection and tool permissions

- Full documentation: [OpenCode Agents](https://opencode.ai/docs/agents/)

### Workarounds Available

These platforms can use this router via MCP integration or workarounds:

- **Cursor** - Use via MCP integration (agent functionality in development)

- **Windsurf** - Use via MCP integration

- **Cline** - Use via MCP integration

- **Roo** - Use via MCP integration

## Cross-Platform Support

This MCP server works on all major platforms. Ollama installation varies by platform:

| Platform | Installation | Service Behavior |

|----------|-------------|------------------|

| **macOS** | `brew install ollama` | Runs as background service automatically |

| **Linux** | `curl -fsSL https://ollama.com/install.sh \| sh` | Start with `ollama serve` |

| **Windows** | Download from [ollama.com](https://ollama.com) | Runs as Windows service automatically |

## Compiling from Source

If you prefer to build from source instead of downloading pre-built binaries:

### Prerequisites

- **Rust 1.70+**: Install from [rustup.rs](https://rustup.rs)

- **For cross-compilation (optional)**:

  - Zig: `brew install zig` (macOS) or [download from zig.dev](https://ziglang.org/download/)

  - MinGW-w64 (for Windows builds): `brew install mingw-w64` (macOS)

### Quick Build (Current Platform)

```bash

# Clone the repository

git clone https://github.com/yourusername/agent-router-mcp.git

cd agent-router-mcp

# Build using the build script

./scripts/build-all.sh

# Binaries will be created in dist/

```

### Cross-Platform Build (All Targets)

To build binaries for all platforms (Linux, macOS, Windows):

```bash

# Install prerequisites (macOS)

brew install zig mingw-w64

# Install cargo-zigbuild

cargo install cargo-zigbuild

# Build for all platforms

./scripts/build-all.sh

# Binaries will be created in dist/:

# - agent-router-mcp-linux-amd64

# - agent-router-mcp-linux-arm64

# - agent-router-mcp-macos-intel

# - agent-router-mcp-macos-silicon

# - agent-router-mcp-windows-amd64.exe

# - agent-router-mcp-windows-arm64.exe

```

### Manual Build

```bash

# Build for current platform

cargo build --release

# Binary will be at: target/release/agent-router-mcp

# (or target/release/agent-router-mcp.exe on Windows)

```

## Development

```bash

# Run tests

cargo test

# Run with debug logging

RUST_LOG=debug cargo run

# Format code

cargo fmt

# Lint

cargo clippy

# Build for development

cargo build

```

## License

MIT
ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/forge18/agent-router-mcp

Awesome Lists containing this project

README