https://github.com/tyabu12/hamoru

"Terraform for LLMs." Declaratively orchestrate multiple LLM providers in harmony, with policy-based model selection and cost impact prediction. Named after Japanese ハモる (to harmonize) — because your LLMs should sing together, not solo.

Host: GitHub
URL: https://github.com/tyabu12/hamoru
Owner: tyabu12
License: mit
Created: 2026-03-28T06:17:57.000Z (4 months ago)
Default Branch: main
Last Pushed: 2026-03-28T17:07:44.000Z (4 months ago)
Last Synced: 2026-03-28T17:11:17.551Z (4 months ago)
Topics: agent-collaboration, ai, anthropic, async, cli, cost-optimization, declarative, developer-tools, llm, llm-orchestration, multi-provider, openai, openai-compatible, policy-engine, rust, tokio, workflow-engine
Language: Rust
Homepage:
Size: 182 KB
Stars: 0
Watchers: 0
Forks: 0
Open Issues: 2
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Security: SECURITY.md

# 🎶 hamoru

**"Terraform for LLMs."**

Declaratively orchestrate multiple LLM providers in harmony,\
with policy-based model selection and cost impact prediction.

*Named after Japanese ハモる (to harmonize)*\
*— because your LLMs should sing together, not solo.*

[![CI](https://github.com/tyabu12/hamoru/actions/workflows/ci.yml/badge.svg?branch=main)](https://github.com/tyabu12/hamoru/actions/workflows/ci.yml)

[![Security Audit](https://github.com/tyabu12/hamoru/actions/workflows/security.yml/badge.svg?branch=main)](https://github.com/tyabu12/hamoru/actions/workflows/security.yml)

[![coverage](https://img.shields.io/endpoint?url=https://gist.githubusercontent.com/tyabu12/8c8891a593f77b776e5d672b8dd8ab2c/raw/hamoru-coverage.json)](https://gist.github.com/tyabu12/8c8891a593f77b776e5d672b8dd8ab2c)



> 🚧 **This project is under active development and is not production-ready.** 🚧
>
> Use at your own risk. APIs and configuration formats may change without notice.

## 🎯 Current Phase

**Phase 5: API Server (serve)**

See [design-plan.md](docs/design-plan.md) for the full roadmap.

## ✨ Key Differentiators

1. 🎼 **Declarative Agent Collaboration** — Define LLM collaboration patterns (Generator/Evaluator, Pipeline, Debate) in YAML

2. ⚖️ **Policy as Code** — Intent-based model selection: `tags: [review] → quality-first policy → Opus auto-selected`

3. 🔮 **Cost Impact Prediction** — `hamoru plan` simulates cost changes before applying policy updates
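
The `tags: [review] → quality-first policy → model` flow from item 2 can be sketched roughly as follows. This is a minimal illustration, not hamoru's actual policy engine: the `Policy`, `Model`, and `Objective` types, the quality scores, and the prices are all hypothetical.

```rust
// Hypothetical types for illustration; hamoru's real policy schema may differ.
#[derive(Debug, Clone, Copy, PartialEq)]
enum Objective {
    QualityFirst,
    CostFirst,
}

struct Policy {
    name: &'static str,
    match_tags: &'static [&'static str],
    objective: Objective,
}

struct Model {
    id: &'static str,
    quality: u8,            // made-up relative score, higher is better
    cost_per_mtok_usd: f64, // made-up price per million tokens
}

/// Return the first policy whose tag list intersects the request tags.
fn match_policy<'a>(policies: &'a [Policy], tags: &[&str]) -> Option<&'a Policy> {
    policies
        .iter()
        .find(|p| p.match_tags.iter().any(|t| tags.contains(t)))
}

/// Pick the model that best satisfies the policy's objective.
fn select_model<'a>(policy: &Policy, models: &'a [Model]) -> Option<&'a Model> {
    match policy.objective {
        Objective::QualityFirst => models.iter().max_by_key(|m| m.quality),
        Objective::CostFirst => models
            .iter()
            .min_by(|a, b| a.cost_per_mtok_usd.total_cmp(&b.cost_per_mtok_usd)),
    }
}
```

Under these assumptions, a request tagged `review` would match a quality-first policy and resolve to the highest-scoring model, while a request with no matching tag returns `None` and would need a default policy.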

## 🔌 Supported Providers

| Provider | Type | Models (built-in catalog) | Status |
|----------|------|---------------------------|--------|
| [Anthropic](https://platform.claude.com/docs/en/home) | Cloud API | `claude-sonnet-4-6`, `claude-haiku-4-5` | ✅ |
| [DeepSeek](https://api-docs.deepseek.com/) | OpenAI-compatible | — | 🔲 Planned (post-v1.0) |
| [Google Gemini](https://ai.google.dev/gemini-api/docs) | Cloud API | — | 🔲 Planned (post-v1.0) |
| [Groq](https://console.groq.com/docs/overview) | OpenAI-compatible | — | 🔲 Planned (post-v1.0) |
| [Ollama](https://ollama.com) | Local | `llama3.3:70b`, `qwen2.5-coder:14b` | ✅ |
| [OpenAI](https://platform.openai.com/docs) | Cloud API | — | 🔲 Planned (post-v1.0) |

> Models listed above are from the built-in catalog with default pricing. You can configure any model your provider supports via `hamoru.yaml` — including custom cost overrides.
>
> Providers marked **OpenAI-compatible** will use the OpenAI adapter with a custom `base_url`. Any OpenAI-compatible API (Mistral, Together, Fireworks, etc.) can be configured the same way.

## 🏗️ Architecture

```
┌──────────────────────────────────────────────────────────┐
│                    User Interface                         │
│  CLI: hamoru plan / apply / serve / status                │
│  REST: OpenAI-compatible API (POST /v1/chat/completions)  │
├──────────────────────────────────────────────────────────┤
│  Layer 5: Agent Collaboration Engine          [NEW]       │
│  Declarative agent definitions (YAML)                     │
│  Patterns: Generator/Evaluator, Pipeline, Debate          │
│  Harness: cost caps, timeouts, quality gates              │
├──────────────────────────────────────────────────────────┤
│  Layer 4: Orchestration Engine                            │
│  Workflow definitions (YAML) → step DAG execution         │
│  Branching (Tool Calling / STATUS line), loops, parallel  │
├──────────────────────────────────────────────────────────┤
│  Layer 3: Policy Engine                                   │
│  Declarative policies: cost caps / quality / latency      │
│  Task intent (tags) → policy matching → model selection   │
├──────────────────────────────────────────────────────────┤
│  Layer 2: Provider Abstraction                            │
│  Unified trait: LlmProvider (direct impl w/ reqwest+serde)│
│  Adapters: Claude API / Ollama → later: OpenAI / Gemini   │
├──────────────────────────────────────────────────────────┤
│  Layer 1: Configuration & Telemetry                       │
│  Configuration: YAML (Git-managed, Hot Reload)            │
│  Telemetry: execution history / cost (InMemory→SQLite→S3) │
│  plan (Telemetry-based cost impact prediction)            │
└──────────────────────────────────────────────────────────┘
```
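
To make Layer 2 more concrete, here is a simplified, synchronous sketch of what a unified provider trait with pluggable adapters might look like. Only the trait name `LlmProvider` comes from the diagram; the method names, the `ChatRequest`/`ChatResponse` types, and the `EchoProvider` stand-in are assumptions. The real trait is presumably async and backed by `reqwest` and `serde`.

```rust
// Hypothetical shape only; the real `LlmProvider` in hamoru-core may differ.
struct ChatRequest {
    model: String,
    prompt: String,
}

struct ChatResponse {
    text: String,
    input_tokens: u32,
}

/// Every adapter (Claude API, Ollama, ...) would implement the same trait.
trait LlmProvider {
    fn id(&self) -> &str;
    fn chat(&self, req: &ChatRequest) -> Result<ChatResponse, String>;
}

/// Stand-in adapter that echoes the prompt instead of calling a real API.
struct EchoProvider {
    id: String,
}

impl LlmProvider for EchoProvider {
    fn id(&self) -> &str {
        &self.id
    }

    fn chat(&self, req: &ChatRequest) -> Result<ChatResponse, String> {
        Ok(ChatResponse {
            text: format!("[{}] {}", req.model, req.prompt),
            // Whitespace-separated words as a crude placeholder for token counting.
            input_tokens: req.prompt.split_whitespace().count() as u32,
        })
    }
}

/// Look up an adapter by provider id and forward the request to it.
fn dispatch(
    providers: &[Box<dyn LlmProvider>],
    provider_id: &str,
    model: &str,
    prompt: &str,
) -> Result<ChatResponse, String> {
    let provider = providers
        .iter()
        .find(|p| p.id() == provider_id)
        .ok_or_else(|| format!("unknown provider: {provider_id}"))?;
    provider.chat(&ChatRequest {
        model: model.to_string(),
        prompt: prompt.to_string(),
    })
}
```

The upper layers only see `dyn LlmProvider`, so adding an adapter would not require changes to the policy or orchestration code.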

## 📋 Prerequisites

- [Rust](https://rustup.rs/) stable toolchain (`clippy` and `rustfmt` components)

## 🚀 Quick Start

```bash
cargo build
cargo run --bin hamoru -- init
```

### 🏠 Option A: Local LLM (no API key required)

Install [Ollama](https://ollama.com), then add it to your config (`.hamoru/hamoru.yaml`).  

We use `llama3.2` (3B, ~2 GB) here for a quick first run — you can swap in any model Ollama supports:

```yaml
providers:
  - name: local
    type: ollama
    endpoint: http://localhost:11434
    models:
      - llama3.2
```

```bash
ollama pull llama3.2
cargo run --bin hamoru -- providers test
cargo run --bin hamoru -- run -m local:llama3.2 "Hello, world!"
```

### ☁️ Option B: Cloud LLM (API key required)

Set your API key as an environment variable:

```bash
# Recommended: read without echoing to avoid shell history leakage
printf "Enter API key: " && read -rs HAMORU_ANTHROPIC_API_KEY && export HAMORU_ANTHROPIC_API_KEY
echo  # newline after silent input
```

> **Security note:** Avoid typing API keys directly in commands (e.g., `export KEY=sk-ant-...`) — they may be saved in your shell history file. Use `read -rs` as shown above, or load from a secrets manager.

```bash
cargo run --bin hamoru -- providers test
cargo run --bin hamoru -- run -m claude:claude-sonnet-4-6 "Hello, world!"
```

### More examples

```bash
# Policy-based model selection
cargo run --bin hamoru -- run -p cost-optimized "Summarize this text"

# Tag-based routing
cargo run --bin hamoru -- run --tags review "Review this code for security issues"

# Multi-step workflow
cargo run --bin hamoru -- run -w workflow.yaml "Implement an auth API"
```

### 🌐 OpenAI-compatible API server

```bash
# Start the server (localhost:3000 by default)
HAMORU_API_KEYS=my-secret-key cargo run --bin hamoru -- serve

# In another terminal — use any OpenAI-compatible client
curl http://localhost:3000/v1/models \
  -H "Authorization: Bearer my-secret-key"

curl http://localhost:3000/v1/chat/completions \
  -H "Authorization: Bearer my-secret-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "claude:claude-sonnet-4-6",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
```

> **Tip:** Omit `HAMORU_API_KEYS` to skip authentication (convenient for local dev).
> Use `--port` and `--bind` to customize the listen address.
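
The bearer-token behavior described above (authentication is skipped when `HAMORU_API_KEYS` is unset) could be modeled along these lines. This is a sketch under stated assumptions, not the server's actual code: the function name `is_authorized` is hypothetical, and treating `HAMORU_API_KEYS` as a comma-separated list is a guess based only on the plural variable name.

```rust
/// Decide whether a request may proceed.
///
/// `configured` mirrors the raw value of `HAMORU_API_KEYS` (`None` if unset).
/// `authorization_header` is the request's `Authorization` header, if any.
/// Assumption: multiple keys are comma-separated; the README does not say so.
fn is_authorized(configured: Option<&str>, authorization_header: Option<&str>) -> bool {
    // No keys configured: authentication is disabled, as in the tip above.
    let Some(keys) = configured else {
        return true;
    };

    // Keys configured: require a well-formed `Bearer <token>` header.
    let Some(token) = authorization_header.and_then(|h| h.strip_prefix("Bearer ")) else {
        return false;
    };

    // Accept the token if it equals any non-empty configured key.
    keys.split(',')
        .map(str::trim)
        .any(|key| !key.is_empty() && key == token)
}
```

A production check would normally compare tokens in constant time to avoid timing side channels; plain `==` is used here only to keep the sketch short.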

## 🔑 Environment Variables

| Variable | Provider | Status |
|----------|----------|--------|
| `HAMORU_API_KEYS` | hamoru serve (API auth) | ✅ |
| `HAMORU_ANTHROPIC_API_KEY` | Anthropic | ✅ |
| `HAMORU_DEEPSEEK_API_KEY` | DeepSeek | 🔲 Planned (post-v1.0) |
| `HAMORU_GEMINI_API_KEY` | Google Gemini | 🔲 Planned (post-v1.0) |
| `HAMORU_GROQ_API_KEY` | Groq | 🔲 Planned (post-v1.0) |
| `HAMORU_OPENAI_API_KEY` | OpenAI | 🔲 Planned (post-v1.0) |

> Ollama runs locally and does not require an API key.

## 📖 Commands

### Top-level commands

| Command | Description | Status |
|---------|-------------|--------|
| `hamoru init` | Initialize project (creates `.hamoru/` with config templates) | ✅ |
| `hamoru run ` | Execute a prompt, workflow, or collaboration | ✅ |
| `hamoru plan` | Telemetry-based cost impact prediction | ✅ |
| `hamoru metrics --period 7d` | View cost and performance metrics | ✅ |
| `hamoru providers list` | List configured providers and their models | ✅ |
| `hamoru providers test` | Test connectivity to all configured providers | ✅ |
| `hamoru telemetry show` | Show telemetry store details | ✅ |
| `hamoru telemetry pull` | Sync telemetry from remote storage | 🔲 Planned (remote config) |
| `hamoru telemetry push` | Sync telemetry to remote storage | 🔲 Planned (remote config) |
| `hamoru status` | Show current configuration overview | 🔲 Planned |
| `hamoru serve` | Start OpenAI-compatible API server | ✅ |
| `hamoru agents list` | List agent definitions | 🔲 Planned (Phase 6) |
| `hamoru agents test ` | Dry-run a collaboration pattern | 🔲 Planned (Phase 6) |
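
As a rough illustration of what "telemetry-based cost impact prediction" in `hamoru plan` could mean, the sketch below replays recorded token usage against two price tables and reports the difference. The `UsageRecord` and `Pricing` types, the function names, and every number are hypothetical; the actual `plan` algorithm is described in the design document, not here.

```rust
/// One recorded request from telemetry (hypothetical shape).
struct UsageRecord {
    input_tokens: u64,
    output_tokens: u64,
}

/// Prices in USD per million tokens (illustrative values only).
struct Pricing {
    input_per_mtok: f64,
    output_per_mtok: f64,
}

/// Total cost of replaying the recorded usage under the given pricing.
fn replay_cost(history: &[UsageRecord], pricing: &Pricing) -> f64 {
    history
        .iter()
        .map(|r| {
            (r.input_tokens as f64 * pricing.input_per_mtok
                + r.output_tokens as f64 * pricing.output_per_mtok)
                / 1_000_000.0
        })
        .sum()
}

/// Predicted change in cost if the same traffic were routed to `proposed`.
/// A negative result means the proposed configuration would be cheaper.
fn cost_delta(history: &[UsageRecord], current: &Pricing, proposed: &Pricing) -> f64 {
    replay_cost(history, proposed) - replay_cost(history, current)
}
```

This assumes future traffic resembles past traffic and that token counts would be identical across models, both of which are simplifications.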

### `hamoru run` options

| Flag | Description | Status |
|------|-------------|--------|
| `-m provider:model` | Direct model selection (e.g., `claude:claude-sonnet-4-6`) | ✅ |
| `-p policy-name` | Policy-based model selection (e.g., `cost-optimized`) | ✅ |
| `--tags tag1,tag2` | Tag-based routing (can combine with `-p`) | ✅ |
| `-w workflow.yaml` | Execute a multi-step workflow from YAML | ✅ |
| `-a collaboration` | Execute an agent collaboration pattern | 🔲 Planned (Phase 6) |
| `--no-stream` | Disable streaming (print full response at once) | ✅ |
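
One detail of the `provider:model` form is that Ollama model names contain a colon themselves (for example `llama3.3:70b`). A parser for the `-m` value would therefore plausibly split on the first colon only. The helper below is a hypothetical sketch of that idea, not hamoru's actual argument parsing.

```rust
/// Split a `provider:model` selector at the first colon.
/// Hypothetical helper; the real CLI may validate its input differently.
fn parse_model_selector(s: &str) -> Result<(&str, &str), String> {
    match s.split_once(':') {
        // Both halves must be non-empty; the model half may contain more colons.
        Some((provider, model)) if !provider.is_empty() && !model.is_empty() => {
            Ok((provider, model))
        }
        _ => Err(format!("expected `provider:model`, got `{s}`")),
    }
}
```

With this approach, `local:llama3.3:70b` yields the provider `local` and the model `llama3.3:70b`, while a bare `llama3.2` is rejected because no provider is given.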

## 🛠️ Development

```bash
# Run tests
cargo test --all-targets

# Check code quality
cargo clippy --all-targets -- -D warnings
cargo fmt --all --check
```

### E2E smoke test

```bash
# Offline only (no API key or Ollama needed)
bash scripts/smoke-test.sh --offline

# Auto-detect (runs Anthropic tests if API key is set, Ollama tests if server is running)
bash scripts/smoke-test.sh

# With Anthropic API tests
printf "API key: " && read -rs HAMORU_ANTHROPIC_API_KEY && export HAMORU_ANTHROPIC_API_KEY
bash scripts/smoke-test.sh

# With Ollama tests (start Ollama in a separate terminal first)
# Terminal 1: ollama serve
# Terminal 2:
ollama pull qwen2.5:0.5b  # recommended: lightweight (~400MB), responds in seconds
bash scripts/smoke-test.sh

# Verbose output (show stdout/stderr for all tests)
bash scripts/smoke-test.sh --verbose
```

See [CONTRIBUTING.md](CONTRIBUTING.md) for full development setup and coding rules.

## 📁 Project Structure

```
hamoru/
├── crates/
│   ├── hamoru-core/          # Core library
│   │   └── src/
│   │       ├── provider/     # Layer 2: LLM provider adapters
│   │       ├── telemetry/    # Layer 1: Execution history & metrics
│   │       ├── config/       # Layer 1: YAML config loading
│   │       ├── policy/       # Layer 3: Policy engine
│   │       ├── orchestrator/ # Layer 4: Workflow execution
│   │       ├── agents/       # Layer 5: Agent collaboration (planned)
│   │       ├── server/       # API layer: OpenAI wire types, namespace, translation
│   │       └── error.rs      # Unified error types
│   └── hamoru-cli/           # CLI entry point
├── docs/
│   ├── design-plan.md        # Detailed design document
│   └── decisions/            # Architecture Decision Records
├── CLAUDE.md                 # Project context for Claude Code
├── CONTRIBUTING.md           # Development guidelines
└── SECURITY.md               # Security policy
```

## 📄 License

[MIT](LICENSE)
