https://github.com/orra-dev/orra
Resilience for AI Agent workflows.
https://github.com/orra-dev/orra
agents ai ai-agents ai-developer-tools ai-in-production durable-execution go golang javascript-sdk llm-apps orchestrator python-sdk reasoning reliability
Last synced: 5 months ago
JSON representation
Resilience for AI Agent workflows.
- Host: GitHub
- URL: https://github.com/orra-dev/orra
- Owner: orra-dev
- License: mpl-2.0
- Created: 2024-04-30T10:17:14.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2025-05-05T22:06:25.000Z (5 months ago)
- Last Synced: 2025-05-05T23:19:56.406Z (5 months ago)
- Topics: agents, ai, ai-agents, ai-developer-tools, ai-in-production, durable-execution, go, golang, javascript-sdk, llm-apps, orchestrator, python-sdk, reasoning, reliability
- Language: Go
- Homepage: https://orra.dev
- Size: 1.9 MB
- Stars: 194
- Watchers: 8
- Forks: 9
- Open Issues: 27
-
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
Awesome Lists containing this project
README
# 🪡 orra
Orra is infrastructure for resilient AI agent workflows. It helps your agents recover from failures like API outages, failed evals, and more - keeping your workflows moving forward.

By intelligently coordinating tasks across your agents, tools, and existing stack, orra ensures robust execution in any environment. It’s designed to work seamlessly with any language, agent framework, or deployment platform.
* 🧠 Planning agent with automatic agent/service discovery
* 🗿 Durable execution with state persistence
* 🎯 Pre-validated execution plans
* ↩️ Revert state to handle failures
* 🕵 Audit logs for traceability
* 🚀 Go fast and save cost with tools as services
* ⛑️ Automatic health monitoring
* 🔮 Real-time status tracking
* 🏢 On-premises deployment
* 🪝 Webhooks notifications for completions and failures[Learn why we built orra →](https://tinyurl.com/orra-launch-blog-post)
### Coming Soon
* Integration adapters for popular agent frameworks
* Scale your workflows with reliable coverage
* Planning course correction for failed evals
* Agent replay and multi-LLM consensus planning
* End-to-end encryption
* Granular workflow access controls
* Continuous adjustment of Agent workflows during runtime
* Additional language SDKs - Ruby, DotNet and Go very soon!
* MCP integration
* SOC 2 and GDPR readiness to meet the needs of regulated industries## Table of Contents
- [Installation](#installation)
- [How The Plan Engine Works](#how-the-plan-engine-works)
- [How orra Compares](#how-orra-compares)
- [Guides](#guides)
- [Explore Examples](#explore-examples)
- [Docs](#docs)
- [Self Hosting & On-premises Deployment](#self-hosting--on-premises-deployment)
- [Support](#support)
- [Telemetry](#telemetry)
- [License](#license)## Installation
### Prerequisites
- [Docker](https://docs.docker.com/desktop/) and [Docker Compose](https://docs.docker.com/compose/install/) - For running the Plan Engine
- Set up Reasoning and Embedding Models to power task planning and execution plan caching/validation#### Setup Models for Plan Engine
Select from a variety of supported models:
**Reasoning Models**:
- OpenAI's `o1-mini` or `o3-mini` on cloud
- `deepseek-r1` or `qwq-32b` on cloud or self-hosted (on-premises or locally)**Embedding Models**:
- OpenAI's `text-embedding-3-small` on cloud
- `jina-embeddings-v2-small-en` on cloud or self-hosted (on-premises or locally)> **Note**: The Plan Engine requires all model endpoints to be **OpenAI API-compatible**. Most model serving solutions (like vLLM, LMStudio, Ollama, etc.) can be configured to expose this compatible API format.
**Quick Cloud Setup Example**:
Update the .env based on the [_env](planengine/_env) file with one of these:
```shell
# OpenAI Reasoning
LLM_MODEL=o1-mini
LLM_API_KEY=your_api_key
LLM_API_BASE_URL=https://api.openai.com/v1# OpenAI Embeddings
EMBEDDINGS_MODEL=text-embedding-3-small
EMBEDDINGS_API_KEY=your_api_key
EMBEDDINGS_API_BASE_URL=https://api.openai.com/v1
```**Self-hosted/On-premises Example**:
Update the .env based on the [_env](planengine/_env) file with one of these:
```shell
# Self-hosted QwQ model
LLM_MODEL=qwq-32b-q8
LLM_API_KEY=your_internal_key # Optional depending on your setup
LLM_API_BASE_URL=http://your-internal-server:8000/v1# Self-hosted Jina embeddings
EMBEDDINGS_MODEL=jina-embeddings-v2-small-en
EMBEDDINGS_API_KEY=your_internal_key # Optional depending on your setup
EMBEDDINGS_API_BASE_URL=http://your-internal-server:8001/v1
```→ [Complete Model Configuration Documentation](docs/model-configuration.md)
### 1. Install orra CLI
Download the latest CLI binary for your platform from our [releases page](https://github.com/orra-dev/orra/releases):
```shell
# macOS
curl -L https://github.com/orra-dev/orra/releases/download/v0.2.6/orra-darwin-arm64 -o /usr/local/bin/orra
chmod +x /usr/local/bin/orra# Linux
curl -L https://github.com/ezodude/orra/releases/download/v0.2.6/orra-linux-amd64 -o /usr/local/bin/orra
chmod +x /usr/local/bin/orra# Verify installation
orra version
```→ [Full CLI documentation](docs/cli.md)
### 2. Get orra Plan Engine Running
Clone the repository and start the Plan Engine:
```shell
git clone https://github.com/ezodude/orra.git
cd orra/planengine# Start the Plan Engine
docker compose up --build
```## How The Plan Engine Works
The Plan Engine powers your multi-agent applications through intelligent planning and reliable execution:
### Progressive Planning Levels
#### 1. Base Planning
Your agents stay clean and simple (wrapped in the orra SDK):
**Python**
```python
from orra import OrraAgent, Task
from pydantic import BaseModelclass ResearchInput(BaseModel):
topic: str
depth: strclass ResearchOutput(BaseModel):
summary: stragent = OrraAgent(
name="research-agent",
description="Researches topics using web search and knowledge base",
url="https://api.orra.dev",
api_key="sk-orra-..."
)@agent.handler()
async def research(task: Task[ResearchInput]) -> ResearchOutput:
results = await run_research(task.input.topic, task.input.depth)
return ResearchOutput(summary=results.summary)
```**JavaScript**
```javascript
import { initAgent } from '@orra.dev/sdk';const agent = initAgent({
name: 'research-agent',
orraUrl: process.env.ORRA_URL,
orraKey: process.env.ORRA_API_KEY
});await agent.register({
description: 'Researches topics using web search and knowledge base',
schema: {
input: {
type: 'object',
properties: {
topic: { type: 'string' },
depth: { type: 'string' }
}
},
output: {
type: 'object',
properties: {
summary: { type: 'string' }
}
}
}
});agent.start(async (task) => {
const results = await runResearch(task.input.topic, task.input.depth);
return { summary: results.summary };
});
```Features:
* AI analyzes intent and creates execution plans that target your components
* Automatic service discovery and coordination
* Parallel execution where possible#### 2. Production Planning with Domain Grounding
```yaml
# Define domain constraints
name: research-workflow
domain: content-generation
use-cases:
- action: "Research topic {topic}"
capabilities:
- "Web search access"
- "Knowledge synthesis"
constraints:
- "Verify sources before synthesis"
- "Maximum research time: 10 minutes"
```Features:
* Full semantic validation of execution plans
* Capability matching and verification
* Safety constraints enforcement
* State transition validation#### 3. Reliable Execution
```bash
# Execute an action with the Plan Engine
orra verify run "Research and summarize AI trends" \
--data topic:"AI in 2024" \
--data depth:"comprehensive"
```The Plan Engine ensures:
* Automatic service health monitoring
* Stateful execution tracking
* Built-in retries and recovery
* Real-time status updates
* Webhook events for result delivery and monitoring## How orra compares
Orra takes a unique approach to AI workflow orchestration. Here's how it compares to other solutions:
| | **orra** | **Agent Frameworks**
(e.g. Mastra, LangGraph) | **Workflow Engines**
(e.g. Temporal, Inngest) |
|-----------------------|--------------------------------------------------------------------------------------------------------------|-------------------------------------------------------------------------------|----------------------------------------------------------|
| **Purpose** | Multi-agent coordination layer | Build individual AI agents | Run pre-planned workflows |
| **Planning Style** | AI-driven plan generation | Hardcoded agent workflows | Manual workflow definition |
| **Error Recovery** | Auto-recovery without restart | Try/catch manual handling | Config-based retry policies |
| **Best For** | Complex unpredictable workflows | Single agent development | Repeatable business processes |
| **Example Use** | "Deliver this product by Friday" → dynamically coordinates research, inventory, delivery, and payment agents | "Analyze this document" → fixed steps of reading, extracting, and summarizing | "Process new signup" → predefined steps with retry logic |Orra is for building AI systems that need to adapt and recover when things go wrong, without brittle scripts or manual fixes.
## Guides
- [From Fragile to Production-Ready Multi-Agent App](https://github.com/orra-dev/agent-fragile-to-prod-guide)
- [From Fragile to Production-Ready Multi-Agent App (with Cloudflare Agents)](https://github.com/orra-dev/agent-fragile-to-prod-guide-with-cf-agents)## Explore Examples
- 🛒 [E-commerce AI Assistant (JavaScript)](examples/ecommerce-agent-app) - E-commerce customer service with a delivery specialized agent
- 👻 [Ghostwriters (Python)](examples/crewai-ghostwriters) - Content generation example showcasing how to use orra with [CrewAI](https://www.crewai.com)
- 📣 [Echo Tools as Service (JavaScript)](examples/echo-js) - Simple example showing core concepts using JS
- 📣 [Echo Tools as Service (Python)](examples/echo-python) - Simple example showing core concepts using Python## Docs
- [Rapid Multi-Agent App Development with orra](docs/rapid-agent-app-devlopment.md)
- [What is an Agent in orra?](docs/what-is-agent.md)
- [Orchestrating Actions with orra](docs/actions.md)
- [Monitoring with Webhooks](docs/monitoring-with-webhooks.md)
- [Domain Grounding Execution](docs/grounding.md)
- [Execution Plan Caching](docs/plan-caching.md)
- [Core Topics & Internals](docs/core.md)
- [Model Configuration for the orra Plan Engine](docs/model-configuration.md)## Self Hosting & On-premises Deployment
### Running Plan Engine
The orra Plan Engine is packaged with a [Dockerfile](planengine/Dockerfile) for easy deployment:
- **Local Development**: [Run it as a single instance](#installation) using Docker or Docker Compose
- **On-premises Deployment**: Deploy in your own infrastructure with your preferred orchestration system
- **Cloud Service**: Run on managed container services like [Digital Ocean's App Platform](https://docs.digitalocean.com/products/app-platform/how-to/deploy-from-monorepo/) or any Kubernetes environment### Using Self-hosted Models (Remote or On-premises)
The Plan Engine fully supports self-hosted open-source models:
- **Reasoning**: Deploy `deepseek-r1` or `qwq-32b` using your preferred model serving solution including on-premises
- **Embeddings**: Self-host `jina-embeddings-v2-small-en` for complete control> **Important**: Your model serving solution must expose an **OpenAI-compatible API**. Solutions like vLLM, LMStudio, Ollama with OpenAI compatibility mode, or Replicate all work great.
→ [Complete Model Configuration Guide](docs/model-configuration.md)
### Data Storage
The Plan Engine uses [BadgerDB](https://github.com/hypermodeinc/badger) embedded database to persist all state - operational information is queryable using the [orra CLI](docs/cli.md).
[Book an office hours slot](https://cal.com/orra-dev/office-hours) to get help hosting or running orra's Plan Engine for production.
## Support
Need help? We're here to support you:
- Report a bug or request a feature by creating an [issue](https://github.com/orra-dev/orra/issues/new?template=bug-report-feature-request.yml)
- Start a [discussion](https://github.com/orra-dev/orra/discussions) about your ideas or questions## Telemetry
See [telemetry.md](./docs/telemetry.md) for details on what is collected and how to opt out.
## License
Orra is MPL-2.0 licensed.