https://github.com/actguard/research-agent

AI-powered research agent that crawls the internet, reports in markdown, without losing control
https://github.com/actguard/research-agent

actguard cost-control langgraph-python multi-step-reasoning research-agent tavily

Last synced: about 17 hours ago
JSON representation

AI-powered research agent that crawls the internet, reports in markdown, without losing control

Host: GitHub
URL: https://github.com/actguard/research-agent
Owner: ActGuard
License: mit
Created: 2026-03-24T03:21:40.000Z (about 1 month ago)
Default Branch: main
Last Pushed: 2026-03-24T22:05:59.000Z (about 1 month ago)
Last Synced: 2026-03-25T03:48:42.886Z (about 1 month ago)
Topics: actguard, cost-control, langgraph-python, multi-step-reasoning, research-agent, tavily
Language: Python
Homepage: https://actguard.ai
Size: 1.84 MB
Stars: 1
Watchers: 0
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

          # Research Agent with Runtime Guardrails

A simple, budget-controlled research agent that searches the web, scrapes and compresses sources, and generates markdown reports -- without running out of control.

**~70 research questions for $1.** Perfect for quick lookups where you need a fast answer to know where to dig deeper: which foods are on a specific diet, initial research on AI agent architectures, comparing tool options, etc. 

![til](./docs/rdemo.gif)

## What this repo demonstrates

This is a controlled research agent with:

- 💸 Budget enforcement (~$0.02 per run)

- 🔍 Web search + scraping + semantic compression

- 📊 Cost + usage tracking

Inspired by [gpt-researcher](https://github.com/assafelovic/gpt-researcher), it uses a simple linear pipeline and exposes its capabilities over the [Agent-to-Agent (A2A) protocol](https://google.github.io/A2A/) so any A2A-compatible client can invoke it.

## Architecture

The agent runs a four-step linear pipeline:

```

query

  |

  v

search              -- web search via Tavily

  |

  v

scrape + compress   -- fetch pages with Crawl4AI, compress via embeddings

  |

  v

assemble context    -- join compressed pages, truncate if needed

  |

  v

generate report     -- single LLM call to produce markdown report

```

Each step is a plain async function -- no graph framework, no multi-agent orchestration. The entire pipeline is ~165 lines of code.

## ActGuard - Budget Control

![ActGuard Dashboard](docs/actguard-cost-breakdown.png)

[ActGuard](https://actguard.ai) is integrated as a budget control and cost tracking layer. Every expensive operation in the pipeline -- LLM calls, web searches, and page scrapes -- is wrapped in an ActGuard budget guard. This prevents runaway API costs during research.

How it works:

- Each research run is started with a configurable cost limit (default: 500 units)

- Individual operations are tracked under named guards (`search`, `scrape`, `write_report`)

- If the budget is exceeded mid-run, ActGuard raises a `BudgetExceededError` and the agent returns a graceful error instead of continuing to spend

To enable budget tracking, visit [actguard.ai](https://actguard.ai), create a free account, and add your `ACTGUARD_API_KEY` to `.env`. If unset, budget tracking is disabled.

## Key Libraries

| Library | Purpose |

|---|---|

| [Tavily](https://tavily.com/) | Web search API optimized for AI agents. |

| [Crawl4AI](https://github.com/unclecode/crawl4ai) | Async web scraper with headless browser and markdown extraction. |

| [OpenAI Embeddings](https://platform.openai.com/docs/guides/embeddings) | Semantic compression -- keeps only the chunks relevant to the query. |

| [ActGuard](https://actguard.ai) | Budget control and cost tracking for AI agent operations. |

| [A2A SDK](https://google.github.io/A2A/) | Agent-to-Agent protocol. Exposes the agent as a JSON-RPC endpoint. |

| [LangChain OpenAI](https://python.langchain.com/) | OpenAI integration for LLM calls with structured output. |

| [Streamlit](https://streamlit.io/) | Chat UI for interactive research sessions. |

## Project Structure

```

research-agent/

├── app/

│   ├── __init__.py

│   ├── __main__.py                # Entry point -- starts the A2A server

│   ├── agent_executor.py          # A2A AgentExecutor implementation

│   ├── config.py                  # Settings (env vars + defaults)

│   ├── a2a_auth.py                # HMAC authentication middleware

│   ├── researcher/

│   │   ├── graph.py               # Research pipeline (search → scrape → compress → report)

│   │   ├── prompts.py             # LLM prompt templates

│   │   ├── schemas.py             # Pydantic output models

│   │   ├── errors.py              # Custom exceptions

│   │   └── actguard_client.py     # ActGuard client initialization

│   └── services/

│       ├── llm.py                 # OpenAI async client

│       ├── search.py              # Tavily search client

│       ├── scraper.py             # Crawl4AI web scraper

│       └── embeddings.py          # Semantic compression via embeddings

├── chat.py                        # Streamlit chat UI

├── scripts/

│   └── sign_request.py            # Send HMAC-signed A2A requests (testing helper)

├── config/

│   └── a2a_auth.json              # A2A authentication config

├── tests/

│   ├── test_client.py             # Integration tests (A2A endpoints)

│   └── test_graph.py              # Unit tests (pipeline execution)

├── .env.example

├── .gitignore

├── pyproject.toml

└── uv.lock

```

## Prerequisites

- Python 3.12+

- [uv](https://docs.astral.sh/uv/) package manager

- An [OpenAI API key](https://platform.openai.com/api-keys)

- A [Tavily API key](https://app.tavily.com/)

- (Optional) An [ActGuard](https://actguard.ai) account (free) for measuring agent cost

## Quick Start

```bash

# 1. Clone the repo

git clone https://github.com/ActGuard/research-agent.git

cd research-agent

# 2. Copy the env template and fill in your API keys

cp .env.example .env

# 3. Install dependencies

uv sync

# 4. Start the agent server

uv run python -m app

```

The server starts on `http://localhost:10000`. Verify it's running:

```bash

curl http://localhost:10000/.well-known/agent.json

```

## Environment Variables

| Variable | Default | Description |

|---|---|---|

| `OPENAI_API_KEY` | *(required)* | OpenAI API key |

| `TAVILY_API_KEY` | *(required)* | Tavily search API key |

| `A2A_HMAC_SECRET` | `""` | 64-char hex string (256-bit) for signing A2A requests. Generate one with `openssl rand -hex 32` |

| `ACTGUARD_API_KEY` | `""` | ActGuard API key for cost tracking. Create a free account at [actguard.ai](https://actguard.ai). Optional -- budget tracking is disabled if unset |

| `HOST` | `localhost` | Server bind address |

| `PORT` | `10000` | Server port |

| `OPENAI_MODEL` | `gpt-4o-mini` | Default OpenAI model |

| `MAX_SEARCH_RESULTS` | `5` | Tavily results per query |

| `MAX_SCRAPE_URLS` | `5` | Max pages to scrape per run |

| `MAX_CONTEXT_CHARS` | `50000` | Context truncation limit |

| `REPORT_FORMAT` | `markdown` | Output format hint passed to the report writer |

Model & embedding overrides

| Variable | Default | Description |

|---|---|---|

| `MODEL_WRITE_REPORT` | `OPENAI_MODEL` | Model used for report generation |

| `EMBEDDING_MODEL` | `text-embedding-3-small` | Embedding model for semantic compression |

| `CHUNK_SIZE` | `1000` | Characters per chunk for embedding |

| `CHUNK_OVERLAP` | `100` | Overlap between chunks |

| `SIMILARITY_THRESHOLD` | `0.75` | Minimum similarity to keep a chunk |

## Streamlit Chat UI

For a conversational interface, run the Streamlit app directly -- no A2A server needed:

```bash

uv run streamlit run chat.py

```

This opens a browser-based chat where you can ask research questions interactively. Use the sidebar to switch between demo users or clear the chat history.

## Invoking the Agent (A2A)

Pass your research question as a command-line argument:

```bash

uv run python scripts/sign_request.py "What are the main approaches to quantum error correction?"

```

> **Note:** Queries are limited to 400 characters.

The script sends a signed A2A `message/send` JSON-RPC request to the running server and prints the response. A successful response looks like:

```json

{

  "jsonrpc": "2.0",

  "id": 1,

  "result": {

    "status": { "state": "completed" },

    "artifacts": [

      {

        "artifactId": "...",

        "name": "Research Report",

        "parts": [{ "text": "# Quantum Error Correction\n..." }]

      }

    ]

  }

}

```

## Testing

Unit tests (runs the pipeline directly -- requires API keys):

```bash

uv run pytest tests/test_graph.py

```

Integration tests (requires a running server):

```bash

uv run python -m app &        # start the server

uv run pytest tests/test_client.py

```

## References

- [gpt-researcher](https://github.com/assafelovic/gpt-researcher) -- inspiration for the research pipeline

- [A2A protocol](https://google.github.io/A2A/) -- Agent-to-Agent interoperability spec

- [Tavily](https://tavily.com/) -- search API for AI agents

- [Crawl4AI](https://github.com/unclecode/crawl4ai) -- async web scraper with headless browser

- [ActGuard](https://actguard.ai) -- budget control for AI agents

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/actguard/research-agent

Awesome Lists containing this project

README