https://github.com/georgedittmar/pydantask

PydanTask– Structured, multi-step reasoning deep agents on PydanticAI
https://github.com/georgedittmar/pydantask

agentic-framework agents pydantic-ai

Last synced: about 1 month ago
JSON representation

PydanTask– Structured, multi-step reasoning deep agents on PydanticAI

Host: GitHub
URL: https://github.com/georgedittmar/pydantask
Owner: GeorgeDittmar
License: apache-2.0
Created: 2025-12-14T20:50:10.000Z (6 months ago)
Default Branch: main
Last Pushed: 2026-04-15T04:33:52.000Z (about 2 months ago)
Last Synced: 2026-04-15T06:27:05.521Z (about 2 months ago)
Topics: agentic-framework, agents, pydantic-ai
Language: Python
Homepage: https://georgedittmar.github.io/pydantask/
Size: 4.47 MB
Stars: 1
Watchers: 0
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
- Agents: docs/agents.md

Awesome Lists containing this project

README

          # PydanTask: Deep Agentic Harness for Pydantic AI

![Pydantask Logo](docs/imgs/pydantask_logo_v3.png)

PydanTask is a Harness for building deep, autonomous agents that don't just respond—they **reason**, **decompose**, and **execute**.

It builds on top of [Pydantic AI](https://ai.pydantic.dev/) and adds:

- Long‑horizon planning and hierarchical task management

- A reusable orchestration loop (`DeepAgent`) with planner → supervisor → worker/researcher/producer → critic

- Shared runtime state across agents (`RuntimeState`)

- Extensible capabilities via `CapabilityDescription` and custom tools/agents

The goal is to give you a solid “agentic backbone” you can adapt, without having to reinvent multi‑step planning and control logic yourself.

For the most up‑to‑date implementation details and API docs, see the hosted documentation: **[PydanTask Documentation](https://pydantask.readthedocs.io/en/latest/)**.

---

## High-Level Architecture

The core orchestrator is `DeepAgent`:

```python

from pydantask.agents.agent import DeepAgent

```

`DeepAgent` coordinates several built‑in agents:

- **Planner** – turns a single objective into a `Plan` (list of `TaskItem`s).

- **Supervisor** – chooses which tasks to run next, based on statuses and dependencies.

- **Researcher** – performs web/external research for tasks that need new information.

- **Producer** – synthesizes intermediate results into a final answer or artifact.

- **Critic** – evaluates task outputs and drives deterministic retry/fail transitions.

They all operate over a shared `RuntimeState`:

```python

from pydantask.models import RuntimeState, TaskItem, TaskResult, Plan

```

Key concepts:

- **Plan** (`Plan`):

  - `reasoning_steps`: planner’s internal notes

  - `tasks`: list of `TaskItem` instances

- **TaskItem**: one sub‑task in the plan, with:

  - `task_id`, `overall_objective`, `sub_task_objective`

  - `capability` (which sub‑agent to use, e.g. `"research_agent"`)

  - `sub_task_dependencies` (other task IDs that must complete first)

  - `status` (`TaskStatus`: `PENDING`, `READY`, `RUNNING`, `NEEDS_REVIEW`, `COMPLETED`, `FAILED`, `ERRORED`, `RERUN`)

  - `result` (`TaskResult`) and `task_feedback` (`TaskQAResult`)

- **RuntimeState**:

  - `plan: Dict[int, TaskItem]`

  - `objective: str`

  - `agent_registry: Dict[str, CapabilityDescription]`

  - `document_store`, `knowledge_store`, `runtime_steps`, etc.

The control loop in `DeepAgent.run()`:

1. Planner creates a `Plan` for the objective.

2. `RuntimeState` is initialized with the plan and capabilities.

3. In each cycle:

   - Supervisor decides which tasks to execute next.

   - Ready tasks (dependencies satisfied) are executed by the appropriate capability (sub‑agent).

   - Critic reviews each result and calls `handle_critic_result` to update status and retry/failed state.

4. Loop stops when the Supervisor sets `all_tasks_completed = True` or `max_steps` is reached.

For more detail, see `docs/agents.md`.

---

## Installation & Setup

PydanTask assumes you already have Pydantic AI and an OpenAI‑compatible model configured. You’ll also need a Tavily API key for the built‑in research agent.

### 1. Install dependencies

From your project root:

```bash

pip install -e .

```

(or however you manage your environment; if you use Poetry, adjust accordingly.)

### 2. Environment variables

Set the following environment variables (e.g. in your shell or a `.env` file):

- `OPENAI_API_KEY` – for the underlying OpenAIChatModel (or whatever your Pydantic AI provider expects).

- `TAVILY_API_KEY` – required by the default `research_agent` (via `tavily_search_tool`).

If `TAVILY_API_KEY` is missing, `DeepAgent.__init__` will raise a `ValueError`.

---

## Quickstart: Running a DeepAgent

Minimal example that creates a `DeepAgent` and runs it on a single objective:

```python

import asyncio

from pydantask.agents.agent import DeepAgent

async def main() -> None:

    agent = DeepAgent(

        prompt="Write an overview of ghost lights folklore and summarize scientific explanations.",

        model="gpt-4.1-mini",  # or any compatible OpenAIChatModel name

        max_steps=10,

    )

    runtime_state = await agent.run()

    # Inspect the final plan and results

    for task_id, task in sorted(runtime_state.plan.items()):

        print(f"Task {task_id} [{task.status}]: {task.sub_task_objective}")

        if task.result is not None:

            print("  Summary:", task.result.summary)

            print("  Outputs:", task.result.output_paths)

            print()

if __name__ == "__main__":

    asyncio.run(main())

```

What this does:

1. Constructs a `DeepAgent` with default Planner, Supervisor, Researcher, Producer, and Critic.

2. Planner breaks down the objective into `TaskItem`s (using built‑in capabilities `research_agent` and `producer_agent`).

3. Supervisor picks tasks to run in each loop iteration.

4. Researcher and Producer execute those tasks, writing notes/reports to `pydantask/tools/tmp_files/` when appropriate.

5. Critic evaluates each task result and marks tasks as `COMPLETED`, retryable (`READY`/`RERUN`), or `FAILED` based on configured retry limits.

6. When done, you get a `RuntimeState` with the full plan and results.

---

## Customizing Capabilities

You can add custom sub‑agents or tools via `CapabilityDescription` and the `sub_agents` argument.

### Example: custom agent capability

```python

from pydantic_ai import Agent

from pydantask.agents.agent import DeepAgent

from pydantask.models import CapabilityDescription, RuntimeState, TaskResult

my_special_agent = Agent(

    model=...,  # e.g. the same OpenAIChatModel

    name="_my_special_agent",

    system_prompt="You are a specialized agent for security analysis.",

    deps_type=RuntimeState,

    output_type=TaskResult,

    tools=[...],  # any tools it needs

)

custom_capability = CapabilityDescription(

    name="security_agent",  # used in TaskItem.capability

    description="Performs security-focused analysis and risk assessment.",

    tool_func=my_special_agent,

)

agent = DeepAgent(

    prompt="Assess the security posture of this web application.",

    sub_agents=[custom_capability],

)

# Now the Planner can choose `security_agent` as a capability in the plan.

```

### Example: simple function capability

```python

from pydantic_ai import RunContext

from pydantask.agents.agent import DeepAgent

from pydantask.models import CapabilityDescription, RuntimeState

async def my_utility_tool(ctx: RunContext[RuntimeState], payload: str) -> str:

    # do something simple with ctx.deps and payload

    return f"processed: {payload}"

utility_capability = CapabilityDescription(

    name="my_utility_tool",

    description="Utility capability that performs a simple transformation.",

    tool_func=my_utility_tool,

)

agent = DeepAgent(prompt="Some goal...", sub_agents=[utility_capability])

```

For more customization details, see:

- `docs/customization.md`

- `docs/tools.md`

- `docs/agents.md`

---

## Running Unit Tests

Tests live under the `test/` directory and are written to be compatible with both `pytest` and the standard library `unittest`.

### Recommended: pytest

From the repository root:

```bash

pip install pytest

pytest

```

### Using unittest directly

If you prefer `unittest`, you can still run the suite with:

```bash

python -m unittest discover -s test -p "test_*.py"

```

Make sure required environment variables (e.g. `TAVILY_API_KEY`, `OPENAI_API_KEY`) are set, or that tests patch them appropriately (as in `test/test_agent.py`).

---

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/georgedittmar/pydantask

Awesome Lists containing this project

README