https://github.com/dagucloud/dagu

A local-first workflow engine built the way it should be: declarative, file-based, self-contained, air-gapped ready. One binary that scales from laptop to distributed cluster. Persistent Workflow Operator handles creating and debugging workflows.
https://github.com/dagucloud/dagu
agentic-workflow ai-agents continuous-delivery cron dag data-pipeline devops directed-acyclic-graph durable-execution durable-workflows human-in-the-loop job-scheduler task-automation task-scheduler workflow-engine workflow-management workflow-orchestration workflow-scheduler
Last synced: about 2 months ago
JSON representation
Host: GitHub
URL: https://github.com/dagucloud/dagu
Owner: dagucloud
License: gpl-3.0
Created: 2022-04-22T13:00:42.000Z (about 4 years ago)
Default Branch: main
Last Pushed: 2026-05-17T10:25:16.000Z (about 2 months ago)
Last Synced: 2026-05-17T11:17:01.201Z (about 2 months ago)
Topics: agentic-workflow, ai-agents, continuous-delivery, cron, dag, data-pipeline, devops, directed-acyclic-graph, durable-execution, durable-workflows, human-in-the-loop, job-scheduler, task-automation, task-scheduler, workflow-engine, workflow-management, workflow-orchestration, workflow-scheduler
Language: Go
Homepage: https://dagu.sh/oss
Size: 121 MB
Stars: 3,402
Watchers: 20
Forks: 264
Open Issues: 117
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- Funding: .github/FUNDING.yml
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
- Security: SECURITY.md
- Agents: AGENTS.md
Awesome Lists containing this project

awesome-go - dagucloud/dagu
awesome-ccamel - dagucloud/dagu - Local-first workflow engine with a Web UI for small teams. Define DAGs in a declarative YAML format. Self-contained and no DBMS required. Use any AI agent to manage your DAGs. (Go)
README

          


  

  

    

    

    

    

  


  


    Docs |

    Examples |

    Workflow Schema |

    Support & Community

  



## Local-first Control Plane for Existing Ops Automation and AI Agent Workflows

Dagu gives your automation a home. Run your existing scripts, containers, SQL jobs, and HTTP calls as visible, governed workflows with schedules, retries, logs, artifacts, human-in-the-loop, and observability without Airflow-level complexity.

Keep your existing automation as shell scripts, Python scripts, containers, SSH commands, SQL jobs, HTTP calls, and AI harnesses. Define the workflow in plain YAML, run it with one binary, and get the operational layer that cron and ad hoc scripts are missing: dependencies, retries, queues, logs, artifacts, approvals, API/webhooks, optional distributed workers, and pinned external CLI tools for reproducible runs.

Dagu is local-first by default. API keys, private repositories, internal documents, customer data, Slack logs, and agent artifacts can stay inside your own machine or infrastructure instead of being handed to a cloud automation SaaS.

For a quick look at how workflows are defined, see the [examples](https://docs.dagu.sh/writing-workflows/examples). For a compact repository-level map of the YAML shape and current `run:`, `action:`, and `actions:` syntax, see the [Workflow Schema at a Glance](./README_SCHEMA.md).



  



| Run Details | Step Logs | Documents |

|---|---|---|

| ![Run details in dark mode](./assets/images/readme-run-details-dark.png) | ![Workflow logs in dark mode](./assets/images/readme-logs-dark.png) | ![Workflow documents in dark mode](./assets/images/readme-documents-dark.png) |

**Try it live:** [Live Demo](https://dagu-demo-f5e33d0e.dagu.sh) (credentials: `demouser` / `demouser`)

## Why Dagu?

```sh

  Traditional Orchestrator          Dagu

  ┌────────────────────────┐        ┌──────────────────┐

  │  Web Server            │        │                  │

  │  Scheduler             │        │  dagu start-all  │

  │  Worker(s)             │        │                  │

  │  PostgreSQL            │        └──────────────────┘

  │  Redis / RabbitMQ      │         Single binary.

  │  Python Runtime        │         Self-hosted by default.

  └────────────────────────┘         Adds scheduling, retries, and approvals around existing automation.

    6+ services to manage

```

Dagu was built for teams that need lightweight, self-hosted workflow orchestration without databases, brokers, or framework rewrites.

## Performance

Dagu stores state in local files by default. How much it can run depends on the machine and the workload. CPU, disk speed, workflow duration, queue settings, and worker capacity all matter.

- **Throughput:** On one machine, Dagu can run thousands of workflow runs per day when the hardware and workflow shape fit the workload.

- **Load control:** Use [queues](https://docs.dagu.sh/server-admin/queues), concurrency limits, and optional [distributed workers](https://docs.dagu.sh/server-admin/distributed/) to decide how many runs execute at once and where they run.

## Real-World Use Cases

Dagu is useful when teams need to consolidate scripts, cron jobs, server tasks, containers, data jobs, and approval-gated operational work into one visible, governed workflow system without rewriting the underlying automation.

**Cron and legacy script management.** Run existing shell scripts, Python scripts, HTTP calls, and scheduled jobs without rewriting them. Dagu turns hidden cron jobs into visible workflows with dependencies, run status, logs, retries, approvals, and history.

**ETL and data operations.** Run PostgreSQL, SQLite, or DuckDB queries, S3 transfers, `jq` transforms, validation steps, and reusable sub-workflows. Pin portable tools such as `jq` or `yq` in the DAG so workers do not depend on whatever version happens to be installed. Daily data workflows stay declarative, observable, and easy to retry when one step fails.

**Media conversion.** Run `ffmpeg`, thumbnail extraction, audio normalization, image processing, and other compute-heavy jobs. Conversion work can run across distributed workers while status, history, logs, and artifacts stay in one persistence layer for monitoring, debugging, and retries.

**Infrastructure and server automation.** Coordinate SSH backups, cleanup jobs, deploy scripts, patch windows, precondition checks, lifecycle hooks, and manual approvals. Remote operations get schedules, retries, notifications, and per-step logs without requiring operators to SSH into servers for every recovery.

**GitHub-driven workflows.** Trigger Dagu runs from GitHub events, PR comments, releases, check reruns, `workflow_dispatch`, or `repository_dispatch` while the actual DAG executes on your own licensed Dagu server. This is useful for PR validation on private infrastructure, preview deployments, release workflows, and handing off long-running jobs from GitHub Actions to Dagu.

**Container and Kubernetes workflows.** Compose workflows where each step can run a Docker image, Kubernetes Job, shell command, or validation step. Image-based tasks can be routed to the right workers without building a custom control plane around containers.

**Customer support automation.** Run diagnostics, account repair jobs, data checks, and approval-gated support actions from a simple Web UI. Non-engineers can run reviewed workflows while engineers keep commands, logs, and results traceable.

**IoT and edge workflows.** Run sensor polling, local cleanup, offline sync, health checks, and device maintenance jobs on small devices. The single binary and file-backed state work well on edge devices while still providing visibility through the Web UI.

## Quick Start

### Install

**macOS/Linux:**

```sh

curl -fsSL https://raw.githubusercontent.com/dagucloud/dagu/main/scripts/installer.sh | bash

```

**Homebrew:**

```sh

brew install dagu

```

**Windows (PowerShell):**

```powershell

irm https://raw.githubusercontent.com/dagucloud/dagu/main/scripts/installer.ps1 | iex

```

**Docker:**

```sh

docker run --rm -v ~/.dagu:/var/lib/dagu -p 8080:8080 ghcr.io/dagucloud/dagu:latest dagu start-all

```

**Kubernetes (Helm):**

```sh

helm repo add dagu https://dagucloud.github.io/dagu

helm repo update

helm install dagu dagu/dagu --set persistence.storageClass=

```

> Replace `` with a StorageClass that supports `ReadWriteMany`. See [charts/dagu/README.md](./charts/dagu/README.md) for chart configuration.

The script installers run a guided wizard that can add Dagu to your PATH, set it up as a background service, and create the initial admin account. Homebrew, npm, Docker, and Helm install without the wizard. See [Installation docs](https://docs.dagu.sh/getting-started/installation) for all options.

### Create and run a workflow

Create a YAML file with your workflow definition. For example, create `review-readme.yaml` with the following content:

```yaml

working_dir: ${DAG_RUN_ARTIFACTS_DIR}

params:

  - name: REPO_URL

    type: string

    required: true

artifacts:

  enabled: true

steps:

  - id: review_pr

    action: agent.run

    with:

      task: |

        Review the README.md file in ${REPO_URL}.

        Return concise Markdown findings.

      max_iterations: 10

    stdout: ${DAG_RUN_ARTIFACTS_DIR}/review.md

  - id: approval

    action: noop

    depends: [review_pr]

    approval:

      prompt: Review the review.md artifact. Approve to post an issue with the findings, or reject to skip.

  - id: post_issue

    run: gh issue create --title "Review Findings" --body-file "${DAG_RUN_ARTIFACTS_DIR}/review.md"

    depends: [approval]

```

This workflow uses the built-in [`agent.run` action](https://docs.dagu.sh/features/agent/step) and assumes a default model is configured in Agent Settings. It reviews a GitHub repository's README file, saves the agent's final output as an artifact, and then prompts for manual [approval](https://docs.dagu.sh/writing-workflows/yaml-specification#approval) before posting an issue with the findings. GitHub CLI (`gh`) is used in the final step to create the issue.

Run the workflow with:

```sh

dagu start review-readme.yaml -- params REPO_URL=

```

### Start the server

```sh

dagu start-all

```

Visit http://localhost:8080

### Install the Dagu skill for AI coding tools

Install the Dagu authoring skill for Claude Code, Codex, Gemini CLI, and other AI coding tools:

```sh

gh skill install dagucloud/dagu dagu

```

## Deployment Models

Dagu can run on one machine, as a self-hosted production service, as a full managed Dagu Cloud server, or as a hybrid deployment with private workers inside your infrastructure.

Need the full breakdown, tradeoffs, and architecture notes? See the [Deployment Models guide](https://docs.dagu.sh/overview/deployment-models).

  

    

      Local Single-Server


      

    

    

      Self-Hosted


      

    

  

  

    

      Dagu Cloud


      

    

    

      Hybrid


      

    

  

| Model | Server | Execution | Best for |

|------|--------|-----------|----------|

| **Local single-server** | `dagu start-all` on one machine. | Same machine. | Development, small scheduled workloads, edge jobs, and simple internal automation. |

| **Self-hosted** | Dagu server on your infrastructure. | Local execution or distributed workers on your infrastructure. | Teams that need ownership of networking, secrets, storage, runtime, and upgrade timing. |

| **Dagu Cloud** | Full managed Dagu server in a dedicated, isolated gVisor instance on GKE. | Managed instance. | Teams that want Dagu operated for them without running the server themselves. |

| **Hybrid** | Full managed Dagu Cloud server. | Private workers in your infrastructure over mTLS. | Docker steps, private networks, custom runtimes, secrets-heavy jobs, and data-local work. |

### Licensing and Cloud

- **Community self-host:** GPLv3. No license key required. You operate the server, storage, upgrades, networking, and workers. Start with the [installation guide](https://docs.dagu.sh/getting-started/installation/).

- **Self-host license:** Adds SSO, RBAC, and audit logging to self-hosted Dagu. Licenses apply to Dagu servers, not workers, so execution can scale across your own infrastructure. See [self-host licensing](https://dagu.sh/pricing#self-host).

- **Dagu Cloud managed instance:** Includes its own managed license. It can run workflows directly as a full Dagu server, and private workers can also run on your infrastructure using a worker mTLS bundle. See [Dagu Cloud](https://dagu.sh/cloud).

Managed Dagu Cloud instances do not expose a Docker daemon or Docker socket. Workflows that need Docker step execution should use self-hosted Dagu or a private worker with Docker access.

## Architecture

Dagu can run in three configurations:

**Standalone:** A single `dagu start-all` process runs the HTTP server, scheduler, and executor. Suitable for single-machine deployments.

**Coordinator/Worker:** The scheduler enqueues jobs to a local file-based queue, then dispatches them to a coordinator over gRPC. Workers long-poll the coordinator for tasks, execute DAGs locally, and report status back. Workers can run on separate machines and are routed tasks based on labels.

**Headless:** Run without the web UI (`DAGU_HEADLESS=true`). Useful for CI/CD environments or when Dagu is managed through the CLI or API only.

```sh

Standalone:

  ┌─────────────────────────────────────────┐

  │  dagu start-all                         │

  │  ┌───────────┐ ┌───────────┐ ┌────────┐ │

  │  │ HTTP / UI │ │ Scheduler │ │Executor│ │

  │  └───────────┘ └───────────┘ └────────┘ │

  │  File-based storage (logs, state, queue)│

  └─────────────────────────────────────────┘

Distributed:

  ┌────────────┐                   ┌────────────┐

  │ Scheduler  │                   │ HTTP / UI  │

  │            │                   │            │

  │ ┌────────┐ │                   └─────┬──────┘

  │ │ Queue  │ │  Dispatch (gRPC)        │ Dispatch / GetWorkers

  │ │(file)  │ │─────────┐               │ (gRPC)

  │ └────────┘ │         │               │

  └────────────┘         ▼               ▼

                    ┌─────────────────────────┐

                    │      Coordinator        │

                    │  ┌───────────────────┐  │

                    │  │ Dispatch Task     │  │

                    │  │ Store (pending/   │  │

                    │  │ claimed)          │  │

                    │  └───────────────────┘  │

                    └────────▲────────────────┘

                             │

                   Worker poll / task response

                   Heartbeat / ReportStatus /

                   StreamLogs (gRPC)

                             │

               ┌─────────────┴─────────────┐

               │             │             │

          ┌────┴───┐    ┌────┴───┐    ┌────┴───┐

          │Worker 1│    │Worker 2│    │Worker N│ Sandbox execution of DAGs

          │        │    │        │    │        │

          └────────┘    └────────┘    └────────┘

```

## Embedded Go API (Experimental)

Go applications can import Dagu and start DAG runs from the host process:

```go

import "github.com/dagucloud/dagu"

```

```go

engine, err := dagu.New(ctx, dagu.Options{

	HomeDir: "/var/lib/myapp/dagu",

})

if err != nil {

	return err

}

defer engine.Close(context.Background())

run, err := engine.RunYAML(ctx, []byte(`

name: embedded

params:

  - MESSAGE

steps:

  - name: hello

    run: echo "${MESSAGE}"

`), dagu.WithParams(map[string]string{

	"MESSAGE": "hello from the host app",

}))

if err != nil {

	return err

}

status, err := run.Wait(ctx)

if err != nil {

	return err

}

fmt.Println(status.Status)

```

The embedded API is experimental and may change before it is declared stable. It uses Dagu's YAML loader, built-in executors, and file-backed state. `RunFile` and `RunYAML` start runs asynchronously and return a run handle for `Wait`, `Status`, and `Stop`. Distributed embedded runs require an existing Dagu coordinator; embedded workers can be started with `NewWorker`.

See the [embedded API documentation](https://docs.dagu.sh/embedding/go-api) and [examples/embedded](./examples/embedded).

## Workflow Examples

### Parallel execution with dependencies

```yaml

steps:

  - id: extract

    run: ./extract.sh

  - id: transform_a

    run: ./transform_a.sh

    depends: [extract]

  - id: transform_b

    run: ./transform_b.sh

    depends: [extract]

  - id: load

    run: ./load.sh

    depends: [transform_a, transform_b]

```

```mermaid

%%{init: {'theme': 'base', 'themeVariables': {'background': '#18181B', 'primaryTextColor': '#fff', 'lineColor': '#888'}}}%%

graph LR

    A[extract] --> B[transform_a]

    A --> C[transform_b]

    B --> D[load]

    C --> D

    style A fill:#18181B,stroke:#22C55E,stroke-width:1.6px,color:#fff

    style B fill:#18181B,stroke:#22C55E,stroke-width:1.6px,color:#fff

    style C fill:#18181B,stroke:#22C55E,stroke-width:1.6px,color:#fff

    style D fill:#18181B,stroke:#3B82F6,stroke-width:1.6px,color:#fff

```

### Sequential execution

```yaml

steps:

  - id: step_1

    run: echo "Step 1"

  - id: step_2

    run: echo "Step 2"

    depends: [step_1]

```

### Reproducible external tools

```yaml

tools:

  - jqlang/jq@jq-1.7.1

steps:

  - id: inspect

    run: jq --version

  - id: transform

    run: jq '.items[] | .name' data.json

    depends: [inspect]

```

Dagu installs declared portable CLIs before the DAG run, exposes them on `PATH` for host command steps, and caches them on each worker. You do not need to install a separate tool manager; Dagu remains a single binary. See the [Tools documentation](https://docs.dagu.sh/writing-workflows/tools) for package syntax, immutable refs, distributed worker behavior, sub-DAG scoping, and current limitations.

### Docker step

```yaml

steps:

  - name: build

    container:

      image: node:20-alpine

    run: npm run build

```

### Kubernetes Pod execution

```yaml

steps:

  - name: batch-job

    action: kubernetes.run

    with:

      namespace: production

      image: my-registry/batch-processor:latest

      resources:

        requests:

          cpu: "2"

          memory: "4Gi"

      command: ./process.sh

```

### SSH remote execution

```yaml

steps:

  - name: deploy

    action: ssh.run

    with:

      host: prod-server.example.com

      user: deploy

      key: ~/.ssh/id_rsa

      command: cd /var/www && git pull && systemctl restart app

```

### Sub-DAG composition

```yaml

steps:

  - name: extract

    action: dag.run

    with:

      dag: etl/extract

      params:

        SOURCE: s3://bucket/data.csv

  - name: transform

    action: dag.run

    with:

      dag: etl/transform

      params:

        INPUT: ${extract.outputs.result}

    depends: [extract]

  - name: load

    action: dag.run

    with:

      dag: etl/load

      params:

        DATA: ${transform.outputs.result}

    depends: [transform]

```

### Retry and error handling

```yaml

steps:

  - name: flaky-api-call

    run: curl -f https://api.example.com/data

    retry_policy:

      limit: 3

      interval_sec: 10

    continue_on:

      failure: true

```

### Scheduling with overlap control and catch-up

```yaml

schedule:

  - "0 */6 * * *"          # Every 6 hours

overlap_policy: skip       # Skip if previous run is still active

catchup_window: "5h"       # Catch up missed runs when scheduler is down for up to 5 hours

  

timeout_sec: 3600

handler_on:

  failure:

    run: notify-team.sh

  exit:

    run: cleanup.sh

```

### Built-in agent step with manual approval

```yaml

steps:

  - id: review

    action: agent.run

    with:

      task: Review the README.md file and return concise Markdown findings.

      max_iterations: 10

    stdout: ${DAG_RUN_ARTIFACTS_DIR}/review.md

  - id: approval

    action: noop

    depends: [review]

    approval:

      prompt: Review the review.md artifact. Approve to post an issue with the findings, or reject to skip.

  - id: post_issue

    run: gh issue create --title "Review Findings" --body-file "${DAG_RUN_ARTIFACTS_DIR}/review.md"

    depends: [approval]

```

For more examples, see the [Examples documentation](https://docs.dagu.sh/writing-workflows/examples).

## Built-in and Custom Actions

Dagu includes built-in actions that run within the Dagu process (or worker). Local shell commands use `run`.

| Field / Action | Purpose |

|----------|---------|

| [`run:` field](https://docs.dagu.sh/step-types/shell) | Local shell commands and scripts (bash, sh, PowerShell, custom shells) |

| [`docker.run`](https://docs.dagu.sh/step-types/docker) | Run containers with registry auth, volume mounts, and resource limits |

| [`kubernetes.run` / `k8s.run`](https://docs.dagu.sh/step-types/kubernetes) | Execute Kubernetes Jobs with namespace, image, and resource settings |

| [`ssh.run`](https://docs.dagu.sh/step-types/ssh) | Remote command execution over SSH |

| [`sftp.upload` / `sftp.download`](https://docs.dagu.sh/step-types/sftp) | File transfer over SFTP |

| [`http.request`](https://docs.dagu.sh/step-types/http) | HTTP requests with headers, auth, and request bodies |

| [`postgres.query`](https://docs.dagu.sh/step-types/sql/postgresql) / [`sqlite.query`](https://docs.dagu.sh/step-types/sql/sqlite) / `duckdb.query` | SQL queries, imports, and exports for PostgreSQL, SQLite, and DuckDB |

| [`redis.`](https://docs.dagu.sh/step-types/redis) | Redis commands, pipelines, and Lua scripts |

| [`s3.upload` / `s3.download` / `s3.list` / `s3.delete`](https://docs.dagu.sh/step-types/s3) | Upload, download, list, and delete S3 objects |

| `file.stat` / `file.read` / `file.write` / `file.copy` / `file.move` / `file.delete` / `file.mkdir` / `file.list` | Local file operations without shell commands |

| [`jq.filter`](https://docs.dagu.sh/step-types/jq) | JSON transformation using jq expressions |

| [`archive.create` / `archive.extract` / `archive.list`](https://docs.dagu.sh/step-types/archive) | Create and extract zip/tar archives |

| [`wait.duration` / `wait.until` / `wait.file` / `wait.http`](https://docs.dagu.sh/step-types/wait) | Wait for time, file state, or HTTP readiness |

| [`mail.send`](https://docs.dagu.sh/step-types/mail) | Send email via SMTP |

| [`template.render`](https://docs.dagu.sh/step-types/template) | Text generation with template rendering |

| [`router.route`](https://docs.dagu.sh/step-types/router) | Conditional step routing based on values and patterns |

| [`dag.run`](https://docs.dagu.sh/writing-workflows/control-flow) | Invoke another DAG as a sub-workflow with params and dependencies |

| [`dag.enqueue`](https://docs.dagu.sh/writing-workflows/control-flow) | Queue another DAG asynchronously and continue after enqueue |

| [`harness.run`](https://docs.dagu.sh/step-types/harness) | Run coding agent CLIs such as Claude Code, Codex, Copilot, OpenCode, and Pi |

| [`agent.run`](https://docs.dagu.sh/features/agent/step) | Built-in agent action with tool use |

You can also define reusable actions with the top-level `actions` field. Custom actions expand to built-in actions during DAG load, so you can wrap a common shell, HTTP, SQL, or other pattern behind a typed interface with validated input.

```yaml

actions:

  webhook.send:

    input_schema:

      type: object

      additionalProperties: false

      required: [url, text]

      properties:

        url:

          type: string

        text:

          type: string

    template:

      action: http.request

      with:

        method: POST

        url: {{ .input.url }}

        headers:

          Content-Type: application/json

        body: |

          {"text": {{ json .input.text }}}

steps:

  - action: webhook.send

    with:

      url: https://hooks.example.com/ops

      text: deploy complete

```

See [Custom Actions](https://docs.dagu.sh/writing-workflows/custom-step-types) for the feature guide and [YAML Specification](https://docs.dagu.sh/writing-workflows/yaml-specification) for the exact `actions`, `action`, and `run` field behavior.

## Security and Access Control

### Authentication

Dagu supports three top-level authentication modes, configured via `DAGU_AUTH_MODE`:

- **`none`** — No authentication

- **`basic`** — HTTP Basic authentication

- **`builtin`** — JWT-based authentication with user management, API keys, per-DAG webhook tokens, and optional OIDC/SSO integration

### Role-Based Access Control

When using `builtin` auth, five roles control access:

| Role | Capabilities |

|------|-------------|

| `admin` | Full access including user management |

| `manager` | Create, edit, delete, run, stop DAGs; view audit logs |

| `developer` | Create, edit, delete, run, stop DAGs |

| `operator` | Run and stop DAGs only (no editing) |

| `viewer` | Read-only access |

API keys can be created with independent role assignments. Audit logging tracks all actions.

### TLS and Secrets

- TLS for the HTTP server (`DAGU_CERT_FILE`, `DAGU_KEY_FILE`)

- Mutual TLS for gRPC coordinator/worker communication (`DAGU_PEER_CERT_FILE`, `DAGU_PEER_KEY_FILE`, `DAGU_PEER_CLIENT_CA_FILE`)

- Secret management with three providers: environment variables, files, and [HashiCorp Vault](https://www.vaultproject.io/)

### Production Hardening

For self-hosted production deployments, treat network exposure and execution boundaries as the primary controls:

- Prefer `auth.mode: builtin` for any shared or network-exposed instance. Use `basic` only for simple private setups, and avoid `none` outside isolated local development.

- Keep `metrics: private` unless the metrics endpoint is reachable only on a trusted private network.

- Bind Dagu to loopback or a private interface when possible. If you must use `0.0.0.0`, place it behind a trusted reverse proxy, TLS, and network-level access controls.

- Leave `terminal.enabled: false` unless the instance is admin-only and tightly scoped.

- In distributed deployments, set `peer.insecure=false` and configure peer TLS when coordinator and workers communicate across host or network boundaries.

- Treat Docker socket mounts, root containers, and host-level executors as privileged access to the underlying machine.

See [Server Configuration](https://docs.dagu.sh/server-admin/server), [Docker deployment](https://docs.dagu.sh/server-admin/deployment/docker), and [Distributed execution](https://docs.dagu.sh/server-admin/distributed/) for the operator-focused guidance.

## Observability

### Prometheus Metrics

Dagu exposes Prometheus-compatible metrics:

- `dagu_info` — Build information (version, Go version)

- `dagu_uptime_seconds` — Server uptime

- `dagu_dag_runs_total` — Total DAG runs by status

- `dagu_dag_runs_total_by_dag` — Per-DAG run counts

- `dagu_dag_run_duration_seconds` — Histogram of run durations

- `dagu_dag_runs_currently_running` — Active DAG runs

- `dagu_dag_runs_queued_total` — Queued runs

- `dagu_workers_registered` — Registered distributed workers

- `dagu_worker_info` — Worker heartbeat labels as key/value metadata

- `dagu_worker_heartbeat_timestamp_seconds` — Last worker heartbeat timestamp

- `dagu_worker_health_status` — Worker health by heartbeat freshness

- `dagu_worker_pollers` — Worker poller capacity by state

- `dagu_worker_running_tasks` — Running tasks per worker

- `dagu_worker_oldest_running_task_age_seconds` — Age of the oldest running task per worker

### Structured Logging

JSON or text format logging (`DAGU_LOG_FORMAT`). Logs are stored per-run with separate stdout/stderr capture per step.

### Notifications

- Slack and Telegram bot integration for run monitoring and status updates

- Email notifications on DAG success, failure, or wait status via SMTP

- Per-DAG webhook endpoints with token authentication

## Artifacts

![Artifact browser in dark mode](./assets/images/readme-artifacts-dark.png)

Dagu runs can write arbitrary files into `DAG_RUN_ARTIFACTS_DIR`, and Dagu stores them per run as [Artifacts](https://docs.dagu.sh/writing-workflows/artifacts). In the [Web UI](https://docs.dagu.sh/overview/web-ui), operators can browse the file tree, preview Markdown, text, and image files inline, and download any artifact when they need the raw file.

This is useful for generated reports, screenshots, charts, exported JSON or CSV files, and other outputs that do not fit simple key/value [outputs](https://docs.dagu.sh/writing-workflows/outputs).

See the [Artifacts documentation](https://docs.dagu.sh/writing-workflows/artifacts) and the [Web UI guide](https://docs.dagu.sh/overview/web-ui) for the full artifact browser workflow and screenshots.

## Scheduling and Reliability

- **Cron scheduling** with timezone support and multiple schedule entries per DAG

- **Overlap policies**: `skip` (default — skip if previous run is still active), `all` (queue all), `latest` (keep only the most recent)

- **Catch-up scheduling**: Automatically runs missed intervals when the scheduler was down

- **Zombie detection**: Identifies and handles stalled DAG runs (configurable interval, default 45s)

- **Retry policies**: Per-step retry with configurable limits, intervals, and exit code filtering

- **Lifecycle hooks**: `onInit`, `onSuccess`, `onFailure`, `onAbort`, `onExit`, `onWait`

- **Preconditions**: Gate DAG or step execution on shell command results

- **High availability**: Scheduler lock with stale detection for failover

## Distributed Execution

The coordinator/worker architecture distributes DAG execution across multiple machines:

- **Coordinator**: gRPC server that manages task distribution, worker registry, and health monitoring

- **Workers**: Connect to the coordinator, pull tasks from the queue, execute DAGs locally, report results

- **Worker labels**: Route DAGs to specific workers based on labels (e.g., `gpu=true`, `region=us-east-1`)

- **Health checks**: HTTP health endpoints on coordinator and workers for load balancer integration

- **Queue system**: File-based persistent queue with configurable concurrency limits

```sh

# Start coordinator

dagu coordinator

# Start workers (on separate machines)

DAGU_WORKER_LABELS=gpu=true,memory=64G dagu worker

```

See the [distributed execution documentation](https://docs.dagu.sh/server-admin/distributed/) for setup details.

## CLI Reference

| Command | Description |

|---------|-------------|

| `dagu start ` | Execute a DAG |

| `dagu start-all` | Start HTTP server + scheduler + coordinator |

| `dagu server` | Start HTTP server only |

| `dagu scheduler` | Start scheduler only |

| `dagu coordinator` | Start coordinator (distributed mode) |

| `dagu worker` | Start worker (distributed mode) |

| `dagu stop ` | Stop a running DAG |

| `dagu restart ` | Restart a DAG |

| `dagu retry --run-id= ` | Retry a failed run |

| `dagu dry ` | Dry run — show what would execute |

| `dagu status ` | Show DAG run status |

| `dagu history ` | Show execution history |

| `dagu validate ` | Validate DAG YAML |

| `dagu enqueue ` | Add DAG to the execution queue |

| `dagu dequeue  [--dag-run=:]` | Remove a DAG-run from the queue |

| `dagu cleanup ` | Clean up old run data |

| `dagu migrate history` | Migrate legacy run history |

| `dagu version` | Show version |

## Environment Variables

**Precedence:** Command-line flags > Environment variables > Configuration file (`~/.config/dagu/config.yaml`)

### Server

| Variable | Default | Description |

|----------|---------|-------------|

| `DAGU_HOST` | `127.0.0.1` | Bind address |

| `DAGU_PORT` | `8080` | HTTP port |

| `DAGU_BASE_PATH` | — | Base path for reverse proxy |

| `DAGU_HEADLESS` | `false` | Run without web UI |

| `DAGU_TZ` | — | Timezone (e.g., `Asia/Tokyo`) |

| `DAGU_LOG_FORMAT` | `text` | `text` or `json` |

| `DAGU_CERT_FILE` | — | TLS certificate |

| `DAGU_KEY_FILE` | — | TLS private key |

### Paths

| Variable | Default | Description |

|----------|---------|-------------|

| `DAGU_HOME` | — | Overrides all path defaults |

| `DAGU_DAGS_DIR` | `~/.config/dagu/dags` | DAG definitions directory |

| `DAGU_LOG_DIR` | `~/.local/share/dagu/logs` | Log files |

| `DAGU_DATA_DIR` | `~/.local/share/dagu/data` | Application state |

| `DAGU_TOOLS_DIR` | `{DAGU_DATA_DIR}/tools` | Managed DAG tool cache |

### Authentication

| Variable | Default | Description |

|----------|---------|-------------|

| `DAGU_AUTH_MODE` | `builtin` | `none`, `basic`, or `builtin` |

| `DAGU_AUTH_BASIC_USERNAME` | — | Basic auth username |

| `DAGU_AUTH_BASIC_PASSWORD` | — | Basic auth password |

| `DAGU_AUTH_TOKEN_SECRET` | (auto) | JWT signing secret |

| `DAGU_AUTH_TOKEN_TTL` | `24h` | JWT token lifetime |

OIDC variables: `DAGU_AUTH_OIDC_CLIENT_ID`, `DAGU_AUTH_OIDC_CLIENT_SECRET`, `DAGU_AUTH_OIDC_ISSUER`, `DAGU_AUTH_OIDC_SCOPES`, `DAGU_AUTH_OIDC_WHITELIST`, `DAGU_AUTH_OIDC_AUTO_SIGNUP`, `DAGU_AUTH_OIDC_DEFAULT_ROLE`, `DAGU_AUTH_OIDC_ALLOWED_DOMAINS`.

### Scheduler

| Variable | Default | Description |

|----------|---------|-------------|

| `DAGU_SCHEDULER_PORT` | `8090` | Health check port |

| `DAGU_SCHEDULER_ZOMBIE_DETECTION_INTERVAL` | `45s` | Zombie run detection interval (`0` to disable) |

| `DAGU_SCHEDULER_LOCK_STALE_THRESHOLD` | `30s` | HA lock stale threshold |

| `DAGU_QUEUE_ENABLED` | `true` | Enable queue system |

### Coordinator / Worker

| Variable | Default | Description |

|----------|---------|-------------|

| `DAGU_COORDINATOR_HOST` | `127.0.0.1` | Coordinator bind address |

| `DAGU_COORDINATOR_PORT` | `50055` | Coordinator gRPC port |

| `DAGU_COORDINATOR_HEALTH_PORT` | `8091` | Coordinator health check port |

| `DAGU_WORKER_ID` | — | Worker instance ID |

| `DAGU_WORKER_MAX_ACTIVE_RUNS` | `100` | Max concurrent runs per worker |

| `DAGU_WORKER_HEALTH_PORT` | `8092` | Worker health check port |

| `DAGU_WORKER_LABELS` | — | Worker labels (`key=value,key=value`) |

### Peer TLS (gRPC)

| Variable | Default | Description |

|----------|---------|-------------|

| `DAGU_PEER_CERT_FILE` | — | Peer TLS certificate |

| `DAGU_PEER_KEY_FILE` | — | Peer TLS private key |

| `DAGU_PEER_CLIENT_CA_FILE` | — | CA for client verification |

| `DAGU_PEER_INSECURE` | `true` | Use h2c instead of TLS |

### Git Sync

| Variable | Default | Description |

|----------|---------|-------------|

| `DAGU_GITSYNC_ENABLED` | `false` | Enable Git sync |

| `DAGU_GITSYNC_REPOSITORY` | — | Repository URL |

| `DAGU_GITSYNC_BRANCH` | `main` | Branch to sync |

| `DAGU_GITSYNC_AUTH_TYPE` | `token` | `token` or `ssh` |

| `DAGU_GITSYNC_AUTOSYNC_ENABLED` | `false` | Enable periodic auto-pull |

| `DAGU_GITSYNC_AUTOSYNC_INTERVAL` | `300` | Sync interval in seconds |

Full configuration reference: [docs.dagu.sh/server-admin/reference](https://docs.dagu.sh/server-admin/reference)

## Documentation

- [Getting Started](https://docs.dagu.sh/getting-started/installation) — Installation and first workflow

- [Writing Workflows](https://docs.dagu.sh/writing-workflows/examples) — YAML syntax, scheduling, execution control

- [Workflow Schema at a Glance](./README_SCHEMA.md) — Repository-level overview of the current YAML schema

- [Tools](https://docs.dagu.sh/writing-workflows/tools) — Pin external CLI packages in DAGs for reproducible host command steps

- [Actions](https://docs.dagu.sh/step-types/shell) — [Shell](https://docs.dagu.sh/step-types/shell), [Docker](https://docs.dagu.sh/step-types/docker), [Kubernetes](https://docs.dagu.sh/step-types/kubernetes), [HTTP](https://docs.dagu.sh/step-types/http), [SQL](https://docs.dagu.sh/step-types/sql/), [Harness](https://docs.dagu.sh/step-types/harness), [Agent Step](https://docs.dagu.sh/features/agent/step), and [Custom Actions](https://docs.dagu.sh/writing-workflows/custom-step-types)

- [Distributed Execution](https://docs.dagu.sh/server-admin/distributed/) — Coordinator/worker setup

- [Authentication](https://docs.dagu.sh/server-admin/authentication/) — RBAC, OIDC, API keys

- [Git Sync](https://docs.dagu.sh/server-admin/git-sync) — Version-controlled DAG definitions

- [GitHub Integration](https://docs.dagu.sh/github-integration/) — Trigger Dagu runs from GitHub events, PR comments, releases, checks, and dispatches

- [AI Agent](https://docs.dagu.sh/features/agent/) — AI-assisted workflow authoring

- [Changelog](https://docs.dagu.sh/overview/changelog)

## Community

- [Discord](https://discord.gg/gpahPUjGRk) — Questions and discussion

- [GitHub Issues](https://github.com/dagucloud/dagu/issues) — Bug reports and feature requests

- [Bluesky](https://bsky.app/profile/dagu-org.bsky.social)

## Development

**Prerequisites:** [Go 1.26+](https://go.dev/doc/install), [Node.js](https://nodejs.org/en/download/), [pnpm](https://pnpm.io/installation)

```sh

git clone https://github.com/dagucloud/dagu.git && cd dagu

make build    # Build frontend + Go binary

make test     # Run tests with race detection

make lint     # Run golangci-lint

```

See [CONTRIBUTING.md](./CONTRIBUTING.md) for development workflow and code standards.

## Acknowledgements



  Premium Sponsors

  

    

  

  
Supporters

  

    

      

    

    

      

    

    

      

    

    

      

    

    

      

    

    

      

    

  


  



  

    

  



## Contributing

We welcome contributions of all kinds. See our [Contribution Guide](./CONTRIBUTING.md) for details.



  



## License

GNU GPLv3 - See [LICENSE](./LICENSE). See [LICENSING.md](./LICENSING.md) for embedded API and commercial embedding notes.
ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/dagucloud/dagu

Awesome Lists containing this project

README

Premium Sponsors

Supporters