https://github.com/milkstrawai/good_pipeline

DAG-based job pipeline orchestration for Rails, built on GoodJob.
https://github.com/milkstrawai/good_pipeline

activejob postgresql rails ruby ruby-on-rails

Last synced: 4 months ago
JSON representation

DAG-based job pipeline orchestration for Rails, built on GoodJob.

Host: GitHub
URL: https://github.com/milkstrawai/good_pipeline
Owner: milkstrawai
License: mit
Created: 2026-03-20T18:42:50.000Z (4 months ago)
Default Branch: main
Last Pushed: 2026-03-26T17:01:26.000Z (4 months ago)
Last Synced: 2026-03-27T05:59:30.649Z (4 months ago)
Topics: activejob, postgresql, rails, ruby, ruby-on-rails
Language: Ruby
Homepage: https://milkstrawai.github.io/good_pipeline/
Size: 1.31 MB
Stars: 1
Watchers: 0
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- License: LICENSE.txt
- Code of conduct: CODE_OF_CONDUCT.md

Awesome Lists containing this project

README

          # GoodPipeline

DAG-based job pipeline orchestration for Rails, built on [GoodJob](https://github.com/bensheldon/good_job).

Define multi-step workflows as directed acyclic graphs — not linear chains. Steps run in parallel when they can and wait for dependencies when they must. GoodPipeline handles dependency resolution, parallel execution, failure strategies, conditional branching, pipeline chaining, and lifecycle callbacks. It also ships with a web dashboard.

## Requirements

- Ruby >= 3.2

- Rails >= 7.1

- PostgreSQL

- GoodJob >= 3.10 with `preserve_job_records = true`

## Installation

Add to your Gemfile:

```ruby

gem "good_pipeline"

```

Then install the migrations:

```bash

bin/rails generate good_pipeline:install

bin/rails db:migrate

```

GoodPipeline requires GoodJob to preserve job records. Add this to your GoodJob configuration:

```ruby

# config/initializers/good_job.rb

GoodJob.preserve_job_records = true

```

GoodPipeline will raise `GoodPipeline::ConfigurationError` at boot if this is not set.

## Usage

### Defining a pipeline

Subclass `GoodPipeline::Pipeline` and implement `configure`. Use `run` to declare steps and `after:` to express dependencies:

```ruby

class VideoProcessingPipeline < GoodPipeline::Pipeline

  description "Downloads, transcodes and publishes a video"

  failure_strategy :halt

  on_complete :notify

  on_success :celebrate

  on_failure :alert

  def configure(video_id:)

    run :download,  DownloadJob,  with: { video_id: video_id }

    run :transcode, TranscodeJob, after: :download

    run :thumbnail, ThumbnailJob, after: :download

    run :publish,   PublishJob,   after: %i[transcode thumbnail]

    run :cleanup,   CleanupJob,   after: :publish

  end

  private

  def notify = Rails.logger.info("Pipeline complete")

  def celebrate = Rails.logger.info("All steps succeeded!")

  def alert = Rails.logger.warn("Pipeline had failures")

end

```

This produces the following DAG:

```mermaid

graph TD

  download --> transcode

  download --> thumbnail

  transcode --> publish

  thumbnail --> publish

  publish --> cleanup

```

### Running a pipeline

```ruby

VideoProcessingPipeline.run(video_id: 123)

```

### Step options

```ruby

run :step_key, JobClass,

  with:       { key: "value" },                # keyword args passed to the job

  after:      :other_step,                     # dependency (symbol or array of symbols)

  on_failure: :ignore,                         # step-level failure strategy override

  enqueue:    { queue: :media, priority: 10 }  # options passed to job.enqueue()

```

### Failure strategies

Set at the pipeline level with `failure_strategy`:

| Strategy | Behaviour |

|---|---|

| `:halt` (default) | Stop all pending steps when any step fails |

| `:continue` | Let independent branches continue; skip only blocked downstream steps |

| `:ignore` | Treat failures as successes for dependency resolution |

Per-step overrides via `on_failure:` in `run` apply to that step's outgoing edges only.

### Conditional branching

Use `branch` to take different paths at runtime based on application state:

```ruby

class MediaPipeline < GoodPipeline::Pipeline

  def configure(media_id:)

    run :analyze, AnalyzeJob, with: { media_id: media_id }

    branch :format_check, after: :analyze, by: :detect_format do

      on :hd do

        run :transcode_hd, TranscodeHDJob, with: { media_id: media_id }

        run :upscale, UpscaleJob, with: { media_id: media_id }, after: :transcode_hd

      end

      on :sd do

        run :transcode_sd, TranscodeSDJob, with: { media_id: media_id }

      end

    end

    run :publish, PublishJob, after: :format_check

  end

  private

  def detect_format

    Media.find(params[:media_id]).hd? ? :hd : :sd

  end

end

```

The `by:` method is evaluated at runtime when the branch step is reached. The matching arm runs; other arms are skipped. `after: :format_check` waits for whichever arm was chosen to complete.

Arms can also be empty for an if-without-else pattern:

```ruby

branch :quality_check, after: :analyze, by: :needs_processing do

  on :yes do

    run :process, ProcessJob

  end

  on :no  # skip — pipeline continues to next step

end

```

The dashboard renders branches as diamond decision nodes with labeled edges.

### Pipeline chaining

Chain pipelines together with `.then()`:

```ruby

# Serial chain

VideoProcessingPipeline

  .run(video_id: 123)

  .then(NotificationPipeline, with: { video_id: 123 })

# Fan-out

VideoProcessingPipeline

  .run(video_id: 123)

  .then(

    [NotificationPipeline, with: { video_id: 123 }],

    [AnalyticsPipeline,    with: { video_id: 123 }]

  )

# Parallel start with fan-in

GoodPipeline.run(

  [VideoProcessingPipeline, with: { video_id: 123 }],

  [AudioProcessingPipeline, with: { audio_id: 456 }]

).then(MergeMediaPipeline, with: { video_id: 123, audio_id: 456 })

```

If an upstream pipeline fails or halts, downstream pipelines are automatically skipped.

### Monitoring

```ruby

pipeline = VideoProcessingPipeline.run(video_id: 123)

pipeline.status     # => "running"

pipeline.terminal?  # => false

pipeline.steps      # => all step records

pipeline.params     # => { "video_id" => 123 }

# Query across pipelines

GoodPipeline::PipelineRecord.where(status: "failed")

GoodPipeline::PipelineRecord.where(type: "VideoProcessingPipeline")

```

### Lifecycle callbacks

```ruby

class MyPipeline < GoodPipeline::Pipeline

  on_complete :always_runs    # any terminal state

  on_success  :only_success   # pipeline succeeded

  on_failure  :only_failure   # pipeline failed or halted

end

```

Callbacks are dispatched asynchronously via a separate GoodJob job. They never block the coordinator or affect pipeline state.

## Dashboard

GoodPipeline includes a mountable web dashboard for inspecting pipeline executions:

```ruby

# config/routes.rb

mount GoodPipeline::Engine => "/good_pipeline"

```

The dashboard provides:

- Pipeline Executions: filterable list with status tabs and pipeline type dropdown

- Pipeline Details: steps table, DAG visualization, chain links, error info

- Pipeline Definitions: catalog of all pipeline types with their DAG structure

### Pipeline Executions

![Pipeline Executions](docs/screenshots/index.png)

### Pipeline Details

![Pipeline Details](docs/screenshots/show.png)

### Pipeline Definitions

![Pipeline Definitions](docs/screenshots/definitions.png)

No build step. Uses Pico CSS and Mermaid.js from CDN.

## Cleanup

GoodPipeline automatically cleans up old terminal pipelines when GoodJob runs its own cleanup cycle. No configuration needed, it uses GoodJob's retention period (default 14 days).

To configure the retention period, set GoodJob's option:

```ruby

# config/application.rb

config.good_job.cleanup_preserved_jobs_before_seconds_ago = 30.days.to_i

```

## Development

```bash

bin/setup

mise docker:start  # PostgreSQL

rake test

```

## Contributing

Bug reports and pull requests are welcome on GitHub at https://github.com/milkstrawai/good_pipeline.

## License

The gem is available as open source under the terms of the [MIT License](https://opensource.org/licenses/MIT).

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/milkstrawai/good_pipeline

Awesome Lists containing this project

README