Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/whitaker-io/machine
Machine is a workflow/pipeline library for processing data
https://github.com/whitaker-io/machine
codespaces github-actions golang golang-library golang-package golangci-lint mit-license pipeline pipeline-framework stream-processing streaming-data workflow workflow-engine
Last synced: 3 months ago
JSON representation
Machine is a workflow/pipeline library for processing data
- Host: GitHub
- URL: https://github.com/whitaker-io/machine
- Owner: whitaker-io
- License: mit
- Created: 2020-10-13T04:24:19.000Z (over 4 years ago)
- Default Branch: v3
- Last Pushed: 2024-10-22T03:59:57.000Z (3 months ago)
- Last Synced: 2024-10-23T06:01:25.422Z (3 months ago)
- Topics: codespaces, github-actions, golang, golang-library, golang-package, golangci-lint, mit-license, pipeline, pipeline-framework, stream-processing, streaming-data, workflow, workflow-engine
- Language: Go
- Homepage: https://pkg.go.dev/github.com/whitaker-io/machine
- Size: 1.39 MB
- Stars: 157
- Watchers: 7
- Forks: 12
- Open Issues: 4
-
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- Funding: .github/FUNDING.yml
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
- Codeowners: CODEOWNERS
Awesome Lists containing this project
- awesome-go - machine - Go library for writing and generating stream workers with built in metrics and traceability. (Stream Processing / HTTP Clients)
- zero-alloc-awesome-go - machine - Go library for writing and generating stream workers with built in metrics and traceability. (Stream Processing / HTTP Clients)
- awesome-go-extra - machine - 10-13T04:24:19Z|2022-08-02T20:19:44Z| (Stream Processing / HTTP Clients)
README
[![Go](https://github.com/whitaker-io/machine/actions/workflows/go.yml/badge.svg)](https://github.com/whitaker-io/machine/actions/workflows/go.yml)
[![CodeQL](https://github.com/whitaker-io/machine/actions/workflows/github-code-scanning/codeql/badge.svg)](https://github.com/whitaker-io/machine/actions/workflows/github-code-scanning/codeql)
[![PkgGoDev](https://pkg.go.dev/badge/github.com/whitaker-io/machine)](https://pkg.go.dev/github.com/whitaker-io/machine)
[![GoDoc](https://godoc.org/github.com/whitaker-io/machine?status.svg)](https://godoc.org/github.com/whitaker-io/machine)
[![Go Report Card](https://goreportcard.com/badge/github.com/whitaker-io/machine)](https://goreportcard.com/report/github.com/whitaker-io/machine)
[![Codacy Badge](https://app.codacy.com/project/badge/Grade/aa8efa7beb3f4e66a5dc0247e25557b5)](https://app.codacy.com/gh/whitaker-io/machine/dashboard?utm_source=gh&utm_medium=referral&utm_content=&utm_campaign=Badge_grade)
[![Codacy Badge](https://app.codacy.com/project/badge/Coverage/aa8efa7beb3f4e66a5dc0247e25557b5)](https://app.codacy.com/gh/whitaker-io/machine/dashboard?utm_source=gh&utm_medium=referral&utm_content=&utm_campaign=Badge_coverage)
[![Version Badge](https://img.shields.io/github/v/tag/whitaker-io/machine)](https://img.shields.io/github/v/tag/whitaker-io/machine)
`Machine` is a library for creating data workflows. These workflows can be either very concise or quite complex, even allowing for cycles for flows that need retry or self healing mechanisms.
------
### **Installation**
Add the primary library to your project
```bash
go get github.com/whitaker-io/machine/v3
```------
The two function types are:
```golang
// Monad is a function that is applied to payload and used for transformations
type Monad[T any] func(d T) T// Filter is a function that can be used to filter the payload.
type Filter[T any] func(d T) bool```
These are used in the `Machine` for functional operations
```golang
// New is a function for creating a new Machine.
//
// name string
// input chan T
// option ...Option
//
// Call the startFn returned by New to start the Machine once built.
func New[T any](name string, input chan T, options ...Option) (startFn func(context.Context), x Machine[T])// Transform is a function for converting the type of the Machine. Cannot be used inside a loop
// until I figure out how to do it without some kind of run time error or overly complex
// tracking method that isn't type safe. I really wish method level generics were a thing.
func Transform[T, U any](m Machine[T], fn func(d T) U) (Machine[U], error)// Machine is the interface provided for creating a data processing stream.
type Machine[T any] interface {
// Name returns the name of the Machine path. Useful for debugging or reasoning about the path.
Name() string
// Then apply a mutation to each individual element of the payload.
Then(a Monad[T]) Machine[T]// Recurse applies a recursive function to the payload through a Y Combinator.
// f is a function used by the Y Combinator to perform a recursion
// on the payload.
// Example:
//
// func(f Monad[int]) Monad[int] {
// return func(x int) int {
// if x <= 0 {
// return 1
// } else {
// return x * f(x-1)
// }
// }
// }
Recurse(x Monad[Monad[T]]) Machine[T]// Memoize applies a recursive function to the payload through a Y Combinator
// and memoizes the results based on the index func.
// f is a function used by the Y Combinator to perform a recursion
// on the payload.
// Example:
//
// func(f Monad[int]) Monad[int] {
// return func(x int) int {
// if x <= 0 {
// return 1
// } else {
// return x * f(x-1)
// }
// }
// }
Memoize(x Monad[Monad[T]], index func(T) string) Machine[T]// Or runs all of the functions until one succeeds or sends the payload to the right branch
Or(x ...Filter[T]) (Machine[T], Machine[T])// And runs all of the functions and if one doesnt succeed sends the payload to the right branch
And(x ...Filter[T]) (Machine[T], Machine[T])// Filter splits the data into multiple stream branches
If(f Filter[T]) (Machine[T], Machine[T])// Select applies a series of Filters to the payload and returns a list of Builders
// the last one being for any unmatched payloads.
Select(fns ...Filter[T]) []Machine[T]// Tee duplicates the data into multiple stream branches.
Tee(func(T) (a, b T)) (Machine[T], Machine[T])// While creates a loop in the stream based on the filter
While(x Filter[T]) (loop, out Machine[T])// Drop terminates the data from further processing without passing it on
Drop()// Distribute is a function used for fanout
Distribute(Edge[T]) Machine[T]
// Output provided channel
Output() chan T
}
````Distribute` is a special method used for fan-out operations. It takes an instance of `Edge[T]` and can be used most typically to distribute work via a Pub/Sub or it can be used in a commandline utility to handle user input or a similiar blocking process.
The `Edge[T]` interface is as follows:
```golang
// Edge is an interface that is used for transferring data between vertices
type Edge[T any] interface {
Output() chan T
Send(payload T)
}
```The `Send` method is used for data leaving the associated vertex and the `Output` method is used by the following vertex to receive data from the channel.
------
Confirguration is done using the `Option` helper
```golang
// Option is used to configure the machine
type Option interface// OptionFIF0 controls the processing order of the payloads
// If set to true the system will wait for one payload
// to be processed before starting the next.
var OptionFIF0 Option// OptionBufferSize sets the buffer size on the edge channels between the
// vertices, this setting can be useful when processing large amounts
// of data with FIFO turned on.
func OptionBufferSize(size int) Option// OptionAttributes apply the slog.Attr's to the machine metrics and spans
// Do not override the "name", "type", "duration", "error", or "value" attributes
func OptionAttributes(attributes ...slog.Attr) Option// OptionFlush attempts to send all data to the flushFN before exiting after the gracePeriod has expired
// Im looking for a good way to make this type specific, but want to avoid having to add separate option
// settings for the Transform function.
func OptionFlush(gracePeriod time.Duration, flushFN func(vertexName string, payload any)) Option
````Machine` supports collecting metrics and traces through a `log/slog` wrapper that sends
the telemetry to the provided [OpenTelemetry](https://github.com/open-telemetry/opentelemetry-go) `Meter` and `Tracer````golang
// import "github.com/whitaker-io/machine/telemetry"// Make your slog handler however you please
yourSlogHandler := slog.Default().Handler()// wrap your handler and provide your tracer and meter
telemetryHandler := telemetry.New(
yourSlogHandler,
meterProvider.Meter("your_meter"), // Your otel metric.Meter
tracerProvider.Tracer("your_tracer"), // Your otel trace.Tracer
false, // Log Metrics and Traces to logs as well (useful for debugging)
)slog.SetDefault(slog.New(telemetryHandler))
```Examples of `Edge` implentations can be found in the edge directory and can be used as follows
```golang
// import "github.com/whitaker-io/machine/edge/pubsub"func New[T any](
ctx context.Context,
subscription *pubsub.Subscription,
publisher *pubsub.Topic,
to func(T) *pubsub.Message,
from func(context.Context, *pubsub.Message) T,
) machine.Edge[T]// import "github.com/whitaker-io/machine/edge/http"
func New[T any](c http.Client, fn func(context.Context, T) *http.Request) machine.Edge[T]
```------
***
## 🤝 ContributingContributions, issues and feature requests are welcome.
Feel free to check [issues page](https://github.com/whitaker-io/machine/issues) if you want to contribute.
[Check the contributing guide](./CONTRIBUTING.md).## Author
👤 **Jonathan Whitaker**
- Twitter: [@io_whitaker](https://twitter.com/io_whitaker)
- Github: [@jonathan-whitaker](https://github.com/jonathan-whitaker)## Show your support
Please ⭐️ this repository if this project helped you!
***
## [License](#license)Machine is provided under the [MIT License](https://github.com/whitaker-io/machine/blob/master/LICENSE).
```text
The MIT License (MIT)Copyright (c) 2020 Jonathan Whitaker
```