{"id":26667913,"url":"https://github.com/Jeffail/benthos","last_synced_at":"2025-03-25T20:01:15.784Z","repository":{"id":37004143,"uuid":"54435102","full_name":"redpanda-data/connect","owner":"redpanda-data","description":"Fancy stream processing made operationally mundane","archived":false,"fork":false,"pushed_at":"2025-03-21T14:25:32.000Z","size":35117,"stargazers_count":8276,"open_issues_count":539,"forks_count":857,"subscribers_count":119,"default_branch":"main","last_synced_at":"2025-03-24T08:12:12.979Z","etag":null,"topics":["amqp","cqrs","data-engineering","data-ops","etl","event-sourcing","go","golang","kafka","logs","message-bus","message-queue","nats","rabbitmq","stream-processing","stream-processor","streaming-data"],"latest_commit_sha":null,"homepage":"https://docs.redpanda.com/redpanda-connect/about/","language":"Go","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/redpanda-data.png","metadata":{"files":{"readme":"README.md","changelog":"CHANGELOG.md","contributing":null,"funding":null,"license":"licenses/Apache-2.0.txt","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":"SECURITY.md","support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null},"funding":{"github":"Jeffail","open_collective":"benthos"}},"created_at":"2016-03-22T01:18:48.000Z","updated_at":"2025-03-23T09:26:44.000Z","dependencies_parsed_at":"2024-01-05T20:28:03.362Z","dependency_job_id":"19e9fd6c-6772-4c68-b270-3ac6a8a4031e","html_url":"https://github.com/redpanda-data/connect","commit_stats":{"total_commits":4928,"total_committers":294,"mean_commits":"16.761904761904763","dds":0.2715097402597403,"last_synced_commit":"ede148a6288755816653e2ce6d618fd0c695fa83"},"previous_names":["jeffail/benthos","redpanda-data/connect","benthosdev/connect"],"tags_count":
550,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/redpanda-data%2Fconnect","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/redpanda-data%2Fconnect/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/redpanda-data%2Fconnect/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/redpanda-data%2Fconnect/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/redpanda-data","download_url":"https://codeload.github.com/redpanda-data/connect/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":245535485,"owners_count":20631297,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["amqp","cqrs","data-engineering","data-ops","etl","event-sourcing","go","golang","kafka","logs","message-bus","message-queue","nats","rabbitmq","stream-processing","stream-processor","streaming-data"],"created_at":"2025-03-25T20:00:48.601Z","updated_at":"2025-03-25T20:01:15.672Z","avatar_url":"https://github.com/redpanda-data.png","language":"Go","readme":"Redpanda Connect\n================\n\n[![Build Status][actions-badge]][actions-url]\n\nAPI for Apache V2 builds: [![godoc for redpanda-data/connect ASL][godoc-badge]][godoc-url-apache]\n\nAPI for Enterprise builds: [![godoc for redpanda-data/connect RCL][godoc-badge]][godoc-url-enterprise]\n\nRedpanda Connect is a high performance and resilient stream processor, able to connect various [sources][inputs] and [sinks][outputs] in 
a range of brokering patterns and perform [hydration, enrichments, transformations and filters][processors] on payloads.\n\nIt comes with a [powerful mapping language][bloblang-about], is easy to deploy and monitor, and ready to drop into your pipeline either as a static binary or docker image, making it cloud native as heck.\n\nRedpanda Connect is declarative, with stream pipelines defined in as few as a single config file, allowing you to specify connectors and a list of processing stages:\n\n```yaml\ninput:\n  gcp_pubsub:\n    project: foo\n    subscription: bar\n\npipeline:\n  processors:\n    - mapping: |\n        root.message = this\n        root.meta.link_count = this.links.length()\n        root.user.age = this.user.age.number()\n\noutput:\n  redis_streams:\n    url: tcp://TODO:6379\n    stream: baz\n    max_in_flight: 20\n```\n\n### Delivery Guarantees\n\nDelivery guarantees [can be a dodgy subject](https://youtu.be/QmpBOCvY8mY). Redpanda Connect processes and acknowledges messages using an in-process transaction model with no need for any disk-persisted state, so when connecting to at-least-once sources and sinks it's able to guarantee at-least-once delivery even in the event of crashes, disk corruption, or other unexpected server faults.\n\nThis behaviour is the default and free of caveats, which also makes deploying and scaling Redpanda Connect much simpler.\n\n## Supported Sources \u0026 Sinks\n\nAWS (DynamoDB, Kinesis, S3, SQS, SNS), Azure (Blob storage, Queue storage, Table storage), GCP (Pub/Sub, Cloud Storage, BigQuery), Kafka, NATS (JetStream, Streaming), NSQ, MQTT, AMQP 0.9.1 (RabbitMQ), AMQP 1, Redis (streams, list, pubsub, hashes), Cassandra, Elasticsearch, HDFS, HTTP (server and client, including websockets), MongoDB, SQL (MySQL, PostgreSQL, ClickHouse, MSSQL), and [you know what just click here to see them all, they don't fit in a README][about-categories].\n\n## Documentation\n\nIf you want to dive fully into Redpanda Connect then don't 
waste your time in this dump, check out the [documentation site][general-docs].\n\nFor guidance on building your own custom plugins in Go check out [the public APIs](https://pkg.go.dev/github.com/redpanda-data/benthos/v4/public/service).\n\n## Install\n\nInstall on Linux:\n\n```shell\ncurl -LO https://github.com/redpanda-data/redpanda/releases/latest/download/rpk-linux-amd64.zip\nunzip rpk-linux-amd64.zip -d ~/.local/bin/\n```\n\nOr use Homebrew:\n\n```shell\nbrew install redpanda-data/tap/redpanda\n```\n\nOr pull the docker image:\n\n```shell\ndocker pull docker.redpanda.com/redpandadata/connect\n```\n\nFor more information check out the [getting started guide][getting-started].\n\n## Run\n\n```shell\nrpk connect run ./config.yaml\n```\n\nOr, with docker:\n\n```shell\n# Using a config file\ndocker run --rm -v /path/to/your/config.yaml:/connect.yaml docker.redpanda.com/redpandadata/connect run\n\n# Using a series of -s flags\ndocker run --rm -p 4195:4195 docker.redpanda.com/redpandadata/connect run \\\n  -s \"input.type=http_server\" \\\n  -s \"output.type=kafka\" \\\n  -s \"output.kafka.addresses=kafka-server:9092\" \\\n  -s \"output.kafka.topic=redpanda_topic\"\n```\n\n## Monitoring\n\n### Health Checks\n\nRedpanda Connect serves two HTTP endpoints for health checks:\n- `/ping` can be used as a liveness probe as it always returns a 200.\n- `/ready` can be used as a readiness probe as it serves a 200 only when both the input and output are connected, otherwise a 503 is returned.\n\n### Metrics\n\nRedpanda Connect [exposes lots of metrics][metrics] to StatsD, Prometheus, a JSON HTTP endpoint, [and more][metrics].\n\n### Tracing\n\nRedpanda Connect also [emits OpenTelemetry tracing events][tracers], which can be used to visualise the processors within a pipeline.\n\n## Configuration\n\nRedpanda Connect provides lots of tools for making configuration discovery, debugging and organisation easy. 
You can [read about them here][config-doc].\n\n## Build\n\nBuild with Go (any [currently supported version](https://go.dev/dl/)):\n\n```shell\ngit clone git@github.com:redpanda-data/connect\ncd connect\nmake\n```\n\n## Lint\n\nRedpanda Connect uses [golangci-lint][golangci-lint] for linting, which you can install with:\n\n```shell\ncurl -sSfL https://raw.githubusercontent.com/golangci/golangci-lint/master/install.sh | sh -s -- -b $(go env GOPATH)/bin\n```\n\nAnd then run it with `make lint`.\n\n## Plugins\n\nIt's pretty easy to write your own custom plugins for Redpanda Connect in Go. For more information check out [the API docs](https://pkg.go.dev/github.com/redpanda-data/benthos/v4/public/service), and for inspiration there's an [example repo][plugin-repo] demonstrating a variety of plugin implementations.\n\n## Extra Plugins\n\nBy default Redpanda Connect does not build with components that require linking to external libraries, such as the `zmq4` input and output. If you wish to build Redpanda Connect locally with these dependencies then set the build tag `x_benthos_extra`:\n\n```shell\n# With go\ngo install -tags \"x_benthos_extra\" github.com/redpanda-data/connect/v4/cmd/redpanda-connect@latest\n\n# Using make\nmake TAGS=x_benthos_extra\n```\n\nNote that this tag may change or be broken out into granular tags for individual components outside of major version releases. If you attempt a build and these dependencies are not present you'll see error messages such as `ld: library not found for -lzmq`.\n\n## Docker Builds\n\nThere's a multi-stage `Dockerfile` for creating a Redpanda Connect docker image which results in a minimal image from scratch. You can build it with:\n\n```shell\nmake docker\n```\n\nThen use the image:\n\n```shell\ndocker run --rm \\\n\t-v /path/to/your/benthos.yaml:/config.yaml \\\n\t-v /tmp/data:/data \\\n\t-p 4195:4195 \\\n\tdocker.redpanda.com/redpandadata/connect run /config.yaml\n```\n\n## Contributing\n\nContributions are welcome! 
To prevent CI errors, please always make sure a pull request has been:\n\n- Unit tested with `make test`\n- Linted with `make lint`\n- Formatted with `make fmt`\n\nNote: most integration tests need to spin up Docker containers, so they are skipped by `make test`. You can trigger\nthem individually via `go test -run \"^Test.*Integration.*$\" ./internal/impl/\u003cconnector directory\u003e/...`.\n\n[inputs]: https://docs.redpanda.com/redpanda-connect/components/inputs/about\n[about-categories]: https://docs.redpanda.com/redpanda-connect/about#components\n[processors]: https://docs.redpanda.com/redpanda-connect/components/processors/about\n[outputs]: https://docs.redpanda.com/redpanda-connect/components/outputs/about\n[metrics]: https://docs.redpanda.com/redpanda-connect/components/metrics/about\n[tracers]: https://docs.redpanda.com/redpanda-connect/components/tracers/about\n[config-interp]: https://docs.redpanda.com/redpanda-connect/configuration/interpolation\n[streams-api]: https://docs.redpanda.com/redpanda-connect/guides/streams_mode/streams_api\n[streams-mode]: https://docs.redpanda.com/redpanda-connect/guides/streams_mode/about\n[general-docs]: https://docs.redpanda.com/redpanda-connect/about\n[bloblang-about]: https://docs.redpanda.com/redpanda-connect/guides/bloblang/about\n[config-doc]: https://docs.redpanda.com/redpanda-connect/configuration/about\n[releases]: https://github.com/redpanda-data/connect/releases\n[plugin-repo]: https://github.com/redpanda-data/redpanda-connect-plugin-example\n[getting-started]: https://docs.redpanda.com/redpanda-connect/guides/getting_started\n\n[godoc-badge]: https://pkg.go.dev/badge/github.com/redpanda-data/benthos/v4/public\n[godoc-url-apache]: https://pkg.go.dev/github.com/redpanda-data/connect/public/bundle/free/v4\n[godoc-url-enterprise]: https://pkg.go.dev/github.com/redpanda-data/connect/public/bundle/enterprise/v4\n[actions-badge]: 
https://github.com/redpanda-data/connect/actions/workflows/test.yml/badge.svg\n[actions-url]: https://github.com/redpanda-data/connect/actions/workflows/test.yml\n\n[golangci-lint]: https://golangci-lint.run/\n[jaeger]: https://www.jaegertracing.io/\n","funding_links":["https://github.com/sponsors/Jeffail","https://opencollective.com/benthos"],"categories":["开源类库","Messaging","Table of Contents","_Table of Contents_","消息传递","Open source library","Go","消息","消息系统","Relational Databases","机器学习"],"sub_categories":["流处理","Search and Analytic Databases","Streaming Library","On-Prem","Advanced Console UIs","交流","Stream Processing","检索及分析资料库","SQL 查询语句构建库"],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2FJeffail%2Fbenthos","html_url":"https://awesome.ecosyste.ms/projects/github.com%2FJeffail%2Fbenthos","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2FJeffail%2Fbenthos/lists"}