Projects in Awesome Lists by bakdata
A curated list of projects in awesome lists by bakdata .
https://github.com/bakdata/aws-lambda-r-runtime
Serverless execution of R code on AWS Lambda
Last synced: 10 Apr 2025
https://github.com/bakdata/fluent-kafka-streams-tests
Fluent Kafka Streams Test with Java
avro java kafka-streams test-framework testing testing-framework
Last synced: 10 Apr 2025
https://github.com/bakdata/kafka-large-message-serde
A Kafka Serde that reads and writes records from and to Blob storage (S3, Azure, Google) transparently.
azure-blob-storage deserialization google-cloud-storage kafka kafka-streams large-data s3 serde serialization simple-storage-service
Last synced: 10 Apr 2025
https://github.com/bakdata/streams-explorer
Explore Apache Kafka data pipelines in Kubernetes.
apache-kafka data-pipelines data-stream hacktoberfest kafka-connect kafka-streams kubernetes python react
Last synced: 10 Apr 2025
https://github.com/bakdata/streams-bootstrap
Utility functions and base classes for Kafka Streams applications
helm kafka kafka-streams kubernetes stream-processing
Last synced: 10 Apr 2025
https://github.com/bakdata/kafka-key-value-store
Queryable Kafka Topics with Kafka Streams.
Last synced: 28 Aug 2025
https://github.com/bakdata/dedupe
Java DSL for (online) deduplication
data-cleaning data-cleansing deduplication duplicate-detection duplicate-removal
Last synced: 10 Apr 2025
https://github.com/bakdata/kafka-error-handling
A library for error handling in Kafka Streams.
Last synced: 10 Apr 2025
https://github.com/bakdata/conversational-search-bot
Conversational search bot for book and movie recommendations
Last synced: 10 Apr 2025
https://github.com/bakdata/kpops
Deploy Kafka pipelines to Kubernetes
kafka kafka-connect kafka-streams kubernetes pipelines stream-processing
Last synced: 03 Apr 2025
https://github.com/bakdata/kafka-profile-store-demo
A Kafka Streams application that creates a queryable profile store
Last synced: 25 Jul 2025
https://github.com/bakdata/quick
The Fastest Way to Create Live Data Products
apache-kafka data-stream graphql kafka-streams rest-api
Last synced: 10 Apr 2025
https://github.com/bakdata/citus-k8s-membership-manager
A manager for connecting Citus worker nodes with the Citus master node.
citusdb kubernetes postgres python
Last synced: 10 Jul 2025
https://github.com/bakdata/rebalancing-demo
Repository that showcases problems with Kafka rebalancing and explains how to fix them. Please visit our blog article to learn what Kafka rebalancing is, what can cause problems, and how to solve them. https://medium.com/bakdata/solving-my-weird-kafka-rebalancing-problems-c05e99535435
kafka kafka-streams kubernetes rebalancing
Last synced: 17 Aug 2025
https://github.com/bakdata/common-kafka-streams-demo
Demo Kafka Streaming NLP pipeline for the blogpost "Continuous NLP Pipelines with Python, Java, and Apache Kafka"
kafka kafka-streams topologies
Last synced: 09 Jul 2025
https://github.com/bakdata/kserve-client
A Java client for KServe inference services
Last synced: 10 Apr 2025
https://github.com/bakdata/faust-avro-serializer
An improved version of https://github.com/marcosschroh/python-schema-registry-client/
Last synced: 10 Apr 2025
https://github.com/bakdata/faust-bootstrap
Utility functions and base classes for Faust applications
Last synced: 10 Apr 2025
https://github.com/bakdata/ci-templates
Collection of reusable workflows and composite actions for Github
Last synced: 25 Feb 2026
https://github.com/bakdata/kafka-streams-kserve-demo
A demo to accompany our blogpost "Scalable Machine Learning with Kafka Streams and KServe"
kafka kafka-streams kserve kubeflow kubernetes
Last synced: 10 Apr 2025
https://github.com/bakdata/kafka-brute-force-serde
A Kafka SerDe to deserialize messages of an unknown serialization format
Last synced: 10 Apr 2025
https://github.com/bakdata/connection_plugin
Templating for Airflow connections
Last synced: 10 Apr 2025
https://github.com/bakdata/gradle-avro-dependency-plugin
A Gradle plugin that lets you compile Apache Avro schemas to Java classes and supports dependencies
Last synced: 04 Mar 2026
https://github.com/bakdata/s3-backed-serde-demo
Demo of S3-backed Kafka Streams SerDe
Last synced: 10 Apr 2025
https://github.com/bakdata/sub-topologies-demo
Demo application for the Blogpost "Optimizing Kafka Streams Topologies running on Kubernetes"
kafka kafka-streams keda topologies
Last synced: 10 Apr 2025
https://github.com/bakdata/kafka-connect-resetter
An application to reset the state of Kafka Connect connectors
Last synced: 15 Mar 2026
https://github.com/bakdata/faust-large-message-serializer
A Faust Serializer that reads and writes records from and to S3 or Azure Blob Storage transparently.
Last synced: 10 Apr 2025
https://github.com/bakdata/seq2
This library is a extension to the great jOOλ library (https://github.com/jOOQ/jOOL)
Last synced: 10 Apr 2025
https://github.com/bakdata/python-keycloak-oauth
Keycloak authentication for Python projects with optional integrations for FastAPI & Starlette-Admin
fastapi keycloak starlette-admin
Last synced: 10 Apr 2025
https://github.com/bakdata/hpi-ii-project-2022
Repository containing the code for the assignment of the Information Integration course in SoSe 2022
elasticsearch kafka kafka-connect
Last synced: 10 Apr 2025
https://github.com/bakdata/kafka-dead-letter-analyzer
A Kafka Streams application that analyzes dead letters in your Kafka cluster
Last synced: 23 Feb 2026
https://github.com/bakdata/nginx-kafka-request-routing
Scaling Requests to Queryable Apache Kafka Topics with nginx
Last synced: 10 Aug 2025
https://github.com/bakdata/bakdata-code-styles
The default code styles + tools to setup for the major IDEs
Last synced: 10 Apr 2025
https://github.com/bakdata/generic-avro-reflect
Allows generic types to be serialized in avro
Last synced: 10 Apr 2025
https://github.com/bakdata/kafka-connect-plugins
A collection Kafka Connect plugins
Last synced: 12 Jul 2025
https://github.com/bakdata/rclone-helm-chart
Helm chart for deploying an rclone CronJob
Last synced: 21 Jan 2026
https://github.com/bakdata/pipeline-atm-fraud
Demo pipeline for ATM fraud detection using streams-bootstrap
Last synced: 10 Apr 2025
https://github.com/bakdata/gradle-plugins
Convenience plugins that orchestrate and preconfigure other plugins for release management or code quality.
Last synced: 28 Jan 2026
https://github.com/bakdata/kafka-streams-state-store-demo
Kafka Streams project demonstrating state store usage
Last synced: 10 Apr 2025
https://github.com/bakdata/ix-spark-article
Apache Spark Demo on Wikipedia Data
Last synced: 08 Feb 2026
https://github.com/bakdata/bakdata-project-templates
Project templates for bakdata OSS projects
Last synced: 15 Jun 2025
https://github.com/bakdata/template-python-poetry
Template for open source Python Poetry projects
Last synced: 10 Apr 2025
https://github.com/bakdata/quick-examples
Quick demos and examples. Quick enables you to easily build scalable stream data applications. Quick empowers your applications with modern API backends and unlocks your stream data through GraphQL.(https://d9p.io)
Last synced: 10 Apr 2025
https://github.com/bakdata/timely-autoscaling-demo
Kafka Summit 2023 - Timely Autoscaling of Complex Kafka Streams Pipelines
Last synced: 10 Apr 2025