Projects in Awesome Lists tagged with data-stream-processing
A curated list of projects in awesome lists tagged with data-stream-processing .
https://github.com/ArroyoSystems/arroyo
Distributed stream processing engine in Rust
data data-stream-processing dev-tools infrastructure kafka rust sql stream-processing stream-processing-engine
Last synced: 26 Mar 2025
https://github.com/superstreamlabs/memphis
Memphis.dev is a highly scalable and effortless data streaming platform
data data-engineering data-pipeline data-stream-processing data-streaming enrichment golang kubernetes message-broker message-bus message-queue messaging-queue microservices schema-registry
Last synced: 12 Jan 2026
https://github.com/Fengrui-Liu/StreamAD
Online anomaly detection for data streams/ Real-time anomaly detection for time series data.
anomaly-detection data-stream-processing data-streams incremental-learning machine-learning online-learning outlier-detection python real-time-detection streaming time-series timeseries
Last synced: 26 Mar 2025
https://github.com/marcosschroh/faust-docker-compose-example
Faust dockerized application
data-stream-processing docker faust kafka schema-registry zookeeper
Last synced: 21 Mar 2025
https://github.com/zmyzheng/tweetmap
A real time Tweet Trend Map and Sentiment Analysis web application with kafka, Angular, Spring Boot, Flink, Elasticsearch, Kibana, Docker and Kubernetes deployed on the cloud
angular aws bootstrap cloud data-stream-processing docker elaticsearch flink kafka kibana kubernetes microservice sentiment-analysis spring-boot
Last synced: 01 May 2025
https://github.com/boilingdata/boilstream
Diskless high-throughput data ingestion server with local DuckDB caching. Writes optimised Parquet files onto Object Storages.
analytics data-stream-processing data-stream-processing-framework data-streams duckdb ducklake object-storage rust sql
Last synced: 02 Sep 2025
https://github.com/anqorithm/realtime-stockstream
RealTime StockStream is a streamlined, simulation system for processing live stock market data. It uses Apache Kafka for data input, Apache Spark for data handling, and Apache Cassandra for data storage, making it a powerful yet easy-to-use tool for financial data analysis
apache-spark apache-sparksql asynchronous bigdata cassendra data-stream-processing databases docker docker-compose kafka python realtime spark spark-master spark-streaming stock-market stocks zookeeper
Last synced: 31 Oct 2025
https://github.com/ahmetfurkandemir/flink-example
Flink Example
apache apache-flink data-engineering data-stream-processing debezium debezium-client debezium-connector debeziumkafkaconnector docker flink flink-sql flink-stream-processing flink-streaming kafka kafka-streams scala scala2 ubuntu
Last synced: 20 Aug 2025
https://github.com/ppatierno/kafka-hybrid-iot
Apache Kafka for the Hybrid IoT
amqp apache-camel apache-kafka data-stream-processing data-streaming internet-of-things iot iot-application iot-cloud kafka-streams messaging vertx
Last synced: 18 Mar 2025
https://github.com/daq-tools/lorrystream
A lightweight and polyglot stream-processing library, to be used as a data backplane-, message relay-, or pipeline-subsystem.
amqp broker cratedb data-stream data-stream-processing data-stream-processing-framework data-streaming kotori-daq message-broker message-bus message-queue mosquitto mqtt pandas sqlalchemy stream streaming streamz zeromq zmq
Last synced: 30 Apr 2025
https://github.com/northern-64bit/riodb-headless-client
A headless client for RioDB.
data-stream-processing headless-client java
Last synced: 15 Mar 2025
https://github.com/hrialan/m2ds-data-stream-project
Markets stocks prediction using Kafka and online regression tools
data-stream-processing kafka machine-learning online-learning
Last synced: 03 Jan 2026
https://github.com/datasystemsgrouput/process-discovery-over-unordered-streams
A Flink library to implement both a buffer-based and a speculative out-of-order event arrival handlers for online process discovery
data-stream-processing out-of-order process-mining
Last synced: 16 Jan 2026
https://github.com/mauriciovazquezm/spark_bigdata_architecture_project
Final project for the course 'Architecture for Large Data Volumes', taught in the Bachelor's program in Data Science at ITAM
data-stream-processing data-streaming pyspark python spark time-series
Last synced: 28 Nov 2025
https://github.com/pyladiesams/kafka-clients-processing-data-streams-aug2025
Learn how to set up a Kafka producer client, then process the data to make it ready for downstream consumers. Discuss the basic of Kafka and get a handle on the different ways to process the data.
data-stream-processing data-streaming kafka kafka-consumer kafka-producer
Last synced: 20 Jul 2025
https://github.com/eli64s/pyflink-poc
PyFlink data stream processing utilities 🐿
apache-flink data-stream-processing data-streaming data-streams pyflink real-time-data
Last synced: 08 Apr 2025
https://github.com/ayemunhossain/stream-buffers-in-nodejs
This project focuses on implementing and demonstrating how stream and buffer works along together in nodejs.
buffers data-stream-processing data-streaming nodejs nodejs-buffers nodejs-streams stream streams-buffers
Last synced: 24 Mar 2025