An open API service indexing awesome lists of open source software.

https://github.com/jaki729/simple_kafkaairflowstream

Sample Data Pipeline
https://github.com/jaki729/simple_kafkaairflowstream

airflow docker kafka python3 sql

Last synced: 15 days ago
JSON representation

Sample Data Pipeline

Awesome Lists containing this project

README

          

# Simple_KafkaAirflowStream

KafkaAirflowStream is a project designed to integrate Kafka for real-time data streaming and Apache Airflow for orchestrating complex data pipelines. The system captures and processes streams of data, managing their flow from data generation, through Kafka's producer and broker architecture, to consumption and processing. Apache Airflow handles the scheduling and management of these processes, ensuring reliable and efficient operation. The data is stored in a MySQL database for further analysis. This project aims to create a robust, scalable, and Dockerized solution for handling and orchestrating real-time data streams with a focus on flexibility and automation.