https://github.com/longnguyen010203/spark-kafka-self-learning

📚🌊🎓 A third-year student is self-studying Spark and Kafka as part of their 👷 data engineering journey, with the goal of securing an 📬 internship or fresher job in 2024.
apache-kafka apache-spark cluster docker docker-compose zookeeper


# 🌃 Apache Kafka Self-Learning 🌊
A third-year student is self-studying Spark and Kafka as part of their data engineering journey, with the goal of securing an internship or fresher job in 2024.

## 📦 Technologies
- `Docker`
- `PostgreSQL`
- `Apache Spark`
- `Apache Kafka`
- `Zookeeper`

## 🔦 Architecture

### 1. Apache Spark

- `SparkContext`
- `Driver Program`
- `Cluster Manager`
- `Worker Node`
- `Executor`
- `Cache`
- `Task`
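
To make the terms above a little more concrete, here is a toy, pure-Python sketch of how a driver program might split a job into tasks and hand them to executors. It is not PySpark code: the thread pool only stands in for executors on worker nodes, the function names (`split_into_partitions`, `word_count_task`, `merge_counts`, `run_job`) are made up for illustration, and the cluster manager and cache are not modeled at all.

```python
from concurrent.futures import ThreadPoolExecutor


def split_into_partitions(data, num_partitions):
    """Driver side: slice the dataset into roughly equal partitions."""
    return [data[i::num_partitions] for i in range(num_partitions)]


def word_count_task(partition):
    """One task: count the words in a single partition."""
    counts = {}
    for line in partition:
        for word in line.split():
            counts[word] = counts.get(word, 0) + 1
    return counts


def merge_counts(partials):
    """Driver side: combine the per-task results into one answer."""
    total = {}
    for partial in partials:
        for word, n in partial.items():
            total[word] = total.get(word, 0) + n
    return total


def run_job(lines, num_partitions=2, num_executors=2):
    """Hypothetical driver: one task per partition, run on the pool."""
    partitions = split_into_partitions(lines, num_partitions)
    # Assumption: the thread pool plays the role of executors.
    with ThreadPoolExecutor(max_workers=num_executors) as pool:
        partials = list(pool.map(word_count_task, partitions))
    return merge_counts(partials)


lines = ["spark kafka", "kafka docker", "spark spark"]
result = run_job(lines)
print(result)
```

The point of the sketch is the division of labour: the driver plans and merges, while each task only ever sees its own partition.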

### 2. Apache Kafka

- `Producer`
- `Consumer`
- `Broker`
- `Cluster`
- `Topic`
- `Partition`
- `Offset`
- `Consumer-group`
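
The relationship between topics, partitions, offsets, and consumer groups can be sketched with a small in-memory model. This is only a teaching toy under stated assumptions, not a Kafka client: the `Topic` and `ConsumerGroup` classes are hypothetical, the key hash is deliberately simplistic, and the broker and cluster are reduced to a single Python object holding the logs.

```python
from collections import defaultdict


class Topic:
    """A named set of partitions; each partition is an append-only log."""

    def __init__(self, name, num_partitions):
        self.name = name
        self.partitions = [[] for _ in range(num_partitions)]

    def append(self, key, value):
        """Producer side: route a record by key and return (partition, offset)."""
        # Toy hash (assumption): the same key always maps to the same
        # partition, so records for one key keep their relative order.
        index = sum(key.encode()) % len(self.partitions)
        self.partitions[index].append(value)
        return index, len(self.partitions[index]) - 1


class ConsumerGroup:
    """Tracks one committed offset per partition for the whole group."""

    def __init__(self, topic):
        self.topic = topic
        self.committed = defaultdict(int)  # partition -> next offset to read

    def poll(self, partition):
        """Consumer side: return unread records and advance the offset."""
        log = self.topic.partitions[partition]
        records = log[self.committed[partition]:]
        self.committed[partition] = len(log)
        return records


topic = Topic("orders", num_partitions=2)
first = topic.append("a", "v1")
topic.append("b", "v2")
second = topic.append("a", "v3")

group = ConsumerGroup(topic)
batch = group.poll(first[0])
print(first, second, batch)
```

Because the group remembers its offset per partition, a second `poll` on the same partition returns only records appended since the previous call.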