https://github.com/huy-dataguy/reddit-genai-data-platform
(In Progess) - Real-time Reddit trend and sentiment analysis using Lakehouse Architecture with Kafka, Spark, Gemini 2.5, Iceberg, Grafana, and Airflow.
https://github.com/huy-dataguy/reddit-genai-data-platform
aiflow kafka lakehouse pipeline real-time reddit spark-streaming
Last synced: about 2 months ago
JSON representation
(In Progess) - Real-time Reddit trend and sentiment analysis using Lakehouse Architecture with Kafka, Spark, Gemini 2.5, Iceberg, Grafana, and Airflow.
- Host: GitHub
- URL: https://github.com/huy-dataguy/reddit-genai-data-platform
- Owner: huy-dataguy
- Created: 2025-07-01T16:00:36.000Z (3 months ago)
- Default Branch: main
- Last Pushed: 2025-08-09T09:32:53.000Z (2 months ago)
- Last Synced: 2025-08-09T11:35:19.702Z (2 months ago)
- Topics: aiflow, kafka, lakehouse, pipeline, real-time, reddit, spark-streaming
- Language: Python
- Homepage:
- Size: 93.8 KB
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 2
-
Metadata Files:
- Readme: README.md