Projects in Awesome Lists tagged with parquet-storage
A curated list of projects in awesome lists tagged with parquet-storage.
https://github.com/lilivalgo/smartcity
Simulates a real-time Smart City data pipeline with Kafka, Apache Spark, and S3. Streams and processes vehicle, GPS, weather, traffic, and emergency data using Dockerized components, with Parquet storage for efficient, scalable data engineering.
apache-spark aws-s3 data-pipeline distributed-processing docker parquet-storage real-time-streaming
Last synced: 04 Feb 2026