Projects in Awesome Lists tagged with rdds
A curated list of projects in awesome lists tagged with rdds .
https://github.com/akshitvjain/realtime-twitter-trends-analytics
A big data project to develop a real-time data pipeline for analyzing the popularity and sentiments of trending topics on Twitter.
big-data business-intelligence data-pipeline drill dstream geo-visualization hashtags kafka kafka-producer-consumer mongodb parallel-data-processing rdds realtime-dashboard realtime-data-pipeline spark tableau twitter twitter-sentiment-analysis twitter-streaming-api zookeeper
Last synced: 11 Mar 2025
https://github.com/trainingbypackt/big-data-processing-with-apache-spark-elearning
Efficiently tackle large datasets and perform big data analysis with Spark and Python
dataset python rdds spark spark-mllib structured-streaming
Last synced: 10 Apr 2025
https://github.com/neha-dev-dot/pyspark-tutorial
This repository is part of my journey to learn **PySpark**, the Python API for Apache Spark. I explored the fundamentals of distributed data processing using Spark and practiced with real-world data transformation and querying use cases.
actions data-partitioning dataframes pyspark-basics pyspark-sql rdds sparkbasics sparkcontext sparksession transformations udfs window-functions
Last synced: 17 Apr 2026