An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with rdds

A curated list of projects in awesome lists tagged with rdds.

https://github.com/trainingbypackt/big-data-processing-with-apache-spark-elearning

Efficiently tackle large datasets and perform big data analysis with Spark and Python

dataset python rdds spark spark-mllib structured-streaming

Last synced: 10 Apr 2025

https://github.com/aiwithqasim/pyspark_bigdata

Getting started with PySpark for big data analysis

graphx mllib pyspark python rdds

Last synced: 17 Mar 2025

https://github.com/neha-dev-dot/pyspark-tutorial

This repository is part of my journey to learn PySpark, the Python API for Apache Spark. I explored the fundamentals of distributed data processing using Spark and practiced with real-world data transformation and querying use cases.

actions data-partitioning dataframes pyspark-basics pyspark-sql rdds sparkbasics sparkcontext sparksession transformations udfs window-functions

Last synced: 17 Apr 2026

https://github.com/thanaraklee/pyspark-big-data-rdd-operations

This project illustrates Apache Spark RDD operations, from creation and transformation to actions and results, enhancing users' understanding of distributed data processing.

big-data pyspark python rdds spark

Last synced: 09 May 2026