An open API service indexing awesome lists of open source software.

https://github.com/webdevcaptain/pyspark-intro

Introduction to PySpark
https://github.com/webdevcaptain/pyspark-intro

pyspark

Last synced: 3 months ago
JSON representation

Introduction to PySpark

Awesome Lists containing this project

README

        

# Introduction to PySpark

PySpark is a Python API for Apache Spark. It offers a faster, more flexible alternative to the traditional MapReduce framework.

## Contents
1. [RDD Notebook](./pyspark-primer.ipynb)
2. [Pyspark SQL](./pyspark-intro.ipynb)

## References

- [PySpark](https://spark.apache.org/docs/latest/api/python/index.html)