Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/juanmanuel-tirado/pyspark-tutorial
This is a collection of PySpark tutorials
https://github.com/juanmanuel-tirado/pyspark-tutorial
jupyter-notebook machine-learning ml pyspark python spark tutorial
Last synced: 13 days ago
JSON representation
This is a collection of PySpark tutorials
- Host: GitHub
- URL: https://github.com/juanmanuel-tirado/pyspark-tutorial
- Owner: juanmanuel-tirado
- License: apache-2.0
- Created: 2024-01-25T11:48:59.000Z (11 months ago)
- Default Branch: main
- Last Pushed: 2024-02-13T21:15:34.000Z (10 months ago)
- Last Synced: 2024-10-28T13:21:47.905Z (about 2 months ago)
- Topics: jupyter-notebook, machine-learning, ml, pyspark, python, spark, tutorial
- Language: Jupyter Notebook
- Homepage: https://jmtirado.net
- Size: 641 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# pyspark-tutorial
This repo contains a collection of PySpark tutorials and examples.- [PySpark MLlib](pyspark-ml.ipynb)
Code examples of how to use Spark MLlib with PySpark.
- [PySpark FMA](pyspark_fma.ipynb)
Does a song with a long title have a longer duration? Learn how to answer this question using PySpark and the FMA dataset.