Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/vigneshss-07/pyspark-acompleteguide
This repo explains pyspark modules in python. Used to deal with big data more practical handson.
https://github.com/vigneshss-07/pyspark-acompleteguide
pyspark pyspark-mllib pyspark-notebook pyspark-python pyspark-tutorial
Last synced: 5 days ago
JSON representation
This repo explains pyspark modules in python. Used to deal with big data more practical handson.
- Host: GitHub
- URL: https://github.com/vigneshss-07/pyspark-acompleteguide
- Owner: vigneshSs-07
- Created: 2021-09-13T12:04:31.000Z (about 3 years ago)
- Default Branch: main
- Last Pushed: 2023-06-14T13:49:13.000Z (over 1 year ago)
- Last Synced: 2024-11-15T19:47:25.331Z (5 days ago)
- Topics: pyspark, pyspark-mllib, pyspark-notebook, pyspark-python, pyspark-tutorial
- Language: Jupyter Notebook
- Homepage:
- Size: 1.86 MB
- Stars: 4
- Watchers: 1
- Forks: 3
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Spark_Pyspark
* http://spark.apache.org/docs/latest/api/python/reference/index.html
***Apache Spark using Python***
1. https://github.com/dgadiraju/itversity-books/tree/master/Data%20Engineering%20Bootcamp/46%20Apache%20Spark%20using%20Python
2. https://github.com/dgadiraju/itversity-books/tree/master/starterkits/spark/python1. A quick introduction to the Spark API
https://lnkd.in/g8Y3tdhX2. Overview of Spark - RDD, accumulators, broadcast variable
https://lnkd.in/g7fepuFF3. Spark SQL, Datasets, and DataFrames:
https://lnkd.in/g3iZp7zk4. PySpark - Processing data with Spark in Python
https://lnkd.in/gBnh6PAi5. Processing data with SQL on the command line
https://lnkd.in/ggnxDaUu6. Cluster Overview
https://lnkd.in/guCQnJnv7. Packaging and deploying applications
https://lnkd.in/gUZpi2P98. Customize Spark via its configuration system
https://lnkd.in/gZh8Vkmv9. Monitoring - Track the behavior of your applications
https://lnkd.in/grpGKFuP10. Best practices to optimize performance and memory use
https://lnkd.in/gTRYBDQu