https://github.com/trainingbypackt/big-data-processing-with-apache-spark-elearning
Efficiently tackle large datasets and perform big data analysis with Spark and Python
https://github.com/trainingbypackt/big-data-processing-with-apache-spark-elearning
dataset python rdds spark spark-mllib structured-streaming
Last synced: about 1 year ago
JSON representation
Efficiently tackle large datasets and perform big data analysis with Spark and Python
- Host: GitHub
- URL: https://github.com/trainingbypackt/big-data-processing-with-apache-spark-elearning
- Owner: TrainingByPackt
- License: mit
- Created: 2018-12-24T06:30:51.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2019-01-11T09:47:12.000Z (over 7 years ago)
- Last Synced: 2025-03-24T10:03:54.295Z (about 1 year ago)
- Topics: dataset, python, rdds, spark, spark-mllib, structured-streaming
- Language: Python
- Size: 36.1 KB
- Stars: 7
- Watchers: 4
- Forks: 6
- Open Issues: 0