An open API service indexing awesome lists of open source software.

https://github.com/dimdasci/yp11-pyspark-training

Training project with Spark DataFrame and MLlib
https://github.com/dimdasci/yp11-pyspark-training

pyspark-eda pyspark-mllib pyspark-notebook pyspark-regression pyspark-tutorial

Last synced: 5 days ago
JSON representation

Training project with Spark DataFrame and MLlib

Awesome Lists containing this project

README

          

# yp11-pyspark-training
Training project with Spark DataFrame and MLlib

This project assumes using conda to manage environments.

## How to install and run

git clone https://github.com/dimdasci/yp11-pyspark-training.git
cd ./yp11-pyspark-training
make install
conda activate ./envs
make run

## How to unistall

conda deactivate
make uninstall
cd ..
rm -rf ./yp11-pyspark-training