https://github.com/dimdasci/yp11-pyspark-training
Training project with Spark DataFrame and MLlib
https://github.com/dimdasci/yp11-pyspark-training
pyspark-eda pyspark-mllib pyspark-notebook pyspark-regression pyspark-tutorial
Last synced: 5 days ago
JSON representation
Training project with Spark DataFrame and MLlib
- Host: GitHub
- URL: https://github.com/dimdasci/yp11-pyspark-training
- Owner: dimdasci
- License: cc0-1.0
- Created: 2022-08-01T14:34:54.000Z (almost 4 years ago)
- Default Branch: main
- Last Pushed: 2022-08-06T11:13:45.000Z (almost 4 years ago)
- Last Synced: 2024-04-16T01:32:01.691Z (about 2 years ago)
- Topics: pyspark-eda, pyspark-mllib, pyspark-notebook, pyspark-regression, pyspark-tutorial
- Language: Jupyter Notebook
- Homepage:
- Size: 765 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# yp11-pyspark-training
Training project with Spark DataFrame and MLlib
This project assumes using conda to manage environments.
## How to install and run
git clone https://github.com/dimdasci/yp11-pyspark-training.git
cd ./yp11-pyspark-training
make install
conda activate ./envs
make run
## How to unistall
conda deactivate
make uninstall
cd ..
rm -rf ./yp11-pyspark-training