https://github.com/jackgerrits/cb-spark-demo
https://github.com/jackgerrits/cb-spark-demo
Last synced: 2 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/jackgerrits/cb-spark-demo
- Owner: jackgerrits
- License: bsd-3-clause
- Created: 2020-09-04T18:56:03.000Z (over 4 years ago)
- Default Branch: master
- Last Pushed: 2020-09-11T17:18:32.000Z (over 4 years ago)
- Last Synced: 2025-01-23T06:32:36.197Z (4 months ago)
- Language: Jupyter Notebook
- Size: 16.6 KB
- Stars: 0
- Watchers: 3
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: readme.md
- License: LICENSE
Awesome Lists containing this project
README
# Getting started
1. Download and extract Spark: [download](https://spark.apache.org/downloads.html)
2. Add to your `.bashrc`
```sh
# This is where you extracted Spark
export SPARK_HOME="$HOME/spark/spark-2.4.6-bin-hadoop2.7/"
export PATH=$SPARK_HOME/bin:$PATH
export PYTHONPATH=$SPARK_HOME/python/:$PYTHONPATH
```
3. Install pyspark
```sh
pip install pyspark
```
4. Start notebook server:
```sh
jupyter notebook
```