https://github.com/acroz/sparkmagic-talk
Slides and a demo notebook from my talk at PyCon UK 2018.
https://github.com/acroz/sparkmagic-talk
Last synced: about 2 months ago
JSON representation
Slides and a demo notebook from my talk at PyCon UK 2018.
- Host: GitHub
- URL: https://github.com/acroz/sparkmagic-talk
- Owner: acroz
- Created: 2018-09-17T18:11:51.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2018-09-18T10:47:21.000Z (over 6 years ago)
- Last Synced: 2025-02-08T09:45:27.463Z (4 months ago)
- Language: Jupyter Notebook
- Homepage:
- Size: 2.86 MB
- Stars: 1
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Using Spark from Python and Jupyter
This repo contains slides and a demo notebook from my talk at PyCon UK 2018.
You can watch the talk [on youtube][talk on youtube].
## Content
In this talk I presented:
* A brief introduction to [Apache Spark][spark].
* Connecting to a Spark cluster running the [Apache Livy][livy] REST interface
from Jupyter with [sparkmagic][sparkmagic] and any Python code with
[pylivy][pylivy].
* The basics of loading data into Spark, manipulating it and doing analysis
with [MLlib][mllib].
* Retrieving data back into Jupyter or Python for further analysis.
* An example web app using [Plotly Dash][dash], [Python RQ][rq] and pylivy to
build a Spark-powered dashboard using only Python.## Questions and feedback
Any questions or feedback are welcome either as GitHub issues on this repo, or
directly over email at [email protected].## Contribute
[pylivy][pylivy] doesn't yet support nearly all the features provided by Livy.
If you'd like to contribute please get in touch![talk on youtube]: https://www.youtube.com/watch?v=iHg0NwEjbxk
[spark]: https://spark.apache.org
[livy]: https://livy.incubator.apache.org
[sparkmagic]: https://github.com/jupyter-incubator/sparkmagic
[pylivy]: https://github.com/acroz/pylivy
[mllib]: https://spark.apache.org/docs/latest/ml-guide.html
[dash]: https://dash.plot.ly
[rq]: https://python-rq.org