https://github.com/npatta01/spark_metis_investigation
My investigation presentation during the Metis Bootcamp
- Host: GitHub
- URL: https://github.com/npatta01/spark_metis_investigation
- Owner: npatta01
- Created: 2015-10-02T00:41:04.000Z (about 9 years ago)
- Default Branch: master
- Last Pushed: 2015-11-09T14:51:39.000Z (about 9 years ago)
- Last Synced: 2024-12-07T07:04:31.484Z (18 days ago)
- Size: 5.01 MB
- Stars: 1
- Watchers: 2
- Forks: 1
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# About
My investigation presentation during my stay at the Metis Bootcamp.
A simple word count and message-length analysis on a sample of GitHub commit messages.
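To give a rough idea of what "word count and length" means here, below is a minimal sketch in plain Python. It is not the code from the notebook: the `messages` list is a made-up sample, and the three steps are only meant to mirror the `flatMap`, `map` and `reduceByKey` pattern that a PySpark word count usually follows.

```python
from collections import Counter
from functools import reduce

# Hypothetical sample of commit messages (not taken from the repo's data).
messages = [
    "Fix typo in README",
    "Add unit tests",
    "Fix failing unit tests",
]

# flatMap-like step: split every message into lowercase words.
words = [word for msg in messages for word in msg.lower().split()]

# map-like step: pair each word with a count of 1.
pairs = [(word, 1) for word in words]


def add_pair(counts, pair):
    """reduceByKey-like step: add one pair's count to the running totals."""
    word, n = pair
    counts[word] += n
    return counts


word_counts = reduce(add_pair, pairs, Counter())

# Length of each message in characters and in words.
lengths = [(len(msg), len(msg.split())) for msg in messages]

print(word_counts.most_common(3))
print(lengths)
```

On a real cluster the same three steps would run on an RDD instead of Python lists, which is what lets the analysis scale past a single machine.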
# Audience
Meant for a beginner audience that has heard the term "big data" but is more comfortable with pandas and Python.
# Data
The data consists of two hours of GitHub events archived by the GitHub Archive project.
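As a rough sketch of how commit messages can be pulled out of this kind of data, the snippet below assumes one JSON event per line, with push events carrying their commits under `payload.commits[].message`. That layout is my assumption about the archive format, the two sample lines are made up, and `commit_messages` is a hypothetical helper, not a function from the notebook.

```python
import json

# Two hypothetical lines in the assumed archive format (one JSON event per line).
sample_lines = [
    '{"type": "PushEvent", "payload": {"commits": '
    '[{"message": "Fix typo"}, {"message": "Add tests"}]}}',
    '{"type": "WatchEvent", "payload": {"action": "started"}}',
]


def commit_messages(lines):
    """Yield commit messages from PushEvent records, skipping other events."""
    for line in lines:
        event = json.loads(line)
        if event.get("type") != "PushEvent":
            continue
        for commit in event.get("payload", {}).get("commits", []):
            yield commit["message"]


messages = list(commit_messages(sample_lines))
print(messages)
```

Events other than pushes carry no commits, so they are skipped rather than treated as errors.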
# Deliverables
[Slides](http://www.slideshare.net/nidhinpattaniyil/beginner-apache-spark-presentation)
[Analysis](https://github.com/npatta01/spark_metis_investigation/blob/master/spark.ipynb)
Presented at a high level what Apache Spark is and how to use it with PySpark.
My slides and sample data are included in the repo.