Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/lynnlangit/learning-hadoop-and-spark
Companion to Learning Hadoop and Learning Spark courses on Linked In Learning
https://github.com/lynnlangit/learning-hadoop-and-spark
apache-spark dataproc emr hadoop learning-hadoop mapreduce spark wordcount
Last synced: 2 days ago
JSON representation
Companion to Learning Hadoop and Learning Spark courses on Linked In Learning
- Host: GitHub
- URL: https://github.com/lynnlangit/learning-hadoop-and-spark
- Owner: lynnlangit
- License: apache-2.0
- Created: 2019-06-22T15:20:09.000Z (over 5 years ago)
- Default Branch: master
- Last Pushed: 2024-12-10T17:52:23.000Z (about 1 month ago)
- Last Synced: 2025-01-13T03:03:17.429Z (9 days ago)
- Topics: apache-spark, dataproc, emr, hadoop, learning-hadoop, mapreduce, spark, wordcount
- Language: HTML
- Homepage: https://www.linkedin.com/learning/learning-hadoop-2
- Size: 13.6 MB
- Stars: 189
- Watchers: 18
- Forks: 163
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Learning Hadoop and Spark
## Contents
This is the companion repo to my Linked In Learning Courses on Apache Hadoop and Apache Spark.
đ **1. Learning Hadoop** - [link](https://www.linkedin.com/learning/learning-hadoop-23008320)
- this course demos I use mostly GCP Dataproc
- for running Hadoop & associated libraries (i.e. Hive, Pig, Spark...) workloads
đŠī¸ **2. Cloud Hadoop: Scaling Apache Spark** - [link](https://www.linkedin.com/learning/cloud-hadoop-scaling-apache-spark) & [link to content area in this repo](https://github.com/lynnlangit/learning-hadoop-and-spark/tree/master/5-Use-Spark)
- this course demos I use GCP DataProc, AWS EMR --or--
- I use Databricks on AWS or on GCP
âī¸ **3. Azure Databricks Spark Essential Training** - [link](https://www.linkedin.com/learning/azure-databricks-essential-training) & [link to content area in this repo](https://github.com/lynnlangit/learning-hadoop-and-spark/tree/master/5-Use-Spark/Jupyter-Notebooks)
- this course demos I use Azure with Databricks
- for scaling Apache Spark workloads---
## Other LinkedIn Learning Courses on Hadoop or Spark
There are ~ 10 courses on Hadoop/Spark topics on LinkedIn Learning. See graphic below
![Learning Paths](https://github.com/lynnlangit/learning-hadoop-and-spark/blob/master/images/path.png)- **Hadoop** for Data Science Tips and Tricks - [link](https://www.linkedin.com/learning/hadoop-for-data-science-tips-tricks-techniques)
- Set up Cloudera Enviroment
- Working with Files in HDFS
- Connecting to Hadoop Hive
- Complex Data Structures in Hive
- **Spark** courses - [link](https://www.linkedin.com/learning/search?entityType=COURSE&keywords=Spark&software=Apache%20Spark~Hadoop)
- Various Topics - see screenshot below![LinkedInLearningSpark](https://github.com/lynnlangit/learning-hadoop-and-spark/blob/master/images/spark-courses.png)