https://github.com/axsaucedo/hadoop-overview

Hands on Hadoop, services, installation



Host: GitHub
URL: https://github.com/axsaucedo/hadoop-overview
Owner: axsaucedo
Created: 2017-05-26T11:43:49.000Z (over 8 years ago)
Default Branch: master
Last Pushed: 2017-06-10T19:31:40.000Z (over 8 years ago)
Last Synced: 2025-03-23T18:11:19.276Z (7 months ago)
Topics: ambari, hadoop, hdfs, hive, mapreduce, mesos, notes, pig, spark, yarn
Language: HTML
Homepage: https://axsauze.github.io/hadoop-overview
Size: 1.75 MB
Stars: 6
Watchers: 1
Forks: 10
Open Issues: 0
Metadata Files:
- Readme: README.md

README

# Introduction

These are complete notes on the hands-on practice, exercises and installation of Hadoop-related services, based on the [Hands on Hadoop Course](https://www.udemy.com/the-ultimate-hands-on-hadoop-tame-your-big-data). They cover: HDFS, YARN, MapReduce, Pig, Hive, Ambari, Spark, Mesos, Tez, HBase, Storm, Oozie, Flink, Sqoop, Flume, Kafka, MySQL, Cassandra, MongoDB, Drill, Hue, Phoenix, Presto and Zeppelin.

The live notebook can be found at: [axsauze.github.io/hadoop-overview](https://axsauze.github.io/hadoop-overview/) 

List of contents: 

* Section 1

    - [1-1 - Hortonworks Installation](section-1/1-1.md) 

    - [1-2 - The Hadoop Ecosystem](section-1/1-2.md) 

    - [1-3 - Hadoop Ecosystem and Technologies](section-1/1-3.md) 

* Section 2

    - [2-1 - HDFS: What is it and how it works](section-2/2-1.md) 

    - [2-2 - Install Movielens dataset](section-2/2-2.md) 

    - [2-3 - HDFS: Command line interface](section-2/2-3.md)

    - [2-4 - MapReduce Fundamental Concepts](section-2/2-4.md) 

    - [2-5 - MapReduce on a Cluster - How MR Scales](section-2/2-5.md) 

    - [2-6 - MapReduce: A Real Life Example](section-2/2-6.md) 

    - [2-7 - Running MapReduce with MRJOB](section-2/2-7.md) 

    - [2-8 - Running with MRJob](section-2/2-8.md) 

    - [2-9 - Hadoop Your Challenge](section-2/2-9.md) 

    - [2-10 - Check your results](section-2/2-10.md) 

* Section 3

    - [3-1 - Introducing Ambari](section-3/3-1.md)

    - [3-2 - Introducing Pig](section-3/3-2.md)

    - [3-3 - PIG Example](section-3/3-3.md)

    - [3-4 - Running Pig Script](section-3/3-4.md)

    - [3-5 - PigLatin: Diving Deeper](section-3/3-5.md)

    - [3-6 - PIG Challenge](section-3/3-6.md)

* Section 4

    - [4-1 - SPARK](section-4/4-1.md)

    - [4-2 - Introducing RDDs](section-4/4-2.md)

    - [4-3 - Example RDD in Spark](section-4/4-3.md)

    - [4-4 - Spark2.0 way of DataFrames and DataSets](section-4/4-4.md)

    - [4-5 - Spark2 Finding movies!](section-4/4-5.md)

    - [4-6 - Using MLLib in Spark](section-4/4-6.md)

    - [4-7 - Spark2 Challenge](section-4/4-7.md)

* Section 5

    - [5-1 - What is HIVE?](section-5/5-1.md)

    - [5-2 - Hive Example](section-5/5-2.md)

    - [5-3 - How Hive Works](section-5/5-3.md)

    - [5-4 - Hive Challenge](section-5/5-4.md)

    - [5-5 - Integrating MySQL & Hadoop](section-5/5-5.md)

    - [5-6 - Install MySQL and Run Stuff](section-5/5-6.md)

    - [5-7 - Using Sqoop to extract data from MySQL](section-5/5-7.md)

    - [5-8 - Sqoop export data to MySQL](section-5/5-8.md)

* Section 6

    - [6-1 - Why NoSQL](section-6/6-1.md)

    - [6-2 - HBase](section-6/6-2.md)

    - [6-3 - Let's play with HBase](section-6/6-3.md)

    - [6-4 - HBase / Pig Integration](section-6/6-4.md)

    - [6-5 - Cassandra Overview](section-6/6-5.md)

    - [6-6 - Installing Cassandra](section-6/6-6.md)

    - [6-7 - Write Spark to Cassandra](section-6/6-7.md)

    - [6-8 - MongoDB](section-6/6-8.md)

    - [6-9 - Installing MongoDB](section-6/6-9.md)

    - [6-10 - MongoDB Shell](section-6/6-10.md)

    - [6-11 - Choosing your database](section-6/6-11.md)

    - [6-12 - Exercise](section-6/6-12.md)

* Section 7

    - [7-1 - External query engines](section-7/7-1.md)

    - [7-2 - Setting up drill](section-7/7-2.md)

    - [7-3 - Querying with Drill](section-7/7-3.md)

    - [7-4 - Apache Phoenix](section-7/7-4.md)

    - [7-5 - Installing Phoenix](section-7/7-5.md)

    - [7-6 - Using Phoenix](section-7/7-6.md)

    - [7-7 - What is presto?](section-7/7-7.md)

    - [7-8 - Installing PRESTO!](section-7/7-8.md)

    - [7-9 - Presto Cassandra integration](section-7/7-9.md)

* Section 8

    - [8-1 - Hadoop under the hood](section-8/8-1.md)

    - [8-2 - TEZ Explained](section-8/8-2.md)

    - [8-3 - Configure and run hive](section-8/8-3.md)

    - [8-4 - Apache Mesos](section-8/8-4.md)

    - [8-5 - Zookeeper Overview](section-8/8-5.md)

    - [8-6 - Simulating Zookeeper Master Failure](section-8/8-6.md)

    - [8-7 - Oozie](section-8/8-7.md)

    - [8-8 - Workflow in Oozie](section-8/8-8.md)

    - [8-9 - Zeppelin Overview](section-8/8-9.md)

    - [8-10 - Playing with Zeppelin](section-8/8-10.md)

    - [8-11 - Zeppelin Advanced](section-8/8-11.md)

    - [8-12 - HUE](section-8/8-12.md)

    - [8-13 - Other Admin Technologies](section-8/8-13.md)

* Section 9

    - [9-1 - Streaming with Kafka](section-9/9-1.md)

    - [9-2 - Setting up Kafka](section-9/9-2.md)

    - [9-3 - Advanced Kafka](section-9/9-3.md)

    - [9-4 - Apache Flume](section-9/9-4.md)

    - [9-5 - Setup Flume](section-9/9-5.md)

    - [9-6 - Setup Flume Flow](section-9/9-6.md)

* Section 10

    - [10-1 - Spark STREAMING](section-10/10-1.md)

    - [10-2 - Spark Streaming Example](section-10/10-2.md)

    - [10-3 - Python Challenge](section-10/10-3.md)

    - [10-4 - Apache Storm](section-10/10-4.md)

    - [10-5 - Apache Storm Example](section-10/10-5.md)

    - [10-6 - What is Flink](section-10/10-6.md)

    - [10-7 - Counting words with Flink](section-10/10-7.md)

* Section 11

    - [11-1 - All the other services](section-11/11-1.md)

    - [11-2 - How pieces fit together](section-11/11-2.md)

    - [11-3 - Understanding your requirements](section-11/11-3.md)

    - [11-4 - Example Top Sellers](section-11/11-4.md)

    - [11-5 - Example: Movie Recommendations](section-11/11-5.md)

    - [11-6 - Challenge: System report](section-11/11-6.md)

* Section 12

    - [12-1 - Books and Online Courses](section-12/12-1.md)

