Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/drylikov/hadoop_tutorials

Hadoop Tutorials.
https://github.com/drylikov/hadoop_tutorials

Last synced: 8 days ago
JSON representation

Hadoop Tutorials.

Awesome Lists containing this project

README

        

##Hadoop Tutorials

This repo contains a set of tutorials for Hadoop designed to work inside or alongside the [Hortonworks Sandbox](http://hortonworks.com/products/hortonworks-sandbox/). Sandbox is a single-node Hadoop cluster that runs in a Virtual Machine (Virtual Box, VMWare, Hyper-V).

![image](sandbox.png?raw=true)

Current tutorials are as follows:

###Sandbox Tutorials (Embedded in Sandbox and tested with v2.0)

* [1 - Hello World - Using Sandbox with Hive and Pig](/Sandbox/T01_Hello_World_Using_Sandbox_with_Hive_and_Pig.md)
* [2 - Data Processing with Pig - Processing Baseball Stats](/Sandbox/T02_Data_Processing_with_Pig.md)
* [3 - Data Processing with Hive - Processing Baseball Stats](/Sandbox/T03_Data_Processing_with_Hive.md)
* [4 - HCatalog, Basic Pig and Hive Commands](/Sandbox/T04_HCatalog_Basic_Pig_and_Hive.md)
* [5 - Using Basic Pig Commands](/Sandbox/T05_Using_Basic_Pig_Commands.md)
* [6 - Loading Data into the Hadoop](/Sandbox/T06_Loading_Data_into_Sandbox.md)
* [7 - Installing the Hortonworks ODBC driver on Windows 7](/Sandbox/T07_Installing_the_Hortonworks_ODBC_Driver_on_Windows_7.md)
* [8 - Using Excel 2013 to Access Hadoop Data](/Sandbox/T08_Using_Excel_2013_to_Access_Hadoop_data.md)
* [9 - Using Excel 2013 to Analyze Hadoop Data](/Sandbox/T09_Using_Excel_2013_to_Analyze_Hadoop_data.md)
* [10 - Visualizing Website Clickstream Data](/Sandbox/T10_Visualizing_Website_Clickstream_Data.md)
* [11 - Installing the Hortonworks ODBC driver on Mac OS X](/Sandbox/T11_Installing_the_Hortonworks_ODBC_driver_on_Mac_OSX.md)
* [12 - Refining and Visualizing Server Log Data](/Sandbox/T12_Refining_and_Visualizing_Server_Log_Data.md)
* [13 - Refining and Visualizing Sentiment Data](/Sandbox/T13_Refining_and_Visualizing_Sentiment_Data.md)
* [14 - Analyzing Machine and Sensor Data](/Sandbox/T14_Analyzing_Machine_and_Sensor_Data.md)

###Community Tutorials

Feel free to contribute tutorials and help the community harness Hadoop!

* [1 - Using RHadoop to Predict Web Visitors](/Community/T01_RHadoop_visitors_prediction.md) from [mwacc](https://github.com/mwacc)
* [2 - Using Spring XD to stream Tweets to Hadoop for Sentiment Analysis](/Community/T02_Spring_XD_Hadoop_Twitter.md) from [mehzer](https://github.com/mehzer)
* [3 - Word Counting with Pig](/Community/T03_Word_Counting_With_Pig.md) from [flacrosse](https://github.com/flacrosse)
* [4 - Import from Microsoft SQL Server into the Hortonworks Sandbox using Sqoop](/Community/T04_Import_from_Microsoft_SQL_Server_into_the_Hortonworks_Sandbox_using_Sqoop.md) from [flacrosse](https://github.com/flacrosse)
* [5 - Use HDFS Explorer to Manage Files on Hortonworks Sandbox](/Community/T05_Use_HDFS_Explorer_To_Manage_Files_On_The_Hortonworks_Sandbox.md) from [RedGate](http://redgate.com)
* [6 - Java-Hive Integration - Salary Delta Analysis and Visualization Using Hive, Java and JFreeChart](/Community/T06_Java_Hive_Integration_Salary_Delta_Analysis_And_Visualization_Using_JFreeChart.md) from [vivganes](https://github.com/vivganes)
* [7 - elasticsearch-Hadoop integration](/Community/T07_Elasticsearch_Hadoop_Integration.md) from [yannbrrd](https://github.com/yannbrrd)
* [8 - Java-HBase Integration - Professional Analysis and Visualization Using HBase, Java and JFreeChart](/Community/T08_Java_HBase_Integration_Profession_Analysis_And_Visualization_Using_JFreeChart.md) from [vivganes](https://github.com/vivganes)
* [9 - Write and Run Your Own MapReduce Java Program - Poll Result Analysis Using Hadoop, Java and Eclipse](/Community/T09_Write_And_Run_Your_Own_MapReduce_Java_Program_Poll_Result_Analysis.md) from [vivganes](https://github.com/vivganes)