Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/drylikov/hadoop_tutorials
Hadoop Tutorials.
https://github.com/drylikov/hadoop_tutorials
Last synced: 8 days ago
JSON representation
Hadoop Tutorials.
- Host: GitHub
- URL: https://github.com/drylikov/hadoop_tutorials
- Owner: drylikov
- License: mit
- Created: 2024-09-06T22:30:44.000Z (2 months ago)
- Default Branch: drylikov
- Last Pushed: 2024-09-06T22:38:28.000Z (2 months ago)
- Last Synced: 2024-09-07T02:15:21.868Z (2 months ago)
- Size: 67.1 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
##Hadoop Tutorials
This repo contains a set of tutorials for Hadoop designed to work inside or alongside the [Hortonworks Sandbox](http://hortonworks.com/products/hortonworks-sandbox/). Sandbox is a single-node Hadoop cluster that runs in a Virtual Machine (Virtual Box, VMWare, Hyper-V).
![image](sandbox.png?raw=true)
Current tutorials are as follows:
###Sandbox Tutorials (Embedded in Sandbox and tested with v2.0)
* [1 - Hello World - Using Sandbox with Hive and Pig](/Sandbox/T01_Hello_World_Using_Sandbox_with_Hive_and_Pig.md)
* [2 - Data Processing with Pig - Processing Baseball Stats](/Sandbox/T02_Data_Processing_with_Pig.md)
* [3 - Data Processing with Hive - Processing Baseball Stats](/Sandbox/T03_Data_Processing_with_Hive.md)
* [4 - HCatalog, Basic Pig and Hive Commands](/Sandbox/T04_HCatalog_Basic_Pig_and_Hive.md)
* [5 - Using Basic Pig Commands](/Sandbox/T05_Using_Basic_Pig_Commands.md)
* [6 - Loading Data into the Hadoop](/Sandbox/T06_Loading_Data_into_Sandbox.md)
* [7 - Installing the Hortonworks ODBC driver on Windows 7](/Sandbox/T07_Installing_the_Hortonworks_ODBC_Driver_on_Windows_7.md)
* [8 - Using Excel 2013 to Access Hadoop Data](/Sandbox/T08_Using_Excel_2013_to_Access_Hadoop_data.md)
* [9 - Using Excel 2013 to Analyze Hadoop Data](/Sandbox/T09_Using_Excel_2013_to_Analyze_Hadoop_data.md)
* [10 - Visualizing Website Clickstream Data](/Sandbox/T10_Visualizing_Website_Clickstream_Data.md)
* [11 - Installing the Hortonworks ODBC driver on Mac OS X](/Sandbox/T11_Installing_the_Hortonworks_ODBC_driver_on_Mac_OSX.md)
* [12 - Refining and Visualizing Server Log Data](/Sandbox/T12_Refining_and_Visualizing_Server_Log_Data.md)
* [13 - Refining and Visualizing Sentiment Data](/Sandbox/T13_Refining_and_Visualizing_Sentiment_Data.md)
* [14 - Analyzing Machine and Sensor Data](/Sandbox/T14_Analyzing_Machine_and_Sensor_Data.md)###Community Tutorials
Feel free to contribute tutorials and help the community harness Hadoop!
* [1 - Using RHadoop to Predict Web Visitors](/Community/T01_RHadoop_visitors_prediction.md) from [mwacc](https://github.com/mwacc)
* [2 - Using Spring XD to stream Tweets to Hadoop for Sentiment Analysis](/Community/T02_Spring_XD_Hadoop_Twitter.md) from [mehzer](https://github.com/mehzer)
* [3 - Word Counting with Pig](/Community/T03_Word_Counting_With_Pig.md) from [flacrosse](https://github.com/flacrosse)
* [4 - Import from Microsoft SQL Server into the Hortonworks Sandbox using Sqoop](/Community/T04_Import_from_Microsoft_SQL_Server_into_the_Hortonworks_Sandbox_using_Sqoop.md) from [flacrosse](https://github.com/flacrosse)
* [5 - Use HDFS Explorer to Manage Files on Hortonworks Sandbox](/Community/T05_Use_HDFS_Explorer_To_Manage_Files_On_The_Hortonworks_Sandbox.md) from [RedGate](http://redgate.com)
* [6 - Java-Hive Integration - Salary Delta Analysis and Visualization Using Hive, Java and JFreeChart](/Community/T06_Java_Hive_Integration_Salary_Delta_Analysis_And_Visualization_Using_JFreeChart.md) from [vivganes](https://github.com/vivganes)
* [7 - elasticsearch-Hadoop integration](/Community/T07_Elasticsearch_Hadoop_Integration.md) from [yannbrrd](https://github.com/yannbrrd)
* [8 - Java-HBase Integration - Professional Analysis and Visualization Using HBase, Java and JFreeChart](/Community/T08_Java_HBase_Integration_Profession_Analysis_And_Visualization_Using_JFreeChart.md) from [vivganes](https://github.com/vivganes)
* [9 - Write and Run Your Own MapReduce Java Program - Poll Result Analysis Using Hadoop, Java and Eclipse](/Community/T09_Write_And_Run_Your_Own_MapReduce_Java_Program_Poll_Result_Analysis.md) from [vivganes](https://github.com/vivganes)