Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/dimajix/hadoop-training

Source Code for Hadoop Training
https://github.com/dimajix/hadoop-training

hadoop hadoop-training spark

Last synced: about 1 month ago
JSON representation

Source Code for Hadoop Training

Awesome Lists containing this project

README

        

# Setup Instructions

## Directory Structure

For easiest results, I recommend to have the following directory layout

/home/cloudera
|
+---- hadoop-training
| |
| +----- exameple-01
| |
| +----- exameple-02
| |
| ...
|
+---- data
|
+--- weather
|
+--- 2011

## Building

mvn install

# Contained Examples

## hadoop-wordcount-01

Classic example

## hadoop-wordcount-02

Classic example, improved