https://github.com/dimajix/hadoop-training
Source Code for Hadoop Training
- Host: GitHub
- URL: https://github.com/dimajix/hadoop-training
- Owner: dimajix
- Created: 2015-11-15T15:54:36.000Z (about 9 years ago)
- Default Branch: master
- Last Pushed: 2015-12-28T17:44:41.000Z (about 9 years ago)
- Last Synced: 2024-11-09T17:38:40.883Z (3 months ago)
- Topics: hadoop, hadoop-training, spark
- Language: Scala
- Size: 171 KB
- Stars: 0
- Watchers: 4
- Forks: 4
- Open Issues: 0
Metadata Files:
- Readme: README.md
# Setup Instructions
## Directory Structure
For the easiest setup, I recommend the following directory layout:
    /home/cloudera
      |
      +---- hadoop-training
      |       |
      |       +----- example-01
      |       |
      |       +----- example-02
      |       |
      |       ...
      |
      +---- data
              |
              +--- weather
                     |
                     +--- 2011
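
The data part of this layout can be created with a few shell commands. This is only a minimal sketch: the `BASE` variable is a hypothetical stand-in for `/home/cloudera` and defaults to the current user's home directory, which is an assumption about your environment.

```shell
# Assumption: BASE stands in for /home/cloudera; it defaults to the current user's home.
BASE="${BASE:-$HOME}"

# Create data/weather/2011, including any missing parent directories.
mkdir -p "$BASE/data/weather/2011"

# List the created directories to compare them with the layout above.
find "$BASE/data" -type d
```

The `hadoop-training` directory would then typically be a checkout of this repository placed next to `data`, for example via `git clone https://github.com/dimajix/hadoop-training`.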
## Building
    mvn install
# Contained Examples
## hadoop-wordcount-01
The classic Hadoop word count example.
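
To illustrate what a word count computes, here is a rough shell analogy. It is not the code from this repository and it does not use Hadoop. The `wordcount` function is a hypothetical helper, and the pipeline stages only loosely mirror the map, shuffle, and reduce phases.

```shell
# Hypothetical helper: count word occurrences read from standard input.
wordcount() {
    tr -s '[:space:]' '\n' |         # "map": emit one word per line
        grep -v '^$' |               # drop empty lines caused by leading whitespace
        sort |                       # "shuffle": bring identical words together
        uniq -c |                    # "reduce": count each group of identical words
        awk '{ print $2 "\t" $1 }'   # print each result as word<TAB>count
}

printf 'to be or not to be\n' | wordcount
```

For the sample input this prints `be` and `to` with a count of 2, and `not` and `or` with a count of 1.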
## hadoop-wordcount-02
An improved version of the classic word count example.