https://github.com/onecricketeer/mapreduce-sandbox

mapreduce-sandbox
===

Sandbox for running Hadoop MapReduce programs

To run a specific task, run `./gradlew :<task>`, where `<task>` is one of the Gradle task names in the table below.

The following classes are available:

|Main class|Input File|Gradle task|
|---|---|---|
|`CustomerDriver`|`inFiles/customer`|`runCustomerDriver`|
|`DateGrouperDriver`|`inFiles/dates`|`runDateGrouperDriver`|
|`StoreSumDriver`|`inFiles/stores`|`runStoreSumDriver`|
|`TempMinMaxAvgDriver`|`inFiles/temp`|`runTempMinMaxAvgDriver`|
|`WeatherMultiDriver`|`inFiles/weather`|`runWeatherMultiDriver`|
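
In the table, each Gradle task name is `run` followed by the main class name. The sketch below captures that pattern in plain Java. `SandboxTasks`, `gradleTask`, and `command` are hypothetical helpers for illustration, not part of the repository, and the naming rule is inferred from the table rather than from the build script.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class SandboxTasks {
    // Main class -> input path, copied from the table above.
    static final Map<String, String> INPUTS = new LinkedHashMap<>();
    static {
        INPUTS.put("CustomerDriver", "inFiles/customer");
        INPUTS.put("DateGrouperDriver", "inFiles/dates");
        INPUTS.put("StoreSumDriver", "inFiles/stores");
        INPUTS.put("TempMinMaxAvgDriver", "inFiles/temp");
        INPUTS.put("WeatherMultiDriver", "inFiles/weather");
    }

    // Assumed convention, inferred from the table: task = "run" + main class.
    static String gradleTask(String mainClass) {
        if (!INPUTS.containsKey(mainClass)) {
            throw new IllegalArgumentException("Unknown main class: " + mainClass);
        }
        return "run" + mainClass;
    }

    // Builds the command line for the root-project task.
    static String command(String mainClass) {
        return "./gradlew :" + gradleTask(mainClass);
    }

    public static void main(String[] args) {
        // Print one command per driver, with the input it reads.
        for (Map.Entry<String, String> e : INPUTS.entrySet()) {
            System.out.println(command(e.getKey()) + "  # reads " + e.getValue());
        }
    }
}
```

For example, `command("CustomerDriver")` yields `./gradlew :runCustomerDriver`.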

Each task will write into (and overwrite) the `outFiles/` directory.
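
Hadoop's `FileOutputFormat` normally refuses to start a job whose output directory already exists, so overwriting `outFiles/` presumably means the directory is cleared before each run. How this repository does that is not shown here. The sketch below is one assumed approach using only the Java standard library; `OutputDirCleaner` and `deleteIfExists` are hypothetical names.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Comparator;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

public class OutputDirCleaner {
    // Recursively deletes dir if it exists; does nothing otherwise.
    static void deleteIfExists(Path dir) throws IOException {
        if (!Files.exists(dir)) {
            return;
        }
        List<Path> paths;
        try (Stream<Path> walk = Files.walk(dir)) {
            // Reverse order puts children before their parent directories.
            paths = walk.sorted(Comparator.reverseOrder())
                        .collect(Collectors.toList());
        }
        for (Path p : paths) {
            Files.delete(p);
        }
    }

    public static void main(String[] args) throws IOException {
        // Demonstrate on a throwaway directory, not the real outFiles/.
        Path demo = Files.createTempDirectory("outFiles-demo");
        Files.write(demo.resolve("part-r-00000"), "old result\n".getBytes());
        deleteIfExists(demo);
        System.out.println("still exists: " + Files.exists(demo));
    }
}
```

Either way, copy anything you want to keep out of `outFiles/` before running another task.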