Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/sbcd90/hadoop

Miscellanous prototypes on Hadoop & its eco-system components
https://github.com/sbcd90/hadoop

Last synced: 9 days ago
JSON representation

Miscellanous prototypes on Hadoop & its eco-system components

Awesome Lists containing this project

README

        

# Hadoop
Miscellanous prototypes on Hadoop & its eco-system components

- directories

- catalina_logs
- Catalina logs directory, generated by tomcat

- flume-client
- sample apache flume client

- flume_custom_sink
- sample apache flume custom sink

- hbase-log-files

- put flume on top of HBase to read data from client & write to HBase.
- writing log files to hbase.
- create an RDD in Spark using the log file data stored in hbase.

- hbase_storage

- storing catalina log files using apache kafka & hbase

- hbase_storage

- storing catalina log files using apache kafka & hbase (in java, with monsoon deployable automated scripts)

- kafka-producer

- a dummy kafka producer-conusmer mechanism

- spring_xd_batch_jobs

- creating spring batch jobs with custom reader & writer

- spring_xd_hana

- connecting to hana using standard reader from spring & using custom writer

- flink_poc

- flink batch processing poc with transformations
- flink streaming poc

- mapreduce

- simple mapreduce wordcount example