Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/sbcd90/hadoop

Miscellanous prototypes on Hadoop & its eco-system components
https://github.com/sbcd90/hadoop

Last synced: 9 days ago
JSON representation

Miscellanous prototypes on Hadoop & its eco-system components

Host: GitHub
URL: https://github.com/sbcd90/hadoop
Owner: sbcd90
Created: 2015-04-07T18:50:45.000Z (over 9 years ago)
Default Branch: master
Last Pushed: 2019-05-11T14:30:34.000Z (over 5 years ago)
Last Synced: 2024-04-14T18:21:09.888Z (7 months ago)
Language: Java
Size: 194 KB
Stars: 1
Watchers: 2
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# Hadoop
Miscellanous prototypes on Hadoop & its eco-system components

- directories

- catalina_logs
- Catalina logs directory, generated by tomcat

- flume-client
- sample apache flume client

- flume_custom_sink
- sample apache flume custom sink

- hbase-log-files

- put flume on top of HBase to read data from client & write to HBase.
- writing log files to hbase.
- create an RDD in Spark using the log file data stored in hbase.

- hbase_storage

- storing catalina log files using apache kafka & hbase

- hbase_storage

- storing catalina log files using apache kafka & hbase (in java, with monsoon deployable automated scripts)

- kafka-producer

- a dummy kafka producer-conusmer mechanism

- spring_xd_batch_jobs

- creating spring batch jobs with custom reader & writer

- spring_xd_hana

- connecting to hana using standard reader from spring & using custom writer

- flink_poc

- flink batch processing poc with transformations
- flink streaming poc

- mapreduce

- simple mapreduce wordcount example