Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/sbcd90/hadoop
Miscellanous prototypes on Hadoop & its eco-system components
https://github.com/sbcd90/hadoop
Last synced: 9 days ago
JSON representation
Miscellanous prototypes on Hadoop & its eco-system components
- Host: GitHub
- URL: https://github.com/sbcd90/hadoop
- Owner: sbcd90
- Created: 2015-04-07T18:50:45.000Z (over 9 years ago)
- Default Branch: master
- Last Pushed: 2019-05-11T14:30:34.000Z (over 5 years ago)
- Last Synced: 2024-04-14T18:21:09.888Z (7 months ago)
- Language: Java
- Size: 194 KB
- Stars: 1
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Hadoop
Miscellanous prototypes on Hadoop & its eco-system components- directories
- catalina_logs
- Catalina logs directory, generated by tomcat- flume-client
- sample apache flume client- flume_custom_sink
- sample apache flume custom sink- hbase-log-files
- put flume on top of HBase to read data from client & write to HBase.
- writing log files to hbase.
- create an RDD in Spark using the log file data stored in hbase.- hbase_storage
- storing catalina log files using apache kafka & hbase
- hbase_storage
- storing catalina log files using apache kafka & hbase (in java, with monsoon deployable automated scripts)
- kafka-producer
- a dummy kafka producer-conusmer mechanism
- spring_xd_batch_jobs
- creating spring batch jobs with custom reader & writer
- spring_xd_hana
- connecting to hana using standard reader from spring & using custom writer
- flink_poc
- flink batch processing poc with transformations
- flink streaming poc- mapreduce
- simple mapreduce wordcount example