https://github.com/drylikov/sequenceiq
SequenceIQ Hadoop examples.
- Host: GitHub
- URL: https://github.com/drylikov/sequenceiq
- Owner: drylikov
- License: mit
- Created: 2024-09-07T21:54:50.000Z (2 months ago)
- Default Branch: drylikov
- Last Pushed: 2024-09-07T21:57:12.000Z (2 months ago)
- Last Synced: 2024-09-07T22:54:23.296Z (2 months ago)
- Language: Java
- Size: 0 Bytes
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
SequenceIQ Hadoop sample projects
============================

This repository is a collection of sample projects and code examples featured in our blog entries. For more details, check the [SequenceIQ blog](http://blog.sequenceiq.com).

This samples repository and the blog contain random thoughts, proofs of concept, and interesting issues we have faced while developing our product stack. Where the samples are not covered by a blog entry, we try to make them self-explanatory or supply a short readme. Please feel free to collaborate, share, ask for help, or report issues.

* **flume-sources** module: [Custom Apache Flume source](http://blog.sequenceiq.com/blog/2014/02/22/custom-flume-source/)
* **etl-samples** module: [ETL - producing better quality data](http://blog.sequenceiq.com/blog/2014/02/28/etl-and-data-quality/)
* **hdp-sandbox-access** module: [Accessing HDP2 sandbox from the host](http://blog.sequenceiq.com/blog/2014/03/05/access-hdp2-sandbox/)
* **lastfm-morphlines-etl** module: [How-to: Process Data using Morphlines (in Kite SDK)](http://blog.cloudera.com/blog/2014/04/how-to-process-data-using-morphlines-in-kite-sdk/)
* **hdp-sandbox-access** module: [HDFS and java.nio.channels](http://blog.sequenceiq.com/blog/2014/03/07/read-from-hdfs/)
* **mapreduce-morphline** module: [Data cleaning with MapReduce and Morphlines](http://blog.sequenceiq.com/blog/2014/03/11/data-cleaning-with-mapreduce-and-morphlines/)
* **yarn-queue-tests** module: [YARN Capacity Scheduler](http://blog.sequenceiq.com/blog/2014/03/14/yarn-capacity-scheduler/)
* **tez-dag-jobs** module: [Using Mahout with Tez](http://blog.sequenceiq.com/blog/2014/03/31/mahout-on-tez/)
* **yarn-monitoring-R** module: [Monitoring YARN with R]()

Thanks,
[SequenceIQ](http://sequenceiq.com)