Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/greenvine/hadoop-cloud-environ

Hadoop Cloud Environment Bootstrapper
https://github.com/greenvine/hadoop-cloud-environ

cloud-environment hadoop hbase infrastructure zookeeper

Last synced: 12 days ago
JSON representation

Hadoop Cloud Environment Bootstrapper

Awesome Lists containing this project

README

        

# hadoop-cloud-environ

This repository provides the outline of a standard Hadoop cloud computing cluster, as well as configuration scripts for bootstrapping the environment.

## Components

[Hadoop](https://hadoop.apache.org/) is a collection of open-source software utilities that facilitate using a network of many computers to solve problems involving massive amounts of data and computation. It provides a software framework for distributed storage and processing of big data using the MapReduce programming model.

[HBase](https://hbase.apache.org/) is an open-source, non-relational, distributed database modeled after Google's Bigtable and written in Java. It runs on top of HDFS, providing Bigtable-like capabilities for Hadoop.

[ZooKeeper](https://zookeeper.apache.org/) is a centralized service for distributed systems to a hierarchical key-value store, which is used to provide a distributed configuration service, synchronization service, and naming registry for large distributed systems.

(*Source: Wikipedia & Apache*)