Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/greenvine/hadoop-cloud-environ
Hadoop Cloud Environment Bootstrapper
Hadoop Cloud Environment Bootstrapper
- Host: GitHub
- URL: https://github.com/greenvine/hadoop-cloud-environ
- Owner: GreenVine
- Created: 2018-12-15T11:43:28.000Z (about 6 years ago)
- Default Branch: develop
- Last Pushed: 2018-12-25T13:39:13.000Z (about 6 years ago)
- Last Synced: 2024-11-08T03:39:37.958Z (2 months ago)
- Topics: cloud-environment, hadoop, hbase, infrastructure, zookeeper
- Language: Shell
- Homepage:
- Size: 156 KB
- Stars: 0
- Watchers: 3
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# hadoop-cloud-environ
This repository provides the outline of a standard Hadoop cloud computing cluster, as well as configuration scripts for bootstrapping the environment.
## Components
[Hadoop](https://hadoop.apache.org/) is a collection of open-source software utilities that facilitate using a network of many computers to solve problems involving massive amounts of data and computation. It provides a software framework for distributed storage and processing of big data using the MapReduce programming model.
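As a rough illustration of the MapReduce programming model mentioned above, the following sketch counts words with a plain shell pipeline that mimics the map, shuffle, and reduce phases on a single machine. It does not come from this repository and does not call Hadoop; the function names (`map_words`, `shuffle`, `reduce_counts`, `wordcount`) are hypothetical and chosen only for this example.

```shell
#!/bin/sh
# Local, single-machine imitation of map -> shuffle -> reduce.
# Illustrative only: all function names are hypothetical, and no Hadoop is involved.

# Map: emit one "word<TAB>1" pair per whitespace-separated word (lowercased).
map_words() {
    awk '{ for (i = 1; i <= NF; i++) print tolower($i) "\t1" }'
}

# Shuffle: bring identical keys next to each other by sorting in byte order.
shuffle() {
    LC_ALL=C sort
}

# Reduce: sum the counts of each run of identical keys.
reduce_counts() {
    awk -F '\t' '
        # A new key starts: flush the previous key and its total.
        NR > 1 && ($1 "") != (key "") { print key "\t" sum; sum = 0 }
        { key = $1; sum += $2 }
        END { if (NR > 0) print key "\t" sum }
    '
}

# Full pipeline: reads text on stdin, writes "word<TAB>count" lines on stdout.
wordcount() {
    map_words | shuffle | reduce_counts
}
```

After sourcing the functions, `printf 'the cat the dog\n' | wordcount` prints one tab-separated `word count` line per distinct word. In a real Hadoop job the map and reduce steps run in parallel across many nodes and the framework performs the shuffle; here the pipe and `sort` stand in for that.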
[HBase](https://hbase.apache.org/) is an open-source, non-relational, distributed database modeled after Google's Bigtable and written in Java. It runs on top of HDFS, providing Bigtable-like capabilities for Hadoop.
[ZooKeeper](https://zookeeper.apache.org/) is a centralized service for distributed systems that offers a hierarchical key-value store, which is used to provide a distributed configuration service, synchronization service, and naming registry for large distributed systems.
(*Source: Wikipedia & Apache*)