An open API service indexing awesome lists of open source software.

https://github.com/oracle-quickstart/oci-hadoop

Terraform module to deploy Hadoop on Oracle Cloud Infrastructure (OCI)
https://github.com/oracle-quickstart/oci-hadoop

cloud hadoop oci oracle oracle-led terraform

Last synced: 8 months ago
JSON representation

Terraform module to deploy Hadoop on Oracle Cloud Infrastructure (OCI)

Awesome Lists containing this project

README

          

# oci-hadoop
Apache Hadoop on OCI

## Deployment
This template uses [Oracle Resource Manager](https://docs.oracle.com/en-us/iaas/Content/ResourceManager/Concepts/resourcemanager.htm) to deploy an [Apache Hadoop](http://hadoop.apache.org) cluster on OCI. The schema driven install will guide you through customization options available prior to deployment.

Simply click the Deploy to OCI button here to initiate deployment of the stack.

[![Deploy to Oracle Cloud](https://oci-resourcemanager-plugin.plugins.oci.oraclecloud.com/latest/deploy-to-oracle-cloud.svg)](https://cloud.oracle.com/resourcemanager/stacks/create?region=home&zipUrl=https://github.com/oracle-quickstart/oci-hadoop/archive/1.0.10.zip)

## Architecture
Here is a diagram of the deployment architecture when creating a VCN as part of the stack deployment:

![Deployment Architecture Diagram](images/deployment_architecture.png)

## Hadoop Components
Cluster service topology is as follows:
* Master 1 is NameNode and JobHistory server
* Master 2 is ResourceManager, Timeline, Proxy and History Server
* All Master hosts are ZooKeeper servers

Edge node has client libraries deployed in .bashrc for root user. Cluster hosts have deployment logs in /var/log/OCI-Hadoop.log which can be used to triage any deployment issues.