Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/jaehyeon-kim/emr-on-eks-terraform
Manage EMR on EKS on Terraform
https://github.com/jaehyeon-kim/emr-on-eks-terraform
aws eks emr emroneks terraform
Last synced: about 1 month ago
JSON representation
Manage EMR on EKS on Terraform
- Host: GitHub
- URL: https://github.com/jaehyeon-kim/emr-on-eks-terraform
- Owner: jaehyeon-kim
- Created: 2022-08-17T23:08:41.000Z (over 2 years ago)
- Default Branch: master
- Last Pushed: 2022-08-30T21:41:57.000Z (over 2 years ago)
- Last Synced: 2024-10-30T05:35:14.970Z (3 months ago)
- Topics: aws, eks, emr, emroneks, terraform
- Language: HCL
- Homepage: https://cevo.com.au/post/manage-emr-on-eks-with-terraform/
- Size: 63.5 KB
- Stars: 1
- Watchers: 3
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Manage EMR on EKS with Terraform
**UPDATE**
- tried to set up EMR studio but was not successful.
- use the [wo-studio](https://github.com/jaehyeon-kim/emr-on-eks-terraform/tree/wo-studio) branch.[Manage EMR on EKS with Terraform](https://cevo.com.au/post/manage-emr-on-eks-with-terraform/)
- [Amazon EMR on EKS](https://aws.amazon.com/emr/features/eks/) is a deployment option for Amazon EMR that allows you to automate the provisioning and management of open-source big data frameworks on EKS. While [eksctl](https://eksctl.io/) is popular for working with [Amazon EKS](https://aws.amazon.com/eks/) clusters, it has limitations when it comes to building infrastructure that integrates multiple AWS services. Also it is not straightforward to update EKS cluster resources incrementally with it. On the other hand [Terraform](https://www.terraform.io/) can be an effective tool for managing infrastructure that includes not only EKS and EMR virtual clusters but also other AWS resources. Moreover Terraform has a wide range of [modules](https://www.terraform.io/language/modules) and it can even be simpler to build and manage infrastructure using those compared to the CLI tool. In this post, we’ll discuss how to provision and manage Spark jobs on EMR on EKS with Terraform. [Amazon EKS Blueprints for Terraform](https://aws-ia.github.io/terraform-aws-eks-blueprints/v4.7.0/) will be used for provisioning EKS, EMR virtual cluster and related resources. Also Spark job autoscaling will be managed by [Karpenter](https://karpenter.sh/) where two Spark jobs with and without [Dynamic Resource Allocation (DRA)](https://spark.apache.org/docs/latest/job-scheduling.html#dynamic-resource-allocation) will be compared.