Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/wittline/pyspark-on-aws-emr
This project offers a ready-to-use AWS EMR template that combines Spot Fleet and On-Demand Instances, so you can focus on writing PySpark code.
aws aws-emr big-data big-data-analytics dataengineering ec2-spot ec2-spot-instances emr-cluster pyspark python spark wordcloud-generator
Last synced: about 1 month ago
- Host: GitHub
- URL: https://github.com/wittline/pyspark-on-aws-emr
- Owner: Wittline
- License: apache-2.0
- Created: 2021-02-18T20:24:29.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2022-06-13T15:16:50.000Z (over 2 years ago)
- Last Synced: 2024-05-02T04:05:14.918Z (7 months ago)
- Topics: aws, aws-emr, big-data, big-data-analytics, dataengineering, ec2-spot, ec2-spot-instances, emr-cluster, pyspark, python, spark, wordcloud-generator
- Language: Python
- Homepage: https://wittline.github.io/pyspark-on-aws-emr/
- Size: 3.61 MB
- Stars: 24
- Watchers: 4
- Forks: 13
- Open Issues: 0