Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/garystafford/emr-demo
Project files for the post: Running PySpark Applications on Amazon EMR: Methods for Interacting with PySpark on Amazon Elastic MapReduce.
https://github.com/garystafford/emr-demo
amazon-emr aws elastic-map-reduce emr-demo pyspark spark
Last synced: about 2 months ago
JSON representation
Project files for the post: Running PySpark Applications on Amazon EMR: Methods for Interacting with PySpark on Amazon Elastic MapReduce.
- Host: GitHub
- URL: https://github.com/garystafford/emr-demo
- Owner: garystafford
- License: apache-2.0
- Created: 2020-12-02T03:41:05.000Z (about 4 years ago)
- Default Branch: main
- Last Pushed: 2022-09-01T19:50:43.000Z (over 2 years ago)
- Last Synced: 2023-08-05T02:22:51.632Z (over 1 year ago)
- Topics: amazon-emr, aws, elastic-map-reduce, emr-demo, pyspark, spark
- Language: Python
- Homepage:
- Size: 691 KB
- Stars: 37
- Watchers: 5
- Forks: 17
- Open Issues: 3
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# PySpark on Amazon EMR Demo
## Overview
Project files for the post, [Running PySpark Applications on Amazon EMR: Methods for Interacting with PySpark on Amazon Elastic MapReduce](https://garystafford.medium.com/running-pyspark-applications-on-amazon-emr-e536b7a865ca). Please see post for complete instructions on using the project's files.
## Architecture
### AWS CloudFormation Stack Creation
![Architecture](images/CFN_Architecture.png)
### Data Analytics Platform![Architecture](images/Workflow_Architecture.png)