https://github.com/garystafford/emr-superset-demo
Project files for the post: Installing Apache Superset on Amazon EMR: Add data exploration and visualization to your analytics cluster.
amazon-emr apache-superset aws superset
- Host: GitHub
- URL: https://github.com/garystafford/emr-superset-demo
- Owner: garystafford
- License: apache-2.0
- Created: 2020-12-17T17:12:53.000Z (almost 5 years ago)
- Default Branch: main
- Last Pushed: 2020-12-29T13:15:02.000Z (almost 5 years ago)
- Last Synced: 2025-09-14T04:03:33.012Z (about 1 month ago)
- Topics: amazon-emr, apache-superset, aws, superset
- Language: Python
- Homepage:
- Size: 31.3 KB
- Stars: 1
- Watchers: 2
- Forks: 1
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# Installing Apache Superset on Amazon EMR
## Overview
Add data exploration and visualization to your analytics cluster. This repository contains the project files for the post, [Installing Apache Superset on Amazon EMR: Add data exploration and visualization to your analytics cluster](https://garystafford.medium.com/installing-apache-superset-on-amazon-emr-5e2444f6d242). Please see the post for complete instructions on using the project's files.
### Create CloudFormation Stack
```shell script
python3 ./create_cfn_stack.py \
--ec2-key-name <your-ec2-key-name> \
--ec2-subnet-id <your-ec2-subnet-id> \
--environment dev
```
### Run Superset Bootstrap Script
```shell script
python3 ./install_superset.py \
--ec2-key-path <path-to-your-ec2-key-file> \
--superset-port 8280
```
### SSH Tunnel
Open an SSH tunnel to the master node using dynamic port forwarding.
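Dynamic port forwarding makes `ssh` act as a local SOCKS proxy, so a browser configured to use that proxy can reach web UIs on the cluster. As a minimal sketch of how the command is put together, the Python snippet below assembles the argument list piece by piece. The function name `build_tunnel_command`, the key path, and the host name are hypothetical placeholders, not part of this project; the port `8157` and the `hadoop` user are taken from the command in this section.

```python
import shlex


def build_tunnel_command(key_path, master_dns, socks_port=8157, user="hadoop"):
    """Return the ssh argument list for a SOCKS proxy via dynamic port forwarding."""
    return [
        "ssh",
        "-i", key_path,         # private key file for the EC2 key pair
        "-N",                   # do not run a remote command, only forward
        "-D", str(socks_port),  # listen locally on this port as a SOCKS proxy
        f"{user}@{master_dns}",
    ]


# Hypothetical values, for illustration only.
cmd = build_tunnel_command("/path/to/key.pem", "master-node.example.com")
print(shlex.join(cmd))
```

The list can be passed to `subprocess.run(cmd)` to open the tunnel from Python. The shell form of the same command follows.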
```shell script
ssh -i <path-to-your-ec2-key-file> -ND 8157 hadoop@<master-node-public-dns>
```
### Troubleshooting Superset
Troubleshoot the Superset process running on the EMR master node.
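One quick first check is whether anything accepts TCP connections on the Superset port at all. The sketch below is a hedged illustration rather than part of the project: the helper name `port_is_open` is hypothetical, and it assumes Superset was started on port `8280`, the value passed to `--superset-port` above, and that the check is run on the master node itself.

```python
import socket


def port_is_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False


if __name__ == "__main__":
    # 8280 is an assumption taken from the bootstrap command's --superset-port.
    print("Superset port open:", port_is_open("localhost", 8280))
```

If the port is open, `lsof` shows which process is holding it: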
```shell script
lsof -i :8280
```
## References
- https://superset.apache.org/docs/installation/installing-superset-from-scratch
- https://gitmemory.com/issue/apache/incubator-superset/8169/528679887
- https://stackoverflow.com/questions/59195394/apache-superset-config-py-on