https://github.com/ev2900/datazone_demo
Prebuilt demo of Amazon DataZone using fake data for Pharmaceutical drug discovery
https://github.com/ev2900/datazone_demo
aws bussiness-data-catalog data-catalog datazone
Last synced: 23 days ago
JSON representation
Prebuilt demo of Amazon DataZone using fake data for Pharmaceutical drug discovery
- Host: GitHub
- URL: https://github.com/ev2900/datazone_demo
- Owner: ev2900
- Created: 2023-11-06T21:43:41.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2025-03-18T03:23:02.000Z (about 2 months ago)
- Last Synced: 2025-04-10T03:13:57.186Z (23 days ago)
- Topics: aws, bussiness-data-catalog, data-catalog, datazone
- Language: Python
- Homepage:
- Size: 420 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# DataZone Demonstration
![]()
![]()
This repository provides an easy deployment to set up an environment for demo'ing Amazon DataZone.
The instructions will deploy the architecture depicted in this diagram
You can use this architecture to learn and demonstrate publishing, subscribing workflows and other aspects of DataZone.
The sample data used for this demo is fake data that represents a few data sets that may be used by a Pharmaceutical company during drug development research and clinical trials.
## Instructions to deploy the demo an in AWS account
1. Launch the CloudFormation stack[](https://console.aws.amazon.com/cloudformation/home#/stacks/new?stackName=data-zone&templateURL=https://sharkech-public.s3.amazonaws.com/misc-public/0_datazone_cloudformation.yaml)
2. Run the following from the terminal of the [Cloud9](https://us-east-1.console.aws.amazon.com/cloud9control/home) environment that was deployed by the CloudFormation stack
```pip install boto3```
```python DataZone_Demo/1_lakeformation_s3_configuration.py```
3. Deploy a DataZone domain from the AWS console
* Navigate to the [DataZone](https://us-east-1.console.aws.amazon.com/datazone/home) home page and click on **Create domain**
* Provide a name for the domain
* Select the check mark next to the *Set-up this account for data consumption and publishing* under the Quick setup section
* Click on **Create domain**
4. Update the ```datazone_domain_id``` variable in [2_dataZone_configuration.py](https://github.com/ev2900/DataZone_Demo/blob/main/2_dataZone_configuration.py) file and [3_dataZone_configuration.py](https://github.com/ev2900/DataZone_Demo/blob/main/3_dataZone_configuration.py)
To find the domain id of the DataZone domain you just deployed look at the URL for the DataZone portal
For example if the URL is https//dzd_498d049z6o1gkn.datazone.us-east-1.on.aws the domain id is dzd_498d049z6o1gkn
Once you update the variables with the domain id **save the file**
6. Run the following from the terminal of the [Cloud9](https://us-east-1.console.aws.amazon.com/cloud9control/home) environment
```python DataZone_Demo/2_dataZone_configuration.py```
```python DataZone_Demo/3_dataZone_configuration.py```