https://github.com/cds-snc/data-lake
Infrastructure for the Platform Data Lake
https://github.com/cds-snc/data-lake
aws data-lake terraform
Last synced: 9 months ago
JSON representation
Infrastructure for the Platform Data Lake
- Host: GitHub
- URL: https://github.com/cds-snc/data-lake
- Owner: cds-snc
- License: mit
- Created: 2024-10-30T13:30:43.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2025-04-09T13:27:24.000Z (9 months ago)
- Last Synced: 2025-04-09T13:40:38.064Z (9 months ago)
- Topics: aws, data-lake, terraform
- Language: Python
- Homepage:
- Size: 443 KB
- Stars: 2
- Watchers: 4
- Forks: 0
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Data Lake
This repository holds the Terraform and AWS Glue jobs that manage the Platform Data Lake.
You can read more about the datasets below:
- [Data catalog](./docs/data/catalog)
- [Data pipelines](./docs/data/pipelines/)
To learn how to add a new dataset, read our [onboarding guide](./docs/data/onboarding.md).