https://github.com/cds-snc/data-lake
Infrastructure for the Platform Data Lake
https://github.com/cds-snc/data-lake
aws data-lake terraform
Last synced: 5 months ago
JSON representation
Infrastructure for the Platform Data Lake
- Host: GitHub
- URL: https://github.com/cds-snc/data-lake
- Owner: cds-snc
- License: mit
- Created: 2024-10-30T13:30:43.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2026-02-09T14:44:29.000Z (5 months ago)
- Last Synced: 2026-02-09T19:01:18.364Z (5 months ago)
- Topics: aws, data-lake, terraform
- Language: Python
- Homepage:
- Size: 1.59 MB
- Stars: 3
- Watchers: 4
- Forks: 0
- Open Issues: 7
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- License: LICENSE
- Codeowners: .github/CODEOWNERS
Awesome Lists containing this project
README
# Data Lake
This repository holds the Terraform and AWS Glue jobs that manage the Platform Data Lake.
You can read more about the datasets below:
- [Data catalog](./docs/data/catalog)
- [Data pipelines](./docs/data/pipelines/)
To learn how to add a new dataset, read our [onboarding guide](./docs/data/onboarding.md).