https://github.com/landerox/cloud-landerox-data
Reference architecture baseline for GCP data platforms (Apache Beam, BigQuery, Cloud Functions, Pub/Sub). Hybrid warehouse/lakehouse with batch + streaming, Medallion layering. Consumed by private runtime repos.
https://github.com/landerox/cloud-landerox-data
apache-beam batch-processing bigquery cloud-functions cloud-storage data-engineering data-platform dataform gcp google-cloud-dataflow iceberg lakehouse medallion-architecture opentelemetry pubsub python reference-architecture slsa streaming supply-chain-security
Last synced: 5 days ago
JSON representation
Reference architecture baseline for GCP data platforms (Apache Beam, BigQuery, Cloud Functions, Pub/Sub). Hybrid warehouse/lakehouse with batch + streaming, Medallion layering. Consumed by private runtime repos.
- Host: GitHub
- URL: https://github.com/landerox/cloud-landerox-data
- Owner: landerox
- License: mit
- Created: 2026-01-09T20:20:15.000Z (5 months ago)
- Default Branch: main
- Last Pushed: 2026-05-20T03:12:23.000Z (6 days ago)
- Last Synced: 2026-05-20T06:05:17.206Z (5 days ago)
- Topics: apache-beam, batch-processing, bigquery, cloud-functions, cloud-storage, data-engineering, data-platform, dataform, gcp, google-cloud-dataflow, iceberg, lakehouse, medallion-architecture, opentelemetry, pubsub, python, reference-architecture, slsa, streaming, supply-chain-security
- Language: Python
- Homepage: https://landerox.com/projects/cloud-data
- Size: 245 KB
- Stars: 1
- Watchers: 0
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- Contributing: .github/CONTRIBUTING.md
- License: LICENSE
- Code of conduct: .github/CODE_OF_CONDUCT.md
- Codeowners: .github/CODEOWNERS
- Security: .github/SECURITY.md
- Agents: AGENTS.md
Awesome Lists containing this project
README
# cloud-landerox-data
[](https://github.com/landerox/cloud-landerox-data/actions/workflows/lint.yml)
[](https://github.com/landerox/cloud-landerox-data/actions/workflows/codeql.yml)
[](https://codecov.io/gh/landerox/cloud-landerox-data)
[](https://scorecard.dev/viewer/?uri=github.com/landerox/cloud-landerox-data)
[](https://www.bestpractices.dev/projects/12905)
[](https://www.bestpractices.dev/projects/12905)
[](https://slsa.dev)
[](https://www.python.org/downloads/release/python-3130/)
[](https://github.com/microsoft/pyright)
[](https://github.com/astral-sh/ruff)
[](https://www.conventionalcommits.org)
[](LICENSE)
[](https://cloud.google.com/)
> **Reference architecture baseline for GCP data platforms** — hybrid
> warehouse/lakehouse, batch + streaming, Medallion layering.
> Consumed by private runtime repos.
Welcome to the `cloud-landerox-data` repository. This project serves as a comprehensive architecture baseline for Google Cloud Platform (GCP) data platforms. It provides structural guidance, engineering standards, and folder placeholders designed to be consumed by private runtime repositories.
Please note that this repository is **not** a production runtime environment, a Terraform infrastructure repository, or a Data Mesh framework. Instead, it is a pragmatic, hybrid baseline that intelligently selects patterns based on data source, Service Level Agreements (SLAs), and cost profiles. For a deep dive into our decision-making model, please review our [Architecture Stance](docs/architecture.md).
## Status
The project is currently in a **pre-1.0** state with active quality and supply-chain gates on every PR and push: ruff, pyright (strict), pytest with coverage (≥90% on `shared/**`), `hypothesis` property tests, nightly `mutmut`; CodeQL, Semgrep, Trivy, pip-audit, license allowlist, gitleaks, zizmor, validate-pyproject, lychee link check, and DCO sign-off enforced locally and in CI. OpenSSF Scorecard runs weekly. Every release attaches a CycloneDX SBOM and a SLSA L3 build provenance attestation. Dependabot regular and security updates are both enabled. By design, no runtime code is deployed from this repository.
To see how this baseline is applied in practice, check out our minimal public runtime example at [landerox/cloud-landerox-runtime-example](https://github.com/landerox/cloud-landerox-runtime-example).
## Getting started
### Prerequisites
- Python 3.13
- [`mise`](https://mise.jdx.dev/getting-started.html) — manages `just`,
`lychee`, and other non-Python tools via `mise.toml`
- [`uv`](https://docs.astral.sh/uv/)
### Setup
```bash
mise install
just sync
just pre-commit-install
```
### Quality checks
```bash
just lint
just type
just test
```
## Learn more
- [Documentation index](docs/README.md) — full catalogue of architecture, guides, diagrams, and blueprints
- [Architecture stance](docs/architecture.md)
- [Architecture decisions (ADRs)](docs/adr/README.md)
- [CI/CD guide](docs/cicd.md)
- [Governance](GOVERNANCE.md)
- [Roadmap](ROADMAP.md)
- [Contributing](.github/CONTRIBUTING.md)
- [Security policy](.github/SECURITY.md)
- [Changelog](CHANGELOG.md)
## License
This project is licensed under the terms found in the [LICENSE](LICENSE) file.