https://github.com/googlecloudplatform/bigquery-data-lineage
Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.
bigdata bigquery data-catalog data-governance data-lineage data-management dataflow zetasql
Last synced: 3 months ago
- Host: GitHub
- URL: https://github.com/googlecloudplatform/bigquery-data-lineage
- Owner: GoogleCloudPlatform
- License: apache-2.0
- Archived: true
- Created: 2020-06-09T05:52:32.000Z (almost 5 years ago)
- Default Branch: master
- Last Pushed: 2024-06-03T01:17:51.000Z (11 months ago)
- Last Synced: 2025-01-22T07:36:34.745Z (3 months ago)
- Topics: bigdata, bigquery, data-catalog, data-governance, data-lineage, data-management, dataflow, zetasql
- Language: Java
- Homepage: https://cloud.google.com/blog/products/data-analytics/architecting-a-data-lineage-system-for-bigquery
- Size: 356 KB
- Stars: 143
- Watchers: 17
- Forks: 40
- Open Issues: 20
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Authors: AUTHORS
README

Learn more about [Data lineage for a data warehouse](https://cloud.google.com/solutions/architecture-concept-data-lineage-systems-in-a-data-warehouse).

Refer to the following instructions for deployment:
* [Data lineage pipeline for BigQuery](documents/bigquery_lineage_pipeline.md)
* [Automatic policy tag cascading based on data lineage pipeline](documents/cascade_bq_table_acl_pipeline.md)
## Disclaimer
**License**: Apache 2.0

This is not an official Google product.