Projects in Awesome Lists tagged with data-contracts
A curated list of projects in awesome lists tagged with data-contracts .
https://github.com/open-metadata/openmetadata
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
data-catalog data-collaboration data-contracts data-discovery data-governance data-lineage data-observability data-profiling data-quality data-quality-checks data-science data-validation datadiscovery dataengineering dataquality dbt hacktoberfest metadata metadata-management snowflake
Last synced: 22 Feb 2026
https://github.com/open-metadata/OpenMetadata
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
data-catalog data-collaboration data-contracts data-discovery data-governance data-lineage data-observability data-profiling data-quality data-quality-checks data-science data-validation datadiscovery dataengineering dataquality dbt hacktoberfest metadata metadata-management snowflake
Last synced: 15 Mar 2025
https://github.com/sodadata/soda-core
:zap: Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
data-contracts data-engineering data-governance data-monitoring data-observability data-profiling data-quality data-quality-checks data-quality-monitoring data-quality-testing data-reliability data-testing data-unit-tests data-validation dataquality datatesting dbt pipeline-testing python snowflake
Last synced: 14 May 2025
https://github.com/bitol-io/open-data-contract-standard
Home of the Open Data Contract Standard (ODCS).
data data-contract data-contracts data-engineering data-mesh data-quality standard
Last synced: 10 Mar 2026
https://github.com/streamdal/streamdal
Code-Native Data Privacy
astrojs data-contracts deno docker event-driven go javascript message-queues nodejs observability python reactjs rust streaming-data tail-f wasi wasm
Last synced: 14 May 2025
https://github.com/rocky-data/rocky
The typed graph between your code and whichever warehouse, table format, or query engine you've chosen — typed compiler, branches, replay, column-level lineage, compile-time contracts, per-model cost. Adapters: Databricks, Snowflake, BigQuery, DuckDB. Single static Rust binary. Apache 2.0.
column-lineage dagster data-contracts data-engineering data-lineage data-pipeline data-platform data-quality dbt-alternative rust schema-drift sql
Last synced: 06 Jun 2026
https://github.com/matsmoll/aligned
The DBT of ML, as Aligned describes data dependencies in ML systems, and reduce technical data debt
ai data-contracts data-lake datacontracts dbt feature-engineering feature-store ml ml-ops mlops
Last synced: 13 Aug 2025
https://github.com/getstrm/pace
Data policy IN, dynamic view OUT: PACE is the Policy As Code Engine. It helps you to programatically create and apply a data policy to a processing platform like Databricks, Snowflake or BigQuery (or plain 'ol Postgres, even!) with definitions imported from Collibra, Datahub, ODD and the like.
bigquery data-catalog data-contracts data-governance data-processing databricks policy-enforcement snowflake
Last synced: 13 Oct 2025
https://github.com/galileo-galilei/kedro-pandera
A kedro plugin to use pandera in your kedro projects
data-contracts data-pipelines data-schemas kedro kedro-plugin pandera pipelines-testing
Last synced: 29 Jun 2025
https://github.com/opendqv/opendqv
Open-source, contract-driven data quality validation. Shift-left enforcement at the point of write — before data enters your pipeline.
data-contracts data-governance data-quality data-validation fastapi mcp open-source python shift-left
Last synced: 25 Apr 2026
https://github.com/flyersworder/agentic-data-contracts
YAML-first, domain-driven data governance for AI agents — teach agents your business domains, metrics, and rules before they write SQL
agent-sdk ai-agents analytics claude data-contracts data-engineering data-governance dbt domain-driven llm pydantic python semantic-layer sql-validation sqlglot yaml
Last synced: 24 Apr 2026
https://github.com/axonops/axonops-schema-registry
High-performance, API-compatible drop-in replacement for Confluent Schema Registry. Avro, Protobuf & JSON Schema with PostgreSQL, MySQL, Cassandra storage. Built-in RBAC, CSFLE encryption, data contracts, audit logging. No Kafka dependency. Apache 2.0.
apache-kafka avro cassandra confluent csfle data-contracts data-governance encryption golang json-schema kafka mysql postgresql protobuf rbac schema-management schema-registry
Last synced: 08 Mar 2026
https://github.com/npow/metaflow-contracts
Catch bad data between Metaflow steps before it corrupts your pipeline
data-contracts data-validation metaflow pipeline pydantic pypi python type-checking
Last synced: 23 May 2026
https://github.com/a-chumagin/soda-contract-poc
PoC for Soda Contracts against Vertica DB
data-contracts data-governance data-quality soda
Last synced: 12 Feb 2026
https://github.com/vim89/flowforge
Let's be honest - most data pipeline frameworks treat types as suggestions. Config files are strings. Schemas are "validated" at runtime. Data quality is an afterthought. So, let's do differently
archetype data data-contracts data-engineering data-pipelines data-quality data-science database dataengineering datapipeline etl etl-framework pipelines scala scalability spark spark-sql spark-streaming
Last synced: 14 Apr 2026
https://github.com/marquesantero/contractforge
Contract-first framework for governed lakehouse ingestion on Delta Lake and Databricks.
data-contracts data-engineering databricks delta-lake ingestion lakehouse openlineage pyspark
Last synced: 18 Jun 2026