An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with data-contracts

A curated list of projects in awesome lists tagged with data-contracts.

https://github.com/open-metadata/openmetadata

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column-level lineage, and seamless team collaboration.

data-catalog data-collaboration data-contracts data-discovery data-governance data-lineage data-observability data-profiling data-quality data-quality-checks data-science data-validation datadiscovery dataengineering dataquality dbt hacktoberfest metadata metadata-management snowflake

Last synced: 22 Feb 2026

https://github.com/rocky-data/rocky

The typed graph between your code and whichever warehouse, table format, or query engine you've chosen — typed compiler, branches, replay, column-level lineage, compile-time contracts, per-model cost. Adapters: Databricks, Snowflake, BigQuery, DuckDB. Single static Rust binary. Apache 2.0.

column-lineage dagster data-contracts data-engineering data-lineage data-pipeline data-platform data-quality dbt-alternative rust schema-drift sql

Last synced: 06 Jun 2026

https://github.com/matsmoll/aligned

The DBT of ML: Aligned describes data dependencies in ML systems and reduces technical data debt.

ai data-contracts data-lake datacontracts dbt feature-engineering feature-store ml ml-ops mlops

Last synced: 13 Aug 2025

https://github.com/getstrm/pace

Data policy IN, dynamic view OUT: PACE is the Policy As Code Engine. It helps you programmatically create and apply a data policy to a processing platform like Databricks, Snowflake or BigQuery (or plain ol' Postgres, even!) with definitions imported from Collibra, Datahub, ODD and the like.

bigquery data-catalog data-contracts data-governance data-processing databricks policy-enforcement snowflake

Last synced: 13 Oct 2025

https://github.com/opendqv/opendqv

Open-source, contract-driven data quality validation. Shift-left enforcement at the point of write — before data enters your pipeline.

data-contracts data-governance data-quality data-validation fastapi mcp open-source python shift-left

Last synced: 25 Apr 2026

https://github.com/flyersworder/agentic-data-contracts

YAML-first, domain-driven data governance for AI agents — teach agents your business domains, metrics, and rules before they write SQL

agent-sdk ai-agents analytics claude data-contracts data-engineering data-governance dbt domain-driven llm pydantic python semantic-layer sql-validation sqlglot yaml

Last synced: 24 Apr 2026

https://github.com/axonops/axonops-schema-registry

High-performance, API-compatible drop-in replacement for Confluent Schema Registry. Avro, Protobuf & JSON Schema with PostgreSQL, MySQL, Cassandra storage. Built-in RBAC, CSFLE encryption, data contracts, audit logging. No Kafka dependency. Apache 2.0.

apache-kafka avro cassandra confluent csfle data-contracts data-governance encryption golang json-schema kafka mysql postgresql protobuf rbac schema-management schema-registry

Last synced: 08 Mar 2026

https://github.com/npow/metaflow-contracts

Catch bad data between Metaflow steps before it corrupts your pipeline

data-contracts data-validation metaflow pipeline pydantic pypi python type-checking

Last synced: 23 May 2026

https://github.com/a-chumagin/soda-contract-poc

PoC for Soda Contracts against Vertica DB

data-contracts data-governance data-quality soda

Last synced: 12 Feb 2026

https://github.com/vim89/flowforge

Let's be honest - most data pipeline frameworks treat types as suggestions. Config files are strings. Schemas are "validated" at runtime. Data quality is an afterthought. So, let's do it differently.

archetype data data-contracts data-engineering data-pipelines data-quality data-science database dataengineering datapipeline etl etl-framework pipelines scala scalability spark spark-sql spark-streaming

Last synced: 14 Apr 2026

https://github.com/marquesantero/contractforge

Contract-first framework for governed lakehouse ingestion on Delta Lake and Databricks.

data-contracts data-engineering databricks delta-lake ingestion lakehouse openlineage pyspark

Last synced: 18 Jun 2026