An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with data-observability

A curated list of projects in awesome lists tagged with data-observability .

https://github.com/open-metadata/openmetadata

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

data-catalog data-collaboration data-contracts data-discovery data-governance data-lineage data-observability data-profiling data-quality data-quality-checks data-science data-validation datadiscovery dataengineering dataquality dbt hacktoberfest metadata metadata-management snowflake

Last synced: 22 Feb 2026

https://github.com/open-metadata/OpenMetadata

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

data-catalog data-collaboration data-contracts data-discovery data-governance data-lineage data-observability data-profiling data-quality data-quality-checks data-science data-validation datadiscovery dataengineering dataquality dbt hacktoberfest metadata metadata-management snowflake

Last synced: 15 Mar 2025

https://github.com/elementary-data/elementary

The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.

analytics-engineer bigquery data-analysis data-governance data-lineage data-observability data-pipeline data-pipelines data-reliability data-warehouse dataops dbt dbt-artifacts dbt-packages lineage redshift snowflake

Last synced: 19 May 2026

https://github.com/marmotdata/marmot

Marmot helps teams discover, understand, and leverage their data with powerful search and lineage visualisation tools. It's designed to make data accessible for everyone.

bigdata data-catalog data-collaboration data-discovery data-exploration data-governance data-lineage data-observability datacatalog datadiscovery dataengineering lineage mcp mcp-server metadata

Last synced: 09 Apr 2026

https://github.com/elementary-data/dbt-data-reliability

dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.

analytics analytics-engineering data data-lineage data-observability data-pipeline-monitoring data-pipelines data-reliability dbt dbt-artifacts dbt-packages dbt-tests

Last synced: 16 May 2025

https://github.com/dqops/dqo

Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observability. Configure data quality checks from the UI or in YAML files, let DQOps run the data quality checks daily to detect data quality issues.

data-observability data-ops data-profiling data-quality data-quality-checks data-quality-measurement data-quality-monitoring data-quality-report monitoring

Last synced: 13 Dec 2025

https://github.com/DataKitchen/data-observability-installer

Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team is the first to know and the first to solve with visibility across and down your data estate. Save time with simple, fast data quality test generation and execution. Trust your data, tools, and systems end to end.

data data-engineering data-observability data-profiling data-quality data-reliability data-science datachecker datacleaner datacleaning dataops dataquality datatesting datavalidation mssql pipeline-tests postgresql redshift self-hosted snowflake

Last synced: 05 May 2025

https://github.com/datakitchen/data-observability-installer

Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team is the first to know and the first to solve with visibility across and down your data estate. Save time with simple, fast data quality test generation and execution. Trust your data, tools, and systems end to end.

data data-engineering data-observability data-profiling data-quality data-reliability data-science datachecker datacleaner datacleaning dataops dataquality datatesting datavalidation mssql pipeline-tests postgresql redshift self-hosted snowflake

Last synced: 06 Apr 2026

https://github.com/re-data/dbt-re-data

re_data - fix data issues before your users & CEO would discover them 😊

data-monitoring data-observability data-quality data-testing dbt dbt-packages sql

Last synced: 07 Apr 2025

https://github.com/monte-carlo-data/mc-agent-toolkit

Official Monte Carlo toolkit for AI coding agents. Skills and plugins that bring data and agent observability β€” monitoring, triaging, troubleshooting, health checks β€” into Claude Code, Cursor, and more.

agent-observability agent-skills ai-agents claude-code codex-skills cursor data-observability data-quality mcp monte-carlo opencode skill-md skillsmp vscode

Last synced: 20 Apr 2026

https://github.com/oslabs-beta/DataDoc

Endpoint downtime detection, monitoring, and traffic simulation developer tool

data-observability devtool express-js monitoring-tool traffic-simulation

Last synced: 06 Aug 2025

https://github.com/oslabs-beta/datadoc

Endpoint downtime detection, monitoring, and traffic simulation developer tool

data-observability devtool express-js monitoring-tool traffic-simulation

Last synced: 02 May 2025

https://github.com/sodadata/soda-spark

Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes

data-engineering data-observability data-quality data-testing pyspark python soda-sql spark

Last synced: 26 Jul 2025

https://github.com/datakitchen/dataops-testgen

DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data quality test generation and execution by data profiling, Β new dataset hygiene review, AI generation of data quality validation tests, ongoing testing of data refreshes, & continuous anomaly monitoring

data data-engineering data-observability data-quality data-science data-testing datachecker dataops dataprofiling dataquality datavalidation mssql postgresql python redshift self-hosted snowflake

Last synced: 25 Feb 2026

https://github.com/datakitchen/dataops-observability

DataOps Observability is part of DataKitchen's Open Source Data Observability. DataOps Observability monitors every data journey from data source to customer value, from any team development environment into production, across every tool, team, environment, and customer so that problems are detected, localized, and understood immediately.

data data-engineering data-observability data-science dataops pipleine-monitoring

Last synced: 01 Apr 2026

https://github.com/montara-io/dbt-command-center

Never sift through endless dbtβ„’ logs again. dbt Command Center is a free, open-source, local web application that provides a user-friendly interface to monitor and manage dbt runs.

analytics-engineering bigquery data-analysis data-catalog data-engineering data-lineage data-observability data-pipeline data-pipelines data-validation data-warehouse dataops dbt dbt-packages elt etl orchestration python redshift

Last synced: 05 May 2025

https://github.com/datakitchen/dataops-observability-agents

DataOps Observability Integration Agents are part of DataKitchen's Open Source Data Observability. They connect to various ETL, ELT, BI, data science, data visualization, data governance, and data analytic tools. They provide logs, messages, metrics, overall run-time start/stop, subtask status, and scheduling information to DataOps Observability.

data-engineering data-observability integrations monitoring-plugins

Last synced: 09 Apr 2025

https://github.com/dynatrace-oss/dynatrace-snowflake-observability-agent

A tool that streams selected Snowflake telemetry to the Dynatrace API, enabling enhanced data platform observability through Dynatrace dashboards, workflows, and anomaly detection.

data-observability dynatrace-api snowflake telemetry-collection

Last synced: 11 Feb 2026

https://github.com/datasphere-oss/datasphere

DataSphere is the first open-source cloud-native data observability platform that helps you trace the whole data infrastructure in your warehouses, lakes and databases.

cloud-native daas data-analytics data-governance data-lake data-management data-observability datamesh datasphere warehouse

Last synced: 14 Jul 2025

https://github.com/jbris/openmetadata-test

Testing a Docker deployment of OpenMetadata for S3 data ingestion

data-lineage data-observability minio minio-docker openmetadata openmetadata-docker s3

Last synced: 18 Feb 2026

https://github.com/cgnorthcutt/reliablity_framework_for_rag

Demo showing how the Trustworthy Language Model add reliability to LLM outputs and improves RAG, agents, and data enrichment worfklows. can be used to improve fine-tuning of LLMs, accuracy of LLM outputs, and smart routing for RAG and agents.

chatgpt data-cleaning data-curation data-observability data-quality llms observability rag

Last synced: 29 Jul 2025

https://github.com/jaimeloeuf/jevents

A simple to use EventEmitter and Data-Observer python package.

data-observability event-driven-programming eventemitter events python

Last synced: 29 Jun 2025

https://github.com/rbmuller/scherlok

A detective for your data. Zero-config data quality monitoring β€” works with dbt, Postgres, BigQuery, Snowflake. No YAML.

anomaly-detection bigquery cli data-engineering data-observability data-quality dbt etl monitoring open-source postgres postgresql python snowflake

Last synced: 15 May 2026

https://github.com/jbris/marquez-test

Testing a Docker deployment of Marquez and OpenLineage

data-lineage data-observability marquez marquez-docker observability openlineage openlineage-docker

Last synced: 06 Feb 2026

https://github.com/sachinvarghese/pgamber

Data observability for postgreSQL using alibi-detect

data-observability outlier-detection postgresql

Last synced: 22 Apr 2026