Projects in Awesome Lists tagged with data-observability
A curated list of projects in awesome lists tagged with data-observability.
https://github.com/open-metadata/openmetadata
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
data-catalog data-collaboration data-contracts data-discovery data-governance data-lineage data-observability data-profiling data-quality data-quality-checks data-science data-validation datadiscovery dataengineering dataquality dbt hacktoberfest metadata metadata-management snowflake
Last synced: 22 Feb 2026
https://github.com/open-metadata/OpenMetadata
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
data-catalog data-collaboration data-contracts data-discovery data-governance data-lineage data-observability data-profiling data-quality data-quality-checks data-science data-validation datadiscovery dataengineering dataquality dbt hacktoberfest metadata metadata-management snowflake
Last synced: 15 Mar 2025
https://github.com/sodadata/soda-core
:zap: Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
data-contracts data-engineering data-governance data-monitoring data-observability data-profiling data-quality data-quality-checks data-quality-monitoring data-quality-testing data-reliability data-testing data-unit-tests data-validation dataquality datatesting dbt pipeline-testing python snowflake
Last synced: 14 May 2025
https://github.com/elementary-data/elementary
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
analytics-engineer bigquery data-analysis data-governance data-lineage data-observability data-pipeline data-pipelines data-reliability data-warehouse dataops dbt dbt-artifacts dbt-packages lineage redshift snowflake
Last synced: 19 May 2026
https://github.com/re-data/re-data
re_data - fix data issues before your users & CEO would discover them
data-analysis data-monitoring data-observability data-quality data-quality-checks data-quality-monitoring data-reliability data-testing dataquality dbt dbt-packages open-source-tooling
Last synced: 14 May 2025
https://github.com/opendatadiscovery/odd-platform
First open-source data discovery and observability platform. We make life easy for data practitioners so you can focus on your business.
alerting bigdata data-catalog data-discovery data-engineering data-exploration data-governance data-lineage data-observability data-pipelines data-platform data-profiling data-quality data-science datacatalog lineage metadata metadata-management observability oss
Last synced: 02 Apr 2026
https://github.com/marmotdata/marmot
Marmot helps teams discover, understand, and leverage their data with powerful search and lineage visualisation tools. It's designed to make data accessible for everyone.
bigdata data-catalog data-collaboration data-discovery data-exploration data-governance data-lineage data-observability datacatalog datadiscovery dataengineering lineage mcp mcp-server metadata
Last synced: 09 Apr 2026
https://github.com/infuseai/piperider
Code review for data in dbt
code-review continuous-integration data-exploration data-observability data-pipeline data-profiler data-profiling data-quality data-reliability data-science data-testing data-visualization dbt dbt-metrics eda exploratory-data-analysis pull-requests python reporting
Last synced: 10 Apr 2025
https://github.com/InfuseAI/piperider
Code review for data in dbt
code-review continuous-integration data-exploration data-observability data-pipeline data-profiler data-profiling data-quality data-reliability data-science data-testing data-visualization dbt dbt-metrics eda exploratory-data-analysis pull-requests python reporting
Last synced: 18 Apr 2025
https://github.com/elementary-data/dbt-data-reliability
dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
analytics analytics-engineering data data-lineage data-observability data-pipeline-monitoring data-pipelines data-reliability dbt dbt-artifacts dbt-packages dbt-tests
Last synced: 16 May 2025
https://github.com/data-drift/data-drift
Metrics Observability & Troubleshooting
analytics bigquery context data-diffing data-governance data-lineage data-monitoring data-observability data-quality data-reliability data-version-control dbt dbt-metrics dbt-packages drill-down metrics reconciliation redshift semantic-layer snowflake
Last synced: 08 Oct 2025
https://github.com/datachecks/dcs-core
Open Source Data Quality Monitoring.
data-engineering data-governance data-observability data-ops data-quality-monitor data-quality-monitoring data-validation database dataops dataquality elasticsearch etl metrics mlops monitoring mysql postgres postgresql python sql
Last synced: 03 Mar 2026
https://github.com/dqops/dqo
Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observability. Configure data quality checks from the UI or in YAML files, let DQOps run the data quality checks daily to detect data quality issues.
data-observability data-ops data-profiling data-quality data-quality-checks data-quality-measurement data-quality-monitoring data-quality-report monitoring
Last synced: 13 Dec 2025
https://github.com/DataKitchen/data-observability-installer
Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team is the first to know and the first to solve with visibility across and down your data estate. Save time with simple, fast data quality test generation and execution. Trust your data, tools, and systems end to end.
data data-engineering data-observability data-profiling data-quality data-reliability data-science datachecker datacleaner datacleaning dataops dataquality datatesting datavalidation mssql pipeline-tests postgresql redshift self-hosted snowflake
Last synced: 05 May 2025
https://github.com/datakitchen/data-observability-installer
Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team is the first to know and the first to solve with visibility across and down your data estate. Save time with simple, fast data quality test generation and execution. Trust your data, tools, and systems end to end.
data data-engineering data-observability data-profiling data-quality data-reliability data-science datachecker datacleaner datacleaning dataops dataquality datatesting datavalidation mssql pipeline-tests postgresql redshift self-hosted snowflake
Last synced: 06 Apr 2026
https://github.com/re-data/dbt-re-data
re_data - fix data issues before your users & CEO would discover them
data-monitoring data-observability data-quality data-testing dbt dbt-packages sql
Last synced: 07 Apr 2025
https://github.com/monte-carlo-data/mc-agent-toolkit
Official Monte Carlo toolkit for AI coding agents. Skills and plugins that bring data and agent observability (monitoring, triaging, troubleshooting, health checks) into Claude Code, Cursor, and more.
agent-observability agent-skills ai-agents claude-code codex-skills cursor data-observability data-quality mcp monte-carlo opencode skill-md skillsmp vscode
Last synced: 20 Apr 2026
https://github.com/oslabs-beta/DataDoc
Endpoint downtime detection, monitoring, and traffic simulation developer tool
data-observability devtool express-js monitoring-tool traffic-simulation
Last synced: 06 Aug 2025
https://github.com/oslabs-beta/datadoc
Endpoint downtime detection, monitoring, and traffic simulation developer tool
data-observability devtool express-js monitoring-tool traffic-simulation
Last synced: 02 May 2025
https://github.com/sodadata/soda-spark
Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes
data-engineering data-observability data-quality data-testing pyspark python soda-sql spark
Last synced: 26 Jul 2025
https://github.com/datakitchen/dataops-testgen
DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data quality test generation and execution by data profiling, new dataset hygiene review, AI generation of data quality validation tests, ongoing testing of data refreshes, & continuous anomaly monitoring.
data data-engineering data-observability data-quality data-science data-testing datachecker dataops dataprofiling dataquality datavalidation mssql postgresql python redshift self-hosted snowflake
Last synced: 25 Feb 2026
https://github.com/datakitchen/dataops-observability
DataOps Observability is part of DataKitchen's Open Source Data Observability. DataOps Observability monitors every data journey from data source to customer value, from any team development environment into production, across every tool, team, environment, and customer so that problems are detected, localized, and understood immediately.
data data-engineering data-observability data-science dataops pipleine-monitoring
Last synced: 01 Apr 2026
https://github.com/montara-io/dbt-command-center
Never sift through endless dbt™ logs again. dbt Command Center is a free, open-source, local web application that provides a user-friendly interface to monitor and manage dbt runs.
analytics-engineering bigquery data-analysis data-catalog data-engineering data-lineage data-observability data-pipeline data-pipelines data-validation data-warehouse dataops dbt dbt-packages elt etl orchestration python redshift
Last synced: 05 May 2025
https://github.com/datakitchen/dataops-observability-agents
DataOps Observability Integration Agents are part of DataKitchen's Open Source Data Observability. They connect to various ETL, ELT, BI, data science, data visualization, data governance, and data analytic tools. They provide logs, messages, metrics, overall run-time start/stop, subtask status, and scheduling information to DataOps Observability.
data-engineering data-observability integrations monitoring-plugins
Last synced: 09 Apr 2025
https://github.com/open-metadata/openmetadata-site
Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
automation bigdata bigdataanalytics data-catalog data-discovery data-observability data-profiling data-quality-monitoring data-science datadiscovery dataengineering dataquality datascience dbt governance hacktoberfest hacktoberfest2022 metadata metadata-api metadata-management
Last synced: 14 Apr 2025
https://github.com/opendatadiscovery/odd-collectors
data-catalog data-governance data-observability
Last synced: 12 Apr 2025
https://github.com/dynatrace-oss/dynatrace-snowflake-observability-agent
A tool that streams selected Snowflake telemetry to the Dynatrace API, enabling enhanced data platform observability through Dynatrace dashboards, workflows, and anomaly detection.
data-observability dynatrace-api snowflake telemetry-collection
Last synced: 11 Feb 2026
https://github.com/datasphere-oss/datasphere
DataSphere is the first open-source cloud-native data observability platform that helps you trace the whole data infrastructure in your warehouses, lakes and databases.
cloud-native daas data-analytics data-governance data-lake data-management data-observability datamesh datasphere warehouse
Last synced: 14 Jul 2025
https://github.com/jbris/openmetadata-test
Testing a Docker deployment of OpenMetadata for S3 data ingestion
data-lineage data-observability minio minio-docker openmetadata openmetadata-docker s3
Last synced: 18 Feb 2026
https://github.com/cgnorthcutt/reliablity_framework_for_rag
Demo showing how the Trustworthy Language Model adds reliability to LLM outputs and improves RAG, agents, and data enrichment workflows. It can be used to improve fine-tuning of LLMs, accuracy of LLM outputs, and smart routing for RAG and agents.
chatgpt data-cleaning data-curation data-observability data-quality llms observability rag
Last synced: 29 Jul 2025
https://github.com/jaimeloeuf/jevents
A simple to use EventEmitter and Data-Observer python package.
data-observability event-driven-programming eventemitter events python
Last synced: 29 Jun 2025
https://github.com/rbmuller/scherlok
A detective for your data. Zero-config data quality monitoring. Works with dbt, Postgres, BigQuery, Snowflake. No YAML.
anomaly-detection bigquery cli data-engineering data-observability data-quality dbt etl monitoring open-source postgres postgresql python snowflake
Last synced: 15 May 2026
https://github.com/jbris/marquez-test
Testing a Docker deployment of Marquez and OpenLineage
data-lineage data-observability marquez marquez-docker observability openlineage openlineage-docker
Last synced: 06 Feb 2026
https://github.com/mjanez/portaljs-starter-marmot
Docker-based deployment PortalJS Framework template for Marmot
data-catalog data-discovery data-distribution-platform data-governance data-lineage data-observability data-quality docker docker-compose marmot metadata nextjs portaljs starter-kit
Last synced: 28 Feb 2026
https://github.com/sachinvarghese/pgamber
Data observability for PostgreSQL using alibi-detect
data-observability outlier-detection postgresql
Last synced: 22 Apr 2026