Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with data-quality

A curated list of projects in awesome lists tagged with data-quality.

https://github.com/opendatadiscovery/odd-great-expectations

Integration for collecting metadata from Great Expectations

data-governance data-quality

Last synced: 14 Nov 2024

https://github.com/data-drift/dbt-snapshot-analytics

Get insights into your metric quality from a dbt snapshot

analytics data-quality dbt monitoring snapshot

Last synced: 30 Dec 2024

https://github.com/maastrichtu-ids/dqa-pipeline

Large-scale RDF-based Data Quality Assessment Pipeline

data-quality docker fair-data rdf sparql

Last synced: 21 Dec 2024

https://github.com/maastrichtu-ids/fairsharing-metrics

📊 Fairsharing metrics implementation

bioinformatics data-quality docker python rdf rdfunit

Last synced: 21 Dec 2024

https://github.com/warfox/dqt

Data Quality Tool

clojure data-quality data-reliability

Last synced: 09 Nov 2024

https://github.com/a-chumagin/soda-contract-poc

PoC for Soda Contracts against Vertica DB

data-contracts data-governance data-quality soda

Last synced: 02 Jan 2025

https://github.com/dp6/templates-centro-de-inovacoes

Architecture templates, documentation, tests, and deployments for the innovation center's initiatives

data-quality data-science data-structures dp6 gtm inovacao

Last synced: 04 Dec 2024

https://github.com/anerv/bikedna_analysis

Code for analyzing the results from running BikeDNA BIG (https://github.com/anerv/BikeDNA_BIG) on bicycle infrastructure data from Denmark.

bicycle-infrastructure bicycle-network data-quality geospatial-data open-street-map sustainable-mobility urban-planning volunteered-geographic-information

Last synced: 16 Dec 2024

https://github.com/openfoodfacts/contributor-quality-issues

Report data quality issues due to contributing apps/users

data-quality

Last synced: 11 Nov 2024

https://github.com/nhsdigital/sde_summary_notebooks

Notebooks provided by the Wranglers for users to quickly gain insights on datasets inside the Secure Data Environment (SDE)

data-analysis data-linkage data-quality data-summary metrics statistics

Last synced: 23 Dec 2024

https://github.com/BetweenTwoTests/between_dbs

DDL & test data for different databases for ETL data quality checks / data loading tests

data-quality database etl

Last synced: 04 Dec 2024

https://github.com/manesioz/bq_dq_plugin

Airflow plug-in that allows you to automate robust Data Quality checks for BigQuery

airflow airflow-plugin data-quality data-quality-checks google-bigquery

Last synced: 09 Nov 2024

https://github.com/wikidata/purdue-data-mine-2024

Program materials for WMDE's 2024 Purdue Data Mine project

analytics data-analysis data-quality data-science etl open-data python wikidata wikimedia

Last synced: 18 Nov 2024

https://github.com/steveanik/kestra

Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.

data data-engineering data-integration data-pipeline data-quality elt etl low-code orchestration pipelines scheduler workflow workflow-engine

Last synced: 07 Dec 2024

https://github.com/hadarsharon/compars

DataFrame comparison done right, powered by Rust with polars (AKA the bear-agnostic 🐻 🐼 🐨 🐻‍❄️ DataFrame comparison library)

data-engineering data-profiling data-quality dataframe dataframes koalas pandas polars pyspark python rust spark

Last synced: 12 Nov 2024

https://github.com/pawsanie/pyspark_universal_dq_report

Script that reads a dataset from a given path, selects the columns passed as arguments for the specified dates, and saves the resulting report to the specified HDFS path.

data-quality data-quality-checks data-quality-monitoring dq hadoop hadoop-hdfs hdfs pyspark python python-3 python-script python3

Last synced: 02 Jan 2025

https://github.com/mynttt/dqgui

DQGUI is an IDE written in JavaFX for the IQM4HD DSL (Domain Specific Language)

data-quality data-quality-monitoring data-science database dsl ide java javafx

Last synced: 23 Dec 2024

https://github.com/firoz-ahmad-likhon/great-expectations-example

Sample project to demonstrate the use of Great Expectations

data-engineering data-quality data-validation great-expectations python

Last synced: 01 Jan 2025

https://github.com/b-cubed-eu/comp-unstructured-data

Scripts to explore the conditions that determine the reliability of models, trends and status by comparing aggregated cubes with structured monitoring schemes

data-cubes data-quality r rstats structured-data unstructured-data

Last synced: 14 Dec 2024