Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with data-quality
A curated list of projects in awesome lists tagged with data-quality .
https://github.com/byteplant/jquery-address-validator-net
jQuery plugin for the address-validator.net API
address address-autocomplete address-validation data-cleaning data-quality data-validation form-validation form-validation-jquery javascript javascript-library jquery validation
Last synced: 16 Nov 2024
https://github.com/byteplant/phone-validator-net
NodeJS wrapper for the phone-validator.net API
byteplant cleaning cleaning-data data-quality data-validation javascript node-js node-module phone phone-marketing phone-number phone-number-verification phone-validation phonenumber typescript validation
Last synced: 12 Oct 2024
https://github.com/opendatadiscovery/odd-great-expectations
Integration for collecting metadata from Great Expectations
Last synced: 14 Nov 2024
https://github.com/data-drift/dbt-snapshot-analytics
Get insight from a dbt snapshot on your metric quality
analytics data-quality dbt monitoring snapshot
Last synced: 30 Dec 2024
https://github.com/maastrichtu-ids/dqa-pipeline
Large-scale RDF-based Data Quality Assessment Pipeline
data-quality docker fair-data rdf sparql
Last synced: 21 Dec 2024
https://github.com/maastrichtu-ids/fairsharing-metrics
📊 Fairsharing metrics implementation
bioinformatics data-quality docker python rdf rdfunit
Last synced: 21 Dec 2024
https://github.com/warfox/dqt
Data Quality Tool
clojure data-quality data-reliability
Last synced: 09 Nov 2024
https://github.com/bharathsudharsan/tiny-impute
On-device Hybrid Anomaly Detection and Data Imputation
anamoly-detection arduino data-quality edge-computing esp32 expectation-maximization imputation-algorithm iot knn laplacian micro-python mkr1000 moving-average raspberry-pi simple-linear-regression tinyml
Last synced: 25 Dec 2024
https://github.com/a-chumagin/soda-contract-poc
PoC for Soda Contracts against Vertica DB
data-contracts data-governance data-quality soda
Last synced: 02 Jan 2025
https://github.com/dp6/templates-centro-de-inovacoes
Modelos de arquiteturas, documentações, testes e deploys para as iniciativas do centro de inovação
data-quality data-science data-structures dp6 gtm inovacao
Last synced: 04 Dec 2024
https://github.com/byteplant/jquery-email-validator-net
jQuery plugin for the email-validator.net API
cleaning data-cleaning data-quality email email-cleaning email-marketing email-validation email-verification form-validation form-validation-jquery javascript javascript-library jquery validation verification
Last synced: 16 Nov 2024
https://github.com/anerv/bikedna_analysis
Code for analyzing the results from running BikeDNA BIG (https://github.com/anerv/BikeDNA_BIG) on bicycle infrastructure data from Denmark.
bicycle-infrastructure bicycle-network data-quality geospatial-data open-street-map sustainable-mobility urban-planning volunteered-geographic-information
Last synced: 16 Dec 2024
https://github.com/openfoodfacts/contributor-quality-issues
Report data quality issues due to contributing apps/users
Last synced: 11 Nov 2024
https://github.com/nhsdigital/sde_summary_notebooks
Notebooks provided by the Wranglers for users to quickly gain insights on datasets inside the Secure Data Environment (SDE)
data-analysis data-linkage data-quality data-summary metrics statistics
Last synced: 23 Dec 2024
https://github.com/BetweenTwoTests/between_dbs
DDL & test data for different databases for ETL data quality checks / data loading tests
Last synced: 04 Dec 2024
https://github.com/joocer/data_expectations
Are your data meeting your expectations?
data data-engineering data-quality data-science data-unit-tests observability pipelines quality validation
Last synced: 01 Dec 2024
https://github.com/ashbyt/python
Ashley Bythell - Python
dat data-cleansing data-quality data-science dataframe exploratory-data-analysis gis missing-data numpy pandas parsing python regression regular-expression scraping-websites sklearn svm-classifier visualization wrangling
Last synced: 09 Nov 2024
https://github.com/manesioz/bq_dq_plugin
Airflow plug-in that allows you to automate robust Data Quality checks for BigQuery
airflow airflow-plugin data-quality data-quality-checks google-bigquery
Last synced: 09 Nov 2024
https://github.com/jadelhelm/autoprep
Automated Preprocessing Pipeline - DataFrame
anomalies anomaly anomaly-detection automated automated-machine-learning automation data-cleaning data-cleaning-and-preprocessing data-quality machine-learning machinelearning machinelearning-python preprocessing preprocessing-data preprocessing-pipeline python python3 sklearn standardization tabular-data
Last synced: 10 Oct 2024
https://github.com/wikidata/purdue-data-mine-2024
Program materials for WMDE's 2024 Purdue Data Mine project
analytics data-analysis data-quality data-science etl open-data python wikidata wikimedia
Last synced: 18 Nov 2024
https://github.com/steveanik/kestra
Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.
data data-engineering data-integration data-pipeline data-quality elt etl low-code orchestration pipelines scheduler workflow workflow-engine
Last synced: 07 Dec 2024
https://github.com/hadarsharon/compars
DataFrame comparison done right, powered by Rust with polars (AKA the bear-agnostic 🐻 🐼 🐨 🐻❄️ DataFrame comparison library)
data-engineering data-profiling data-quality dataframe dataframes koalas pandas polars pyspark python rust spark
Last synced: 12 Nov 2024
https://github.com/maximiliancw/completely
Measure your data completeness
data data-cleaning data-quality data-science missing-data
Last synced: 03 Dec 2024
https://github.com/byteplant/jquery-phone-validator-net
jQuery plugin for the phone-validator.net API
data-cleaning data-quality data-validation form-validation form-validation-jquery javascript javascript-library jquery phone phone-validation validation
Last synced: 16 Nov 2024
https://github.com/pawsanie/pyspark_universal_dq_report
The script reads the dataset along the path and selects the columns in it received from the argument for the specified dates. Then it saves the report to the specified path of HDFS.
data-quality data-quality-checks data-quality-monitoring dq hadoop hadoop-hdfs hdfs pyspark python python-3 python-script python3
Last synced: 02 Jan 2025
https://github.com/mynttt/dqgui
DQGUI is an IDE written in JavaFX for the IQM4HD DSL (Domain Specific Language)
data-quality data-quality-monitoring data-science database dsl ide java javafx
Last synced: 23 Dec 2024
https://github.com/firoz-ahmad-likhon/great-expectations-example
Sample project to demonstrate the use of Great Expectations
data-engineering data-quality data-validation great-expectations python
Last synced: 01 Jan 2025
https://github.com/b-cubed-eu/comp-unstructured-data
Scripts to explore the conditions that determine the reliability of models, trends and status by comparing aggregated cubes with structured monitoring schemes
data-cubes data-quality r rstats structured-data unstructured-data
Last synced: 14 Dec 2024
https://github.com/isislab-unisa/kgheartbeat-historical-analysis
History of quality analysis performed by KGHeartBeat
data-quality knowledge-graph linked-open-data quality quality-assessment semantic-web
Last synced: 15 Nov 2024