An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with data-testing

A curated list of projects in awesome lists tagged with data-testing .

https://posit-dev.github.io/pointblank/

Data validation made beautiful and powerful

data-quality data-testing data-validation easy-to-understand tabular-data

Last synced: 22 Jun 2025

https://github.com/re-data/dbt-re-data

re_data - fix data issues before your users & CEO would discover them 😊

data-monitoring data-observability data-quality data-testing dbt dbt-packages sql

Last synced: 07 Apr 2025

https://github.com/posit-dev/pointblank

Find out if your data is what you think it is

data-quality data-testing data-validation easy-to-understand tabular-data

Last synced: 04 Feb 2026

https://github.com/sodadata/soda-spark

Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes

data-engineering data-observability data-quality data-testing pyspark python soda-sql spark

Last synced: 26 Jul 2025

https://github.com/datakitchen/dataops-testgen

DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data quality test generation and execution by data profiling,  new dataset hygiene review, AI generation of data quality validation tests, ongoing testing of data refreshes, & continuous anomaly monitoring

data data-engineering data-observability data-quality data-science data-testing datachecker dataops dataprofiling dataquality datavalidation mssql postgresql python redshift self-hosted snowflake

Last synced: 25 Feb 2026

https://github.com/serialbandicoot/great-assertions

This library is inspired by the Great Expectations library. The library has made the various expectations found in Great Expectations available when using the inbuilt python unittest assertions.

data-science data-testing databricks great-expectations jupyter-notebook python python3 quality-assurance testing

Last synced: 28 Oct 2025

https://github.com/shridhar1504/sales-forecasting-datascience-project

Develop a data science project using historical sales data to build a regression model that accurately predicts future sales. Preprocess the dataset, conduct exploratory analysis, select relevant features, and employ regression algorithms for model development. Evaluate model performance, optimize hyperparameters, and provide actionable insights.

data-analytics data-cleaning data-science data-testing data-visualization forecasting-models machin model-evaluation model-fitting prediction predictive-modeling python3 regression-algorithms salesforecast sklearn-library supervised-learning

Last synced: 30 Oct 2025

https://github.com/ericmjl/software-testing-open-source-and-data-science

Software Testing in Open Source and Data Science: A talk delivered at the Data Umbrella speaker series

data-science data-testing machine-learning-testing software-testing testing

Last synced: 25 Feb 2026

https://github.com/jafeerr/spark-data-test

Spark Data Test - A PySpark-based automation testing utility to compare Spark DataFrames

apache-spark data-testing dataframe pyspark

Last synced: 04 Oct 2025

https://github.com/manoj9788/spark-etl-tests

A sample repository showcasing, implementation of testing for ETL pipeline developed with Apache Spark

data-testing etl etl-automation scala

Last synced: 20 Jun 2025

https://github.com/balajimohan18/sales-forecasting-datascience-project

Develop a data science project using historical sales data to build a regression model that accurately predicts future sales. Preprocess the dataset, conduct exploratory analysis, select relevant features, and employ regression algorithms for model development. Evaluate model performance, optimize hyperparameters, and provide actionable insights.

data-analytics data-science data-testing data-visualization forecasting forecasting-models machine-learning model-evaluation predictive-modeling python regression-algorithms salesforecast scipy sklearn-library supervised-learning

Last synced: 04 Mar 2025