Projects in Awesome Lists tagged with data-validation
A curated list of projects in awesome lists tagged with data-validation .
https://github.com/rjsf-team/react-jsonschema-form
A React component for building Web forms from JSON Schema.
data-validation forms json json-schema react ui web
Last synced: 12 Feb 2026
https://rjsf-team.github.io/react-jsonschema-form/
A React component for building Web forms from JSON Schema.
data-validation forms json json-schema react ui web
Last synced: 23 Apr 2025
https://github.com/cleanlab/cleanlab
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
active-learning annotation data-centric-ai data-cleaning data-curation data-labeling data-profiling data-quality data-science data-validation dataops dataquality datasets exploratory-data-analysis labeling llms noisy-labels out-of-distribution-detection outlier-detection weak-supervision
Last synced: 08 Jan 2026
https://github.com/open-metadata/openmetadata
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
data-catalog data-collaboration data-contracts data-discovery data-governance data-lineage data-observability data-profiling data-quality data-quality-checks data-science data-validation datadiscovery dataengineering dataquality dbt hacktoberfest metadata metadata-management snowflake
Last synced: 22 Feb 2026
https://github.com/evidentlyai/evidently
Evidently is ββan open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.
data-drift data-quality data-science data-validation generative-ai hacktoberfest html-report jupyter-notebook llm llmops machine-learning mlops model-monitoring pandas-dataframe
Last synced: 13 May 2025
https://github.com/open-metadata/OpenMetadata
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
data-catalog data-collaboration data-contracts data-discovery data-governance data-lineage data-observability data-profiling data-quality data-quality-checks data-science data-validation datadiscovery dataengineering dataquality dbt hacktoberfest metadata metadata-management snowflake
Last synced: 15 Mar 2025
https://github.com/deepchecks/deepchecks
Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and models from research to production.
data-drift data-science data-validation deep-learning html-report jupyter-notebook machine-learning ml mlops model-monitoring model-validation pandas-dataframe python pytorch
Last synced: 16 May 2025
https://github.com/unionai-oss/pandera
A light-weight, flexible, and expressive statistical data testing library
assertions data-assertions data-check data-cleaning data-processing data-validation data-verification dataframe-schema dataframes hypothesis-testing pandas pandas-dataframe pandas-validation pandas-validator schema testing testing-tools validation
Last synced: 29 Jan 2026
https://github.com/pyeve/cerberus
Lightweight, extensible data validation library for Python
Last synced: 12 Dec 2025
https://github.com/sodadata/soda-core
:zap: Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
data-contracts data-engineering data-governance data-monitoring data-observability data-profiling data-quality data-quality-checks data-quality-monitoring data-quality-testing data-reliability data-testing data-unit-tests data-validation dataquality datatesting dbt pipeline-testing python snowflake
Last synced: 14 May 2025
https://github.com/dry-rb/dry-validation
Validation library with type-safe schemas and rules
coercion data-validation dry-rb gem ruby rubygem type-safety validation
Last synced: 12 May 2025
https://github.com/cleanlab/cleanvision
Automatically find issues in image datasets and practice data-centric computer vision.
computer-vision data-centric-ai data-exploration data-profiling data-quality data-science data-validation deep-learning exploratory-data-analysis image-analysis image-classification image-generation image-quality image-segmentation
Last synced: 06 Jan 2026
https://github.com/rstudio/pointblank
Data quality assessment and metadata reporting for data frames and database tables
data-assertions data-checker data-dictionaries data-frames data-inference data-management data-profiler data-quality data-validation data-verification database-tables easy-to-understand reporting-tool schema-validation testing-tools yaml-configuration
Last synced: 14 May 2025
https://github.com/biscolab/laravel-recaptcha
Google ReCaptcha package for Laravel
checkbox data-validation form-validation google-recaptcha google-recaptcha-v2 google-recaptcha-v3 larave11 laravel laravel-recaptcha laravel10 laravel5 laravel6 laravel7 laravel8 laravel9 php php8 recaptcha recaptcha-api recaptcha-validation
Last synced: 03 Oct 2025
https://github.com/dry-rb/dry-schema
Coercion and validation for data structures
coercion data-validation data-validator dry-rb ruby ruby-gem schema schema-validation type-safety
Last synced: 29 Apr 2025
https://github.com/lucono/xtypejs
Elegant, highly efficient data validation for JavaScript.
data-types data-validation javascript
Last synced: 04 Apr 2025
https://github.com/MigoXLab/dingo
Dingo: A Comprehensive AI Data Quality Evaluation Tool
common-crawl data-evaluation data-quality data-quality-assessment data-quality-report data-science data-validation dataquality datascience deepseek gpt hallucination hallucination-detection llm openai opencompass qwen spark vlm
Last synced: 29 Aug 2025
https://github.com/okfn/opendataeditor
The Open Data Editor (ODE) is a no-code application to explore and validate tabular data in a simple way. Forever free and open source project powered by the Frictionless Framework.
data-validation desktop-application digitalpublicgoods dpg sdg11 sdg16 sdg9 sdgs
Last synced: 23 Jan 2026
https://github.com/kjam/data-cleaning-101
Data Cleaning Libraries with Python
data-validation data-wrangling python teaching
Last synced: 31 Jan 2026
https://github.com/shopnilsazal/validus
A dead simple Python string validation library.
data-validation package python3
Last synced: 31 Jan 2026
https://github.com/maif/eurybia
β Eurybia monitors model drift over time and securizes model deployment with data validation
data-drift data-validation datadrift-classifier domain-classifier drift drift-detection html-report machine-learning model-drift model-monitoring production-machine-learning python
Last synced: 08 Oct 2025
https://posit-dev.github.io/pointblank/
Data validation made beautiful and powerful
data-quality data-testing data-validation easy-to-understand tabular-data
Last synced: 22 Jun 2025
https://github.com/bids-standard/legacy-validator
Validator for the Brain Imaging Data Structure
bids data-validation neuroimaging
Last synced: 04 Oct 2025
https://github.com/seandstewart/typical
Typical: Fast, simple, & correct data-validation using Python 3 typing.
annotations data-validation deserialization python-types python3 python3-library python36 python37 python38 serde serdes serialization type-annotations type-hints type-safety typical typing validation
Last synced: 21 Oct 2025
https://github.com/atrocore/atropim
AtroPIM is a flexible, highly configurable, modular, open-source product information management (PIM) system that extends the AtroCore data management and system integration platform.
b2b data-validation digital-platform ecommerce-platform ecommerce-platfrom erp-integration headless php pim product-catalog product-content-management product-data product-data-management product-experience product-experience-management product-information-management product-management product-quality pxm svelte
Last synced: 23 Jan 2026
https://github.com/datachecks/dcs-core
Open Source Data Quality Monitoring.
data-engineering data-governance data-observability data-ops data-quality-monitor data-quality-monitoring data-validation database dataops dataquality elasticsearch etl metrics mlops monitoring mysql postgres postgresql python sql
Last synced: 03 Mar 2026
https://github.com/AKSW/RDFUnit
An RDF Unit Testing Suite
data-quality data-quality-checks data-validation rdf schema schema-validation shacl unit-testing validation web-ontology-language
Last synced: 03 Apr 2025
https://github.com/medtagger/MedTagger
A collaborative framework for annotating medical datasets using crowdsourcing.
crowdsourcing data-science data-validation deep-learning labeling medical-imaging
Last synced: 08 May 2025
https://github.com/implerhq/impler.io
Powerful CSV & Excel Import experience for SaaS π Save months building data import experience from scratch π°
csv-import data-import data-mapping data-review data-validation excel-import lerna mongodb nestjs nextjs nx pnpm-monorepo typescript
Last synced: 27 Mar 2026
https://github.com/target/data-validator
A tool to validate data, built around Apache Spark.
data-science data-validation hacktoberfest
Last synced: 13 May 2025
https://github.com/fiverr/passable
Declarative data validations.
data-validation isomorphic isomorphic-validations validations
Last synced: 06 Apr 2025
https://github.com/posit-dev/pointblank
Find out if your data is what you think it is
data-quality data-testing data-validation easy-to-understand tabular-data
Last synced: 04 Feb 2026
https://github.com/serradura/u-attributes
Create "immutable" objects with no setters, just getters.
activemodel change-detection data-integrity data-validation getters immutability no-setters ruby ruby-gem
Last synced: 07 Apr 2025
https://github.com/green-coder/minimallist
A minimalist data driven data model library, inspired by Clojure Spec and Malli.
clojure clojurescript data-parsing data-validation
Last synced: 10 Jul 2025
https://github.com/voltraco/dml
A data modeling language (for node and the browser)
data-model data-validation modeling
Last synced: 12 Apr 2025
https://github.com/argyle-engineering/pydantic2zod
pydantic --> zod data models
data-validation pydantic python typescript zod
Last synced: 06 Apr 2025
https://github.com/vertti/daffy
Lightweight DataFrame validation decorators for Pandas, Polars, Modin, and PyArrow. No custom types required.
data-quality data-validation dataframe dataframe-schema dataframe-validation decorator modin narwhals pandas polars pyarrow pydantic python python-decorator runtime-validation validation
Last synced: 21 Feb 2026
https://github.com/sparkdq-community/sparkdq
A declarative PySpark framework for row- and aggregate-level data quality validation.
data-check data-engineering data-quality data-validation data-verification dq-framework pyspark pyspark-validation spark-data-quality
Last synced: 02 Jul 2025
https://github.com/m6io/rjsf-tailwind
A Tailwind theme for react-jsonschema-form
data-validation forms json json-schema jsonschema monaco-editor react shadcn-ui tailwind tailwindcss ui web
Last synced: 12 Apr 2025
https://github.com/cinar/checker
Effortless input validation in Go with the power of struct tags. No dependencies, just pure simplicity. β¨ See how! π
checker customizable data-integrity data-validation form-validation go golang input-validation library lightweight localization no-dependencies normalization security struct-tags validation validator
Last synced: 12 Jan 2026
https://github.com/matheusfelipeog/fordev
Gere e valide dados randΓ΄micos com fordev π²
4devs 4devs-api 4devs-module api data-generator data-manipulation data-validation fake-data fake-data-generator fordev fourthdev python random-data scrapping
Last synced: 29 Jun 2025
https://github.com/egnha/valaddin
Functional input validation to make R functions more readable and robust
data-validation input-validation r type-safety
Last synced: 02 May 2025
https://github.com/cstsunfu/intc
Python Intelligence Config Manager. A superset of hydra+pydantic+lsp
completion config data-validation dataclass deep-learning goto-definition hover hydra intelligent json json-schema lsp neovim pydantic python vscode
Last synced: 12 Apr 2025
https://github.com/emilyriederer/convo
R package based on "Column Names as Contracts" blog post (https://emilyriederer.netlify.app/post/column-name-contracts/)
controlled-vocabulary data-quality data-validation r-package schema-design variable-names variable-naming
Last synced: 26 Oct 2025
https://github.com/phink/changeset
(unreleased) Data validation with first-class and first-order labels in OCaml
Last synced: 07 May 2025
https://github.com/cleanlab/cleanlab-studio
Client interface to Cleanlab Studio and the Trustworthy Language Model
annotations automl computer-vision data-centric-ai data-cleaning data-curation data-labeling data-profiling data-quality data-science data-validation image-classification llm machine-learning model-deployment natural-language-processing noisy-labels outlier-detection structured-data text-classification
Last synced: 13 Apr 2025
https://github.com/jnmclarty/validada
Another library for defensive data analysis.
checkset data data-analysis data-validation decorators pandas slice validation
Last synced: 24 Jan 2026
https://github.com/montara-io/dbt-command-center
Never sift through endless dbtβ’ logs again. dbt Command Center is a free, open-source, local web application that provides a user-friendly interface to monitor and manage dbt runs.
analytics-engineering bigquery data-analysis data-catalog data-engineering data-lineage data-observability data-pipeline data-pipelines data-validation data-warehouse dataops dbt dbt-packages elt etl orchestration python redshift
Last synced: 05 May 2025
https://github.com/tminglei/form-binder-java
Java port of form-binder, a micro data binding and validating framework
Last synced: 04 May 2025
https://github.com/elliotgao2/xdata
Data validator for the zen of python
data-validation datatype marshalling python serialization validation validator xdata
Last synced: 01 Aug 2025
https://github.com/daveoncode/pyvaru
Rule based data validation library for python 3.
data data-validation form-validation model validation validator
Last synced: 06 May 2025
https://github.com/tminglei/form-binder
A micro data binding and validating framework, very easy to use and hack
Last synced: 29 Oct 2025
https://github.com/hendurhance/laracaptcha
A Laravel package to seamlessly use hCapthca or reCaptcha v2 or v3 on your forms or RESTful APIs
captcha data-validation form-validation google-recaptcha google-recaptcha-v2 google-recaptcha-v3 hcaptcha hcaptcha-api laravel laravel-captcha laravel-package laravel10 laravel8 laravel9 php php8 recaptcha recaptcha-api recaptcha-v2 recaptcha-v3
Last synced: 27 Oct 2025
https://github.com/romelperez/yrel
JavaScript JSON schema validation with TypeScript type inference.
data-validation schema-validation type-inference typescript
Last synced: 17 Mar 2025
https://github.com/seadonggyun4/truthound
"Sniffs out bad data"
anomaly-detection data-quality data-validation polars schema-inference statistical-drift-detection
Last synced: 27 Jan 2026
https://github.com/data-catering/data-caterer
Data generation and validation tool for any data source
data-generation data-quality data-test data-testing data-validation java scala testing-automation ui yaml
Last synced: 02 Sep 2025
https://github.com/sigma-andex/purescript-morello
Cherry-picking π for your data
data-transformation data-validation purescript
Last synced: 16 Jan 2026
https://github.com/seanluis/rest-data-validator
REST Data Validator is a versatile library designed to offer comprehensive validation for data in RESTful APIs. It supports a wide range of data types, validation rules, and is designed with extensibility in mind, making it ideal for ensuring data integrity and compliance with API specifications.
api assets data-validation decorators framework hacktoberfest hacktoberfest-2024 hacktoberfest-accepted input javascript node rest rest-api sanitize schema typescript validate validation-library validations validator
Last synced: 27 Feb 2026
https://github.com/inbo/pywhip
Python package to validate data against whip specifications
data-validation lifewatch oscibio python
Last synced: 02 May 2025
https://github.com/nshkrdotcom/sinter
Unified schema definition, validation, and JSON generation for Elixir
beam data-modeling data-validation elixir erlang-vm functional-programming json-generation json-schema nshkr-schema otp schema schema-validation type-checking type-system validation
Last synced: 21 Feb 2026
https://github.com/venkat-0706/number-guessing-game---python
Python number guessing game: Computer picks a random number, user tries to guess it. Computer gives hints (too high/low). Repeat until correct. Add difficulty levels, limited tries, or scoring for more challenge.
algorithm-design basic-python-project conditional-statements data-validation error-handling game-development logic-programming loops python-programming random-number-generator user-input-handling
Last synced: 13 Apr 2025
https://github.com/eredotpkfr/golidators
β¨ Data validators for Golang
data-validation data-validator go golang golang-package golidators valid validate validation validations validator validators
Last synced: 02 Mar 2026
https://github.com/ravage84/cakephp-multi-column-uniqueness
A CakePHP 2.x behavior plugin to validate the uniqueness of multiple columns of a model
cakephp cakephp-plugin data-validation uniqueness
Last synced: 11 Jul 2025
https://github.com/msgflux/msgspec-ext
High-performance settings management and validation library extending msgspec
data-validation settings-management
Last synced: 06 Mar 2026
https://github.com/inbo/whip
β Human and machine-readable syntax to express specifications for data
data-validation oscibio specification
Last synced: 13 Oct 2025
https://github.com/degree9/covenant
Access Control and Data Validation for Clojure(Script) written in Clojure Spec.
abac access-control acl clojure clojurescript covenant data-validation rbac
Last synced: 11 Aug 2025
https://github.com/HzaCode/ChemInformant
βοΈ An all-in-one solution for chemical property retrieval from PubChem.
api-client batch-processing caching cas cheminformatics chemistry cli compound-search data-validation dataframe drug-discovery iupac molecular-descriptors molecular-weight pandas pubchem python rest-api smiles sql
Last synced: 20 Nov 2025
https://github.com/openalloc/swiftdetailer
A multi-platform SwiftUI component for editing fielded data
data-editor data-validation details-view swift-lang swift-language swift-library swiftui swiftui-components swiftui-library
Last synced: 09 Apr 2025
https://github.com/nadeesha/structlm
Token-efficient schema definition for getting structured output from LLMs.
agents data-validation llm typescript
Last synced: 10 Oct 2025
https://github.com/nhsdigital/data-validation-engine
Data Validation Engine source code
data-validation duckdb pyspark python3
Last synced: 30 Oct 2025
https://github.com/algosup/2024-2025-project-3-quickest-path-team-5
π Quickest Path REST API | A high-performance C++ REST API for calculating the quickest path between landmarks in the United States. Supports JSON/XML responses, efficient pathfinding using graph algorithms, and robust data validation. Ideal for route planning applications!
cplusplus data-validation graph-algorithms high-performance json pathfinding rest-api server shortest-path xml
Last synced: 17 Mar 2026
https://github.com/discretetom/rdiabv
Remote Data Integrity Auditing with Bidirectional Verification.
cloud cloud-storage-provider data-validation
Last synced: 20 Aug 2025
https://github.com/azukds/input_checker
A python package to check that data in pandas DataFrame conforms to certain conditions.
Last synced: 10 Apr 2025
https://github.com/sno2/jason
A composable validation library for TypeScript and JavaScript with amazing type support.
data-validation deno deno-module node typescript
Last synced: 14 Apr 2025
https://github.com/tracebloc/data-ingestors
tracebloc data pipeline for training/test dataset setup
data-ingestion data-pipeline data-preparation data-preprocessing-and-cleaning data-validation tracebloc
Last synced: 17 Mar 2026
https://github.com/dealfonso/typedobject
TypeObject is a PHP class to ease the usage of objects coming from a JSON definition. The library parses JSON objects into PHP objects, taking care of the types of the attributes and/or the conversion to the target types
data-validation json json-parsing json-validation php php-json php-library
Last synced: 16 Mar 2026
https://github.com/gkshi/nuxt-models
JS model controller for your Nuxt.js project
data-validation entities models nuxt nuxtjs
Last synced: 16 Jun 2025
https://github.com/astrodynamic/retailanalitycs-in-postgresql
Develop a SQL script to create a database with tables, views, roles, and functions. Form personalized offers to increase average check, frequency of visits, and cross-selling.
bd csv data-analysis data-export data-input data-manipulation data-validation database-management functions git margin offers postgresql retail role-permission-management selling sql transaction tsv views
Last synced: 20 Sep 2025
https://github.com/adamo08/orderprocessingsystem
An automated system for processing customer orders from JSON input, validating data, and synchronizing with a database while managing errors and outputs.
automation data-validation database-integration error-handling file-handling java json-processing juint maven mysql order-management project-lombok threads timer-task
Last synced: 19 Apr 2025
https://github.com/swiftylab/validatablekit
Composable data validation API in Swift exposing simple DSL for writing validations.
carthage cocoapods data-validation data-validator dsl swift swift-package-manager
Last synced: 21 Jan 2026
https://github.com/daiangm/joker-validator
Validador de dados JSON para NodeJS
customized data data-validation javascript json json-schema nodejs schema validador validation validator
Last synced: 04 Jul 2025
https://github.com/byteplant/email-validator-net
NodeJS wrapper for the email-validator.net API
byteplant cleaning cleaning-data data-quality data-validation email email-cleaning email-marketing email-validation email-verification javascript node-js node-module typescript validation verification
Last synced: 29 Oct 2025
https://github.com/shariarfaisal/validator
Schema validator in go
data-validation error-handling go-error-handler go-validator input-validation schema-validator validator
Last synced: 11 Mar 2026
https://github.com/stopsopa/validator
JSR-303 Bean Validation for javascript
data-validation data-validator jsr-303 jsr-303-bean jsr-303-bean-validation jsr303 jsr303-bean jsr303-bean-validation validate validate-js validation validation-engine validation-library validations validator validatorjs validators
Last synced: 13 Mar 2025
https://github.com/byteplant/address-validator-net
NodeJS wrapper for the address-validator.net API
address address-autocomplete address-cleaning address-matching address-validation address-verification autocomplete byteplant cleaning cleaning-data data-quality data-validation javascript node-js node-module typescript validation verification wrapper
Last synced: 16 Jul 2025
https://github.com/cdcgov/cancer-report-validator
The Cancer Report Validator (CRV) is an interactive tool for validating the content of electronic submissions of cancer-related medical information prior to a system's communication with a public health central cancer registry.
cancer cancer-registry cancer-reporting cda clinical-document-architecture data-validation hl7 informatics
Last synced: 17 Feb 2026
https://github.com/jbris/jackson-data-models-example
Demonstration of Jackson data models
annotation annotations data-model data-model-tools data-modeling data-modelling data-models data-validation iris-dataset jackson jackson-annotation jackson-core jackson-databind jackson-json jackson-json-processor jackson-module json-schema json-schema-definitions json-schema-generator metadata
Last synced: 26 Feb 2026
https://github.com/dnth/fastdup-manage-clean-curate-blogpost
Find duplicate and anomalies in your dataset. Identify wrong/confusing labels in your dataset. Uncover data leak in your dataset.
anomaly-detection computer-vision data-science data-validation duplicate-detection python
Last synced: 27 Mar 2025
https://github.com/hetman-app/hetman-pipeline
Hetman Pipeline is a flexible, developer-centric validation engine.
data-validation python validation
Last synced: 21 Feb 2026
https://github.com/mrmcmullan/flycatcher
Define your schema once & for all β built for DataFrames, powered across Pydantic, Polars, and SQLAlchemy.
data-engineering data-validation dataframe etl orm polars pydantic python python3 schema sqlalchemy type-checking validation
Last synced: 05 Mar 2026
https://github.com/leadcodedev/vine
Vine is a robust, typed validation library for Dart/Flutter, designed to simplify and secure data validation in applications.
dartlang data-validation vinejs
Last synced: 07 Jul 2025
https://github.com/vathymut/dsos
Dataset shift with outlier scores
data-drift data-validation dataset-shifts drift-detection machine-learning mlops model-monitoring model-validation performance-monitoring r statistical-process-control statistical-tests
Last synced: 30 Oct 2025
https://github.com/byteplant/jquery-address-validator-net
jQuery plugin for the address-validator.net API
address address-autocomplete address-validation data-cleaning data-quality data-validation form-validation form-validation-jquery javascript javascript-library jquery validation
Last synced: 25 Oct 2025
https://github.com/wkornewald/schematic
Python data validation library with a very simple API
conversion data-validation form python python-3-6 schema validation
Last synced: 27 Mar 2025
https://github.com/byteplant/phone-validator-net
NodeJS wrapper for the phone-validator.net API
byteplant cleaning cleaning-data data-quality data-validation javascript node-js node-module phone phone-marketing phone-number phone-number-verification phone-validation phonenumber typescript validation
Last synced: 02 Jul 2025
https://github.com/jacyuan1/input-validation-and-data-sanitization-project
A program that checks and removes unwanted input ensuring that data conforms to security-related requirements.
cybersecurity data-sanitization data-validation secure-software-development
Last synced: 24 Mar 2025
https://github.com/codesvault/validator
Data validation library
data-validation data-validator validator
Last synced: 14 Apr 2025
https://github.com/xou816/verifikit
Declarative decodable object validation
data-validation declarative property-wrappers swift5 validation
Last synced: 24 Mar 2025
https://github.com/danielbayley/schemas
A collection of useful @JSON-schema-org schemas for data validation.
ajv config configuration data data-science data-structures data-validation json json-schema linter linting schema schema-org validation yaml yaml-configuration
Last synced: 13 Oct 2025
https://github.com/milind-kulshrestha/sentri
Production-ready data quality validation framework with configurable checks and multiple connectors
data-engineering data-pipeline data-quality data-validation monitoring monitoring-automation monitoring-tool python quality-assurance
Last synced: 13 Jan 2026