Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/awslabs/deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
- Host: GitHub
- URL: https://github.com/awslabs/deequ
- Owner: awslabs
- License: apache-2.0
- Created: 2018-08-07T20:55:14.000Z (about 6 years ago)
- Default Branch: master
- Last Pushed: 2024-07-31T17:35:11.000Z (about 2 months ago)
- Last Synced: 2024-07-31T21:39:39.582Z (about 2 months ago)
- Topics: dataquality, scala, spark, unit-testing
- Language: Scala
- Homepage:
- Size: 69.4 MB
- Stars: 3,198
- Watchers: 81
- Forks: 524
- Open Issues: 149
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
Awesome Lists containing this project
- awesome-dataops - Deequ - A library built on top of Apache Spark for measuring data quality in large datasets. (Data Quality)
- awesome-production-machine-learning - Deequ - A library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets. (Industry-strength AD)
- jimsghstars - awslabs/deequ - Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets. (Scala)