Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/great-expectations/great_expectations
Always know what to expect from your data.
https://github.com/great-expectations/great_expectations
cleandata data-engineering data-profilers data-profiling data-quality data-science data-unit-tests datacleaner datacleaning dataquality dataunittest eda exploratory-analysis exploratory-data-analysis exploratorydataanalysis mlops pipeline pipeline-debt pipeline-testing pipeline-tests
Last synced: 6 days ago
JSON representation
Always know what to expect from your data.
- Host: GitHub
- URL: https://github.com/great-expectations/great_expectations
- Owner: great-expectations
- License: apache-2.0
- Created: 2017-09-11T00:18:46.000Z (over 7 years ago)
- Default Branch: develop
- Last Pushed: 2024-12-31T15:50:55.000Z (12 days ago)
- Last Synced: 2025-01-02T04:51:08.999Z (10 days ago)
- Topics: cleandata, data-engineering, data-profilers, data-profiling, data-quality, data-science, data-unit-tests, datacleaner, datacleaning, dataquality, dataunittest, eda, exploratory-analysis, exploratory-data-analysis, exploratorydataanalysis, mlops, pipeline, pipeline-debt, pipeline-testing, pipeline-tests
- Language: Python
- Homepage: https://docs.greatexpectations.io/
- Size: 215 MB
- Stars: 10,078
- Watchers: 85
- Forks: 1,549
- Open Issues: 73
-
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING_CODE.md
- License: LICENSE
- Citation: CITATION.cff
- Codeowners: CODEOWNERS
Awesome Lists containing this project
- best-of-python - GitHub - 11% open ยท โฑ๏ธ 06.06.2024): (Data Pipelines & Streaming)
- awesome-llmops - Great Expectations - expectations/great_expectations.svg?style=flat-square) | (Security / Observability)
- awesome-python-machine-learning - Great Expectations - Great Expectations is a framework that helps teams save time and promote analytic integrity with a new twist on automated testing: pipeline tests. (Uncategorized / Uncategorized)
- awesome-data-quality - great-expectations - tool for data testing, documentation, and profiling. (Table of Contents / Frameworks and Libraries)
- awesome-starred - great-expectations/great_expectations - Always know what to expect from your data. (data-science)
- awesome-list - Great Expectations - Helps data teams eliminate pipeline debt, through data testing, documentation, and profiling. (Data Management & Processing / Database & Cloud Management)
- StarryDivineSky - great-expectations/great_expectations
- awesome-iceberg - Great Expectations - Data validation framework. (๐ Additional Sections / 3. Open-source Projects)
- awesome-lakehouse - Great Expectations - Data validation framework. (๐ Additional Sections / 3. Open-source Projects)
- awesome-lakehouse - Great Expectations - Data validation framework. (๐ Additional Sections / 3. Open-source Projects)
- awesome-iceberg - Great Expectations - Data validation framework. (๐ Additional Sections / 3. Open-source Projects)
- awesome-python-machine-learning-resources - GitHub - 12% open ยท โฑ๏ธ 26.08.2022): (ๆฐๆฎ็ฎก้ๅๆตๅค็)
- project-awesome - great-expectations/great_expectations - Always know what to expect from your data. (Python)
- jimsghstars - great-expectations/great_expectations - Always know what to expect from your data. (Python)
README
[![Python Versions](https://img.shields.io/pypi/pyversions/great_expectations.svg)](https://pypi.python.org/pypi/great_expectations)
[![PyPI](https://img.shields.io/pypi/v/great_expectations)](https://pypi.org/project/great-expectations/#history)
[![PyPI Downloads](https://img.shields.io/pypi/dm/great-expectations)](https://pypistats.org/packages/great-expectations)
[![Build Status](https://img.shields.io/azure-devops/build/great-expectations/bedaf2c2-4c4a-4b37-87b0-3877190e71f5/1)](https://dev.azure.com/great-expectations/great_expectations/_build/latest?definitionId=1&branchName=develop)
[![pre-commit.ci Status](https://results.pre-commit.ci/badge/github/great-expectations/great_expectations/develop.svg)](https://results.pre-commit.ci/latest/github/great-expectations/great_expectations/develop)
[![codecov](https://codecov.io/gh/great-expectations/great_expectations/graph/badge.svg?token=rbHxgTxYTs)](https://codecov.io/gh/great-expectations/great_expectations)
[![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.5683574.svg)](https://doi.org/10.5281/zenodo.5683574)
[![Twitter Follow](https://img.shields.io/twitter/follow/expectgreatdata?style=social)](https://twitter.com/expectgreatdata)
[![Slack Status](https://img.shields.io/badge/slack-join_chat-white.svg?logo=slack&style=social)](https://greatexpectations.io/slack)
[![Contributors](https://img.shields.io/github/contributors/great-expectations/great_expectations)](https://github.com/great-expectations/great_expectations/graphs/contributors)
[![Ruff](https://img.shields.io/endpoint?url=https://raw.githubusercontent.com/charliermarsh/ruff/main/assets/badge/v1.json)](https://github.com/charliermarsh/ruff)## About GX Core
GX Core is the engine of the GX platform. It combines the collective wisdom of thousands of community members with a proven track record in data quality deployments worldwide, wrapped into a super-simple package for data teams.
Its powerful technical tools start with Expectations: expressive and extensible unit tests for your data. Expectations foster collaboration by giving teams a common language to express data quality tests in an intuitive way. You can automatically generate documentation for each set of validation results, making it easy for everyone to stay on the same page. This not only simplifies your data quality processes, but helps preserve your organizationโs institutional knowledge about its data.
Learn more about how data teams are using GX Core in our featured [case studies](https://greatexpectations.io/case-studies/).
## Integration support policy
GX Core supports Python `3.9` through `3.12`.
Experimental support for Python `3.13` and later can be enabled by setting a `GX_PYTHON_EXPERIMENTAL` environment variable when installing `great_expectations`.For data sources and other integrations that GX supports, see [GX integration support policy](https://docs.greatexpectations.io/docs/application_integration_support) for additional information.
## Get started
GX recommends deploying GX Core within a virtual environment. For more information about getting started with GX Core, see [Introduction to GX Core](https://docs.greatexpectations.io/docs/core/introduction/).
1. Run the following command in an empty base directory inside a Python virtual environment to install GX Core:
```bash title="Terminal input"
pip install great_expectations
```
2. Run the following command to import the `great_expectations module` and create a Data Context:```python
import great_expectations as gxcontext = gx.get_context()
```## Get support from GX and the community
They are listed in the order in which GX is prioritizing the support issues:
1. Issues and PRs in the [GX GitHub repository](https://github.com/great-expectations)
2. Questions posted to the [GX Core Discourse forum](https://discourse.greatexpectations.io/c/oss-support/11)
3. Questions posted to the [GX Slack community channel](https://greatexpectationstalk.slack.com/archives/CUTCNHN82)## Contribute
We deeply value the contributions of our community. We're now accepting PRs for bug fixes.To ensure the long-term quality of the GX Core codebase, we're not yet ready to accept feature contributions to the parts of the codebase that don't have clear APIs for extensions. We're actively working to increase the surface area for contributions. Thank you being a crucial part of GX's data quality platform!
### Levels of contribution readiness
๐ข Ready. Have a clear and public API for extensions.๐ก Partially ready. Case-by-case.
๐ด Not ready. Will accept contributions that fix existing bugs or workflows.
| GX Component | Readiness | Notes |
| -------------------- | ------------------ | ----- |
| CredentialStore | ๐ข Ready | |
| BatchDefinition | ๐ก Partially ready | Formerly known as splitters |
| Action | ๐ข Ready | |
| DataSource | ๐ด Not ready | Includes MetricProvider and ExecutionEngine |
| DataContext | ๐ด Not ready | Also known as Configuration Stores |
| DataAsset | ๐ด Not ready | |
| Expectation | ๐ด Not ready | |
| ValidationDefinition | ๐ด Not ready | |
| Checkpoint | ๐ด Not ready | |
| CustomExpectations | ๐ด Not ready | |
| Data Docs | ๐ด Not ready | Also known as Renderers |## Code of conduct
Everyone interacting in GX Core project codebases, Discourse forums, Slack channels, and email communications is expected to adhere to the [GX Community Code of Conduct](https://discourse.greatexpectations.io/t/gx-community-code-of-conduct/1199).