https://github.com/maximiliancw/completely
Measure your data completeness
https://github.com/maximiliancw/completely
data data-cleaning data-quality data-science missing-data
Last synced: 7 months ago
JSON representation
Measure your data completeness
- Host: GitHub
- URL: https://github.com/maximiliancw/completely
- Owner: maximiliancw
- License: mit
- Created: 2020-12-30T13:42:35.000Z (about 5 years ago)
- Default Branch: master
- Last Pushed: 2024-03-25T10:12:26.000Z (almost 2 years ago)
- Last Synced: 2025-05-28T07:32:12.953Z (8 months ago)
- Topics: data, data-cleaning, data-quality, data-science, missing-data
- Language: Python
- Homepage:
- Size: 12.7 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# completely
*High-quality* data is extremely important nowadays. But before you start cleaning/processing it,
you might want to check how **complete** the dataset really is:
```python
from completely import measure
data = [{'name': 'Bob', 'age': 42}, {'name': 'Alice', 'age': None}, {'name': '', 'age': 100}]
print(measure(data))
# Output: 0.667
```
**completely** currently works with:
- strings / ints / floats
- lists
- sets
- dicts
- Nested objects of one of the above
## Setup
```bash
pip install completely
```
## Contributing
We're happy about every meaningful contribution to this project via pull requests. If needed, we'll setup more precise guidelines on how to contribute at some point.