Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/renumics/sliceguard
A library for detecting problematic data segments in structured and unstructured data with few lines of code.
- Host: GitHub
- URL: https://github.com/renumics/sliceguard
- Owner: Renumics
- License: MIT
- Created: 2023-06-14T07:57:52.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-01-10T15:46:47.000Z (10 months ago)
- Last Synced: 2024-10-11T09:37:53.602Z (26 days ago)
- Topics: data-analysis, data-cleaning, data-curation, data-exploration, data-science, data-visualization, deep-learning, eda, exploratory-data-analysis, machine-learning, python, visualization
- Language: Python
- Homepage:
- Size: 4.28 MB
- Stars: 62
- Watchers: 5
- Forks: 3
- Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE
- Roadmap: ROADMAP.md
README
sliceguard
Detect problematic data slices in unstructured and structured data, fast.
## Introduction
Sliceguard helps you to quickly discover **problematic data segments**. It supports structured data as well as unstructured data like images, text or audio. Sliceguard generates an **interactive report** with just a few lines of code:
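```python
from sliceguard import SliceGuard

sg = SliceGuard()
# Search df for problematic slices, using its "image" column as the feature
issues = sg.find_issues(df, features=["image"])

# Generate the interactive report
sg.report()
```

## Quickstart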
Install sliceguard by running `pip install sliceguard[all]`.
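The introduction snippet expects an existing DataFrame `df` with an `"image"` column. As a minimal sketch, assuming `df` is a pandas DataFrame and that the column holds file paths (the quickstart notebooks are the reference for the exact format), such a frame could be built like this. The helper `build_image_frame` and the directory name are hypothetical and not part of sliceguard:

```python
from pathlib import Path

import pandas as pd


def build_image_frame(image_dir: str, pattern: str = "*.png") -> pd.DataFrame:
    """Collect matching file paths under image_dir into a one-column DataFrame.

    Hypothetical helper for illustration; assumes the "image" feature
    is represented by file paths stored as strings.
    """
    # Sort so the row order is reproducible across runs
    paths = sorted(str(p) for p in Path(image_dir).glob(pattern))
    return pd.DataFrame({"image": paths})
```

Calling `df = build_image_frame("data/images")` (a placeholder path) gives a frame that can be passed to `sg.find_issues(df, features=["image"])` as in the introduction.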
Go straight to our quickstart examples for your use case:
* **[Unstructured Data (Images, Audio, Text)](https://github.com/Renumics/sliceguard/blob/main/examples/quickstart_unstructured_data.ipynb)** - **[Interactive Demo](https://huggingface.co/spaces/renumics/sliceguard-unstructured-data)**
* **[Structured Data (Numerical, Categorical Variables)](https://github.com/Renumics/sliceguard/blob/main/examples/quickstart_structured_data.ipynb)** - **[Interactive Demo](https://huggingface.co/spaces/renumics/sliceguard-structured-data)**
* **[Mixed Data (Contains Both)](https://github.com/Renumics/sliceguard/blob/main/examples/quickstart_mixed_data.ipynb)** - **[Interactive Demo](https://huggingface.co/spaces/renumics/sliceguard-mixed-data)**

## Public Roadmap
We maintain a **[public roadmap](https://github.com/Renumics/sliceguard/blob/main/ROADMAP.md)** so you can follow the development of this library.