Projects in Awesome Lists by cleanlab
A curated list of projects in awesome lists by cleanlab .
https://github.com/cleanlab/cleanlab
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
active-learning annotation data-centric-ai data-cleaning data-curation data-labeling data-profiling data-quality data-science data-validation dataops dataquality datasets exploratory-data-analysis labeling llms noisy-labels out-of-distribution-detection outlier-detection weak-supervision
Last synced: 12 May 2025
https://github.com/cleanlab/cleanvision
Automatically find issues in image datasets and practice data-centric computer vision.
computer-vision data-centric-ai data-exploration data-profiling data-quality data-science data-validation deep-learning exploratory-data-analysis image-analysis image-classification image-generation image-quality image-segmentation
Last synced: 09 Apr 2025
https://github.com/cleanlab/label-errors
🛠️ Corrected Test Sets for ImageNet, MNIST, CIFAR, Caltech-256, QuickDraw, IMDB, Amazon Reviews, 20News, and AudioSet
benchmarking datasets label-errors machine-learning
Last synced: 26 Mar 2025
https://github.com/cleanlab/examples
Notebooks demonstrating example applications of the cleanlab library
Last synced: 09 Apr 2025
https://github.com/cleanlab/multiannotator-benchmarks
Benchmarking algorithms for assessing quality of data labeled by multiple annotators
Last synced: 13 Apr 2025
https://github.com/cleanlab/cleanlab-studio
Client interface to Cleanlab Studio and the Trustworthy Language Model
annotations automl computer-vision data-centric-ai data-cleaning data-curation data-labeling data-profiling data-quality data-science data-validation image-classification llm machine-learning model-deployment natural-language-processing noisy-labels outlier-detection structured-data text-classification
Last synced: 13 Apr 2025
https://github.com/cleanlab/cleanvision-examples
Notebooks demonstrating example applications of the cleanvision library
Last synced: 13 Apr 2025
https://github.com/cleanlab/cleanlab-tools
Cookbooks showcasing various applications of Cleanlab
Last synced: 13 Apr 2025
https://github.com/cleanlab/cleanlab-tlm
Python client library for Cleanlab Trustworthy Language Model
Last synced: 13 Apr 2025
https://github.com/cleanlab/vizzy
Cleanlab Vizzy: illustrating the core ideas behind the Cleanlab algorithm
Last synced: 26 Mar 2025
https://github.com/cleanlab/cleanlab-codex
Python client library to integrate Cleanlab Codex into RAG applications
Last synced: 13 Apr 2025
https://github.com/cleanlab/ood-detection-benchmarks
Evaluation of algorithms to detect out-of-distribution data
Last synced: 13 Apr 2025
https://github.com/cleanlab/aws-marketplace
Documentation and Example Notebooks for using AWS Marketplace solutions from Cleanlab
Last synced: 13 Apr 2025
https://github.com/cleanlab/cleanlab-studio-tutorials
Automated repo - do not touch
Last synced: 13 Apr 2025
https://github.com/cleanlab/multilabel-error-detection-benchmarks
Benchmarking label error detection algorithms for multi-label classification
Last synced: 26 Mar 2025
https://github.com/cleanlab/token-label-error-benchmarks
Benchmarking methods for label error detection in token classification tasks
Last synced: 26 Mar 2025
https://github.com/cleanlab/assets
Binary assets (e.g. README images) factored out into a separate repository
Last synced: 26 Mar 2025
https://github.com/cleanlab/datasets
Repo where small data files can be downloaded
Last synced: 26 Mar 2025
https://github.com/cleanlab/sandbox-cleanlab-studio
Sandbox repo for cleanlab studio.
Last synced: 26 Mar 2025
https://github.com/cleanlab/stash
Miscellaneous code made available for purposes of education, reproducibility, and transparency
Last synced: 26 Mar 2025
https://github.com/cleanlab/regression-label-error-benchmark
Benchmark algorithms to detect erroneous label values in regression datasets
Last synced: 26 Mar 2025
https://github.com/cleanlab/cleanlab-frontend-scaffolding
Starter code for the senior frontend technical challenge.
Last synced: 26 Mar 2025
https://github.com/cleanlab/control-plane
Repository for Cleanlab control plane services (billing, telemetry, user management)
Last synced: 26 Mar 2025