https://github.com/probabl-ai/skore
the scikit-learn sidekick
https://github.com/probabl-ai/skore
data-analysis data-science data-visualization machine-learning python scikit-learn workflow
Last synced: about 1 month ago
JSON representation
the scikit-learn sidekick
- Host: GitHub
- URL: https://github.com/probabl-ai/skore
- Owner: probabl-ai
- License: mit
- Created: 2024-06-17T15:29:38.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2025-05-12T09:39:33.000Z (9 months ago)
- Last Synced: 2025-05-12T10:03:39.334Z (9 months ago)
- Topics: data-analysis, data-science, data-visualization, machine-learning, python, scikit-learn, workflow
- Language: Python
- Homepage: https://docs.skore.probabl.ai
- Size: 10.5 MB
- Stars: 436
- Watchers: 7
- Forks: 69
- Open Issues: 122
-
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.rst
- License: LICENSE
- Codeowners: .github/CODEOWNERS
Awesome Lists containing this project
README


[](https://pepy.tech/projects/skore)
[](https://pypi.org/project/skore/)
[](https://discord.probabl.ai/)
Own Your Data Science
Elevate ML Development with Built-in Recommended Practices \
[Documentation](https://docs.skore.probabl.ai) — [Community](https://discord.probabl.ai) — [YouTube](https://youtube.com/playlist?list=PLSIzlWDI17bTpixfFkooxLpbz4DNQcam3) — [Skore Hub](https://probabl.ai/skore)
## 🎯 Why Skore?
When it comes to data science, you have excellent tools at your disposal: `pandas` and `polars` for data exploration, `skrub` for stateful transformations, and `scikit-learn` for model training and evaluation. These libraries are designed to be generic and accommodate a wide range of use cases.
**But here's the challenge**: Your experience is key to choosing the right building blocks and methodologies. You often spend significant time navigating documentation, writing boilerplate code for common evaluations, and struggling to maintain clear project structure.
**Skore is the conductor** that transforms your data science pipeline into structured, meaningful artifacts. It reduces the time you spend on documentation navigation, eliminates boilerplate code, and guides you toward the right methodological information to answer your questions.
### What Skore does for you:
- **Structures your experiments**: Automatically generates the insights that matter for your use case
- **Reduces boilerplate**: One line of code gives you comprehensive model evaluation
- **Guides your decisions**: Built-in methodological warnings help you avoid common pitfalls
- **Maintains clarity**: Structured project organization makes your work easier to understand and maintain
⭐ Support us with a star and spread the word - it means a lot! ⭐
## 🧩 What is Skore?
The core mission of **Skore** is to turn uneven ML development into structured, effective decision-making. It consists of two complementary components:
- **Skore Lib**: the open-source Python library (described here!) that provides the structured artifacts and methodological guidance for your data science experiments.
- **Skore Hub**: the collaborative platform where teams can share, compare, and build upon each other's structured experiments. Learn more on our [product page](https://probabl.ai/skore).
## ⚡️ Quick start
### Installation
#### With pip
We recommend using a [virtual environment (venv)](https://docs.python.org/3/tutorial/venv.html). You need `python>=3.10`.
Then, you can install skore by using `pip`:
```bash
# If you plan to use Skore locally
pip install -U skore
# If you wish to interact with Skore Hub as well
pip install -U skore[hub]
```
#### With conda
skore is available in `conda-forge` both for local and hub use:
```bash
conda install conda-forge::skore
```
You can find information on the latest version [here](https://anaconda.org/conda-forge/skore).
### Get structured insights from your ML pipeline
Evaluate your model and get comprehensive insights in one line:
```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from skore import CrossValidationReport
X, y = make_classification(n_classes=2, n_samples=100_000, n_informative=4)
clf = LogisticRegression()
# Get structured insights that matter for your use case
cv_report = CrossValidationReport(clf, X, y)
# See what insights are available
cv_report.help()
# Example: Access the metrics summary
metrics_summary = cv_report.metrics.summarize().frame()
# Example: Get the ROC curve
roc_plot = cv_report.metrics.roc()
roc_plot.plot()
```
Learn more in our [documentation](https://docs.skore.probabl.ai).
## 🛠️ Contributing
Join our mission to promote open-source and make machine learning development more robust and effective. If you'd like to contribute, please check the contributing guidelines [here](https://github.com/probabl-ai/skore/blob/main/CONTRIBUTING.rst).
## 👋 Feedback & Community
- Join our [Discord](https://discord.probabl.ai/) to share ideas or get support.
- Request a feature or report a bug via [GitHub Issues](https://github.com/probabl-ai/skore/issues).
## Support
Skore is tested on Linux and Windows, for at most 4 versions of Python, and at most 4 versions of scikit-learn:
- Python 3.10
- scikit-learn 1.4
- scikit-learn 1.7
- Python 3.11
- scikit-learn 1.4
- scikit-learn 1.7
- Python 3.12
- scikit-learn 1.4
- scikit-learn 1.7
- Python 3.13
- scikit-learn 1.5
- scikit-learn 1.6
- scikit-learn 1.7
---
Brought to you by