https://github.com/saattrupdan/doubt

Bringing back uncertainty to machine learning.
https://github.com/saattrupdan/doubt

bootstrap confidence-intervals machine-learning prediction-intervals quantile-regression quantile-regression-forests uncertainty

Last synced: 7 months ago
JSON representation

Bringing back uncertainty to machine learning.

Host: GitHub
URL: https://github.com/saattrupdan/doubt
Owner: saattrupdan
License: mit
Created: 2020-03-29T16:31:15.000Z (over 5 years ago)
Default Branch: main
Last Pushed: 2024-06-10T08:31:42.000Z (about 1 year ago)
Last Synced: 2024-12-04T12:11:43.712Z (7 months ago)
Topics: bootstrap, confidence-intervals, machine-learning, prediction-intervals, quantile-regression, quantile-regression-forests, uncertainty
Language: Python
Homepage:
Size: 2.28 MB
Stars: 50
Watchers: 3
Forks: 3
Open Issues: 1
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- License: LICENSE

Awesome Lists containing this project

README

        # Doubt

*Bringing back uncertainty to machine learning.*

______________________________________________________________________

[![PyPI Status](https://badge.fury.io/py/doubt.svg)](https://pypi.org/project/doubt/)

[![Documentation](https://img.shields.io/badge/docs-passing-green)](https://saattrupdan.github.io/doubt/doubt.html)

[![License](https://img.shields.io/github/license/saattrupdan/doubt)](https://github.com/saattrupdan/doubt/blob/main/LICENSE)

[![LastCommit](https://img.shields.io/github/last-commit/saattrupdan/doubt)](https://github.com/saattrupdan/doubt/commits/main)

[![Code Coverage](https://img.shields.io/badge/Coverage-66%25-yellow.svg)](https://github.com/saattrupdan/doubt/tree/dev/tests)

[![Conference](https://img.shields.io/badge/Conference-AAAI23-blue)](https://doi.org/10.1609/aaai.v37i12.26755)

A Python package to include prediction intervals in the predictions of machine

learning models, to quantify their uncertainty.

## Installation

You can install `doubt` with `pip`:

```shell

pip install doubt

```

If you want to be able to use the preprocessed regression datasets as well, you install

it with the `datasets` extra:

```shell

pip install doubt[datasets]

```

## Features

- Bootstrap wrapper for all Scikit-Learn models

    - Can also be used to calculate usual bootstrapped statistics of a dataset

- Quantile Regression for all generalised linear models

- Quantile Regression Forests

- A uniform dataset API, with 24 regression datasets and counting

## Quick Start

If you already have a model in Scikit-Learn, then you can simply

wrap it in a `Boot` to enable predicting with prediction intervals:

```python

>>> from sklearn.linear_model import LinearRegression

>>> from doubt import Boot

>>> from doubt.datasets import PowerPlant

>>>

>>> X, y = PowerPlant().split()

>>> clf = Boot(LinearRegression())

>>> clf = clf.fit(X, y)

>>> clf.predict([10, 30, 1000, 50], uncertainty=0.05)

(481.9203102126274, array([473.43314309, 490.0313962 ]))

```

Alternatively, you can use one of the standalone models with uncertainty

outputs. For instance, a `QuantileRegressionForest`:

```python

>>> from doubt import QuantileRegressionForest as QRF

>>> from doubt.datasets import Concrete

>>> import numpy as np

>>>

>>> X, y = Concrete().split()

>>> clf = QRF(max_leaf_nodes=8)

>>> clf.fit(X, y)

>>> clf.predict(np.ones(8), uncertainty=0.25)

(16.933590347847982, array([ 8.93456428, 26.0664534 ]))

```

## Citation

```

@inproceedings{mougannielsen2023monitoring,

  title={Monitoring Model Deterioration with Explainable Uncertainty Estimation via Non-parametric Bootstrap},

  author={Mougan, Carlos and Nielsen, Dan Saattrup},

  booktitle={AAAI Conference on Artificial Intelligence},

  year={2023}

}

```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/saattrupdan/doubt

Awesome Lists containing this project

README