An open API service indexing awesome lists of open source software.

https://github.com/statmixedml/xgboostlss

An extension of XGBoost to probabilistic modelling
https://github.com/statmixedml/xgboostlss

distributional-regression gamlss machine-learning mixture-density-model multi-target-regression normalizing-flows prediction-intervals probabilistic-forecasts uncertainty-estimation xgboost

Last synced: about 1 year ago
JSON representation

An extension of XGBoost to probabilistic modelling

Awesome Lists containing this project

README

          

![Python Version](https://img.shields.io/badge/python-3.10%20|%20%203.11-lightblue.svg)
![GitHub tag (with filter)](https://img.shields.io/github/v/tag/StatMixedML/XGBoostLSS?label=release&color=lightblue)
Documentation status badge
Unit test status badge
Code coverage status badge
![Pepy Total Downlods](https://img.shields.io/pepy/dt/xgboostlss?label=PyPI%20Downloads%2FMonth&color=green)

#

# XGBoostLSS - An extension of XGBoost to probabilistic modelling
We introduce a comprehensive framework that models and predicts the full conditional distribution of univariate and multivariate targets as a function of covariates. Choosing from a wide range of continuous, discrete, and mixed discrete-continuous distributions, modelling and predicting the entire conditional distribution greatly enhances the flexibility of XGBoost, as it allows to create probabilistic forecasts from which prediction intervals and quantiles of interest can be derived.

## `Features`
:white_check_mark: Estimation of all distributional parameters.

:white_check_mark: Normalizing Flows allow modelling of complex and multi-modal distributions.

:white_check_mark: Mixture-Densities can model a diverse range of data characteristics.

:white_check_mark: Multi-target regression allows modelling of multivariate responses and their dependencies.

:white_check_mark: Zero-Adjusted and Zero-Inflated Distributions for modelling excess of zeros in the data.

:white_check_mark: Automatic derivation of Gradients and Hessian of all distributional parameters using [PyTorch](https://pytorch.org/docs/stable/autograd.html).

:white_check_mark: Automated hyper-parameter search, including pruning, is done via [Optuna](https://optuna.org/).

:white_check_mark: The output of XGBoostLSS is explained using [SHapley Additive exPlanations](https://github.com/dsgibbons/shap).

:white_check_mark: XGBoostLSS provides full compatibility with all the features and functionality of XGBoost.

:white_check_mark: XGBoostLSS is available in Python.

## `News`
:boom: [2024-01-19] Release of XGBoostLSS to [PyPI](https://pypi.org/project/xgboostlss/).

:boom: [2023-08-25] Release of v0.4.0 introduces Mixture-Densities. See the [release notes](https://github.com/StatMixedML/XGBoostLSS/releases) for an overview.

:boom: [2023-07-19] Release of v0.3.0 introduces Normalizing Flows. See the [release notes](https://github.com/StatMixedML/XGBoostLSS/releases) for an overview.

:boom: [2023-06-22] Release of v0.2.2. See the [release notes](https://github.com/StatMixedML/XGBoostLSS/releases) for an overview.

:boom: [2023-06-21] XGBoostLSS now supports multi-target regression.

:boom: [2023-06-07] XGBoostLSS now supports Zero-Inflated and Zero-Adjusted Distributions.

:boom: [2023-05-26] Release of v0.2.1. See the [release notes](https://github.com/StatMixedML/XGBoostLSS/releases) for an overview.

:boom: [2023-05-18] Release of v0.2.0. See the [release notes](https://github.com/StatMixedML/XGBoostLSS/releases) for an overview.

:boom: [2021-12-22] XGBoostLSS now supports estimating the full predictive distribution via [Expectile Regression](https://epub.ub.uni-muenchen.de/31542/1/1471082x14561155.pdf).

:boom: [2021-12-20] XGBoostLSS is initialized with suitable starting values to improve convergence of estimation.

:boom: [2021-12-04] XGBoostLSS now supports automatic derivation of Gradients and Hessians.

:boom: [2021-12-02] XGBoostLSS now supports pruning during hyperparameter optimization.

:boom: [2021-11-14] XGBoostLSS v0.1.0 is released!

## `Installation`
To install the development version, please use
```python
pip install git+https://github.com/StatMixedML/XGBoostLSS.git
```
For the PyPI version, please use
```python
pip install xgboostlss
```

## `Available Distributions`
Our framework is built upon PyTorch and Pyro, enabling users to harness a diverse set of distributional families. XGBoostLSS currently supports the [following distributions](https://statmixedml.github.io/XGBoostLSS/distributions/).

## `How to Use`
Please visit the [example section](https://statmixedml.github.io/XGBoostLSS/examples/Gaussian_Regression/) for guidance on how to use the framework.

## `Documentation`
For more information and context, please visit the [documentation](https://statmixedml.github.io/XGBoostLSS/).

## `Feedback`
We encourage you to provide feedback on how to enhance XGBoostLSS or request the implementation of additional distributions by opening a [new discussion](https://github.com/StatMixedML/XGBoostLSS/discussions).

## `How to Cite`
If you use XGBoostLSS in your research, please cite it as:

```bibtex
@misc{Maerz2023,
author = {Alexander M\"arz},
title = {{XGBoostLSS: An Extension of XGBoost to Probabilistic Modelling}},
year = {2023},
note = {GitHub repository, Version 0.4.0},
howpublished = {\url{https://github.com/StatMixedML/XGBoostLSS}}
}
```

## `Reference Paper`
[![Arxiv link](https://img.shields.io/badge/arXiv-Multi%20Target%20XGBoostLSS%20Regression-color=brightgreen)](https://arxiv.org/abs/2210.06831)

[![Arxiv link](https://img.shields.io/badge/arXiv-Distributional%20Gradient%20Boosting%20Machines-color=brightgreen)](https://arxiv.org/abs/2204.00778)

[![Arxiv link](https://img.shields.io/badge/arXiv-XGBoostLSS%3A%20An%20extension%20of%20XGBoost%20to%20probabilistic%20forecasting-color=brightgreen)](https://arxiv.org/abs/1907.03178)

## `Star History`