https://github.com/ModelOriented/survex

Explainable Machine Learning in Survival Analysis
https://github.com/ModelOriented/survex

biostatistics brier-scores censored-data cox-model cox-regression explainable-ai explainable-machine-learning explainable-ml explanatory-model-analysis interpretable-machine-learning interpretable-ml machine-learning probabilistic-machine-learning r r-package shap survival-analysis time-to-event variable-importance xai

Last synced: 9 months ago
JSON representation

Explainable Machine Learning in Survival Analysis

Host: GitHub
URL: https://github.com/ModelOriented/survex
Owner: ModelOriented
License: gpl-3.0
Created: 2022-08-31T08:07:03.000Z (over 3 years ago)
Default Branch: main
Last Pushed: 2024-06-15T13:15:47.000Z (over 1 year ago)
Last Synced: 2025-04-14T10:42:55.610Z (10 months ago)
Topics: biostatistics, brier-scores, censored-data, cox-model, cox-regression, explainable-ai, explainable-machine-learning, explainable-ml, explanatory-model-analysis, interpretable-machine-learning, interpretable-ml, machine-learning, probabilistic-machine-learning, r, r-package, shap, survival-analysis, time-to-event, variable-importance, xai
Language: R
Homepage: https://modeloriented.github.io/survex
Size: 309 MB
Stars: 111
Watchers: 6
Forks: 10
Open Issues: 8
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

AwesomeResponsibleAI - survex

README

          # survex: Explainable Machine Learning in Survival Analysis 

[![R-CMD-check](https://github.com/ModelOriented/survex/actions/workflows/R-CMD-check.yaml/badge.svg)](https://github.com/ModelOriented/survex/actions/workflows/R-CMD-check.yaml)

[![Codecov test coverage](https://codecov.io/gh/ModelOriented/survex/branch/main/graph/badge.svg)](https://app.codecov.io/gh/ModelOriented/survex?branch=main)

[![CRAN status](https://www.r-pkg.org/badges/version/survex)](https://cran.r-project.org/package=survex)

[![Total downloads](https://cranlogs.r-pkg.org/badges/grand-total/survex?color=orange)](https://cranlogs.r-pkg.org/badges/grand-total/survex)

[![DrWhy-BackBone](https://img.shields.io/badge/DrWhy-BackBone-373589)](http://drwhy.ai/#BackBone)

## Overview 

Survival analysis is a task dealing with time-to-event prediction. Aside from the well-understood models like CPH, many more complex models have recently emerged, but most lack interpretability. Due to a functional type of prediction, either in the form of a survival function or a cumulative hazard function, standard model-agnostic explanations cannot be applied directly.

The `survex` package provides model-agnostic explanations for machine learning survival models. It is based on the [`DALEX` package](https://github.com/ModelOriented/DALEX). If you're unfamiliar with explainable machine learning, consider referring to the [Explanatory Model Analysis](https://ema.drwhy.ai) book -- most of the methods included in `survex` extend these described in EMA and implemented in `DALEX` but to models with functional output. 

The main `explain()` function uses a model and data to create a standardized `explainer` object, which is further used as an interface for calculating predictions. We automate creating explainers from the following packages: `mlr3proba`, `censored`, `ranger`, `randomForestSRC`, and `survival`. **Raise an Issue on GitHub if you find models from other packages that we can incorporate into the `explain()` interface.**

Note that an explainer can be created for **any** survival model, using the `explain_survival()` function by passing `model`, `data`, `y`, and `predict_survival_function` arguments.

## Installation

The package is available on [CRAN](https://cran.r-project.org/package=survex):

```r

install.packages("survex")

```

The latest development version can be installed from GitHub using `devtools::install_github()`:

```r

devtools::install_github("https://github.com/ModelOriented/survex")

```

## Simple demo

```r

library("survex")

library("survival")

library("ranger")

# create a model

model <- ranger(Surv(time, status) ~ ., data = veteran)

# create an explainer

explainer <- explain(model, 

                     data = veteran[, -c(3, 4)],

                     y = Surv(veteran$time, veteran$status))

# evaluate the model

model_performance(explainer)

# visualize permutation-based feature importance

plot(model_parts(explainer))

# explain one prediction with SurvSHAP(t)

plot(predict_parts(explainer, veteran[1, -c(3, 4)]))

```

## Functionalities and roadmap

Existing functionalities:

- [x] unified prediction interface using the explainer object - `predict()`

- [x] calculation of performance metrics (Brier Score, Time-dependent C/D AUC, metrics from `mlr3proba`) - `model_performance()`

- [x] calculation of feature importance (Permutation Feature Importance - PFI) - `model_parts()`

- [x] calculation of partial dependence (Partial Dependence Profiles - PDP, Accumulated Local Effects - ALE) - `model_profile()`

- [x] calculation of 2-dimensional partial dependence (2D PDP, 2D ALE) - `model_profile_2d()`

- [x] calculation of local feature attributions (SurvSHAP(t), SurvLIME) - `predict_parts()`

- [x] calculation of local ceteris paribus explanations (Ceteris Paribus profiles - CP/ Individual Conditional Expectations - ICE) - `predict_profile()`

- [x] calculation of global feature attributions using SurvSHAP(t) - `model_survshap()`

Currently in develompment:

- [ ] ...

Future plans:

- [ ] ... (raise an Issue on GitHub if you have any suggestions)

- [ ] examples for sursvm and survboost (https://github.com/ModelOriented/survex/issues/88)

## Usage

[![`survex` usage cheatsheet](man/figures/cheatsheet.png)](https://github.com/ModelOriented/survex/blob/main/misc/cheatsheet.pdf)

## Citation

If you use `survex`, please cite [our article](https://doi.org/10.1093/bioinformatics/btad723):

> M. Spytek, M. Krzyziński, S. H. Langbein, H. Baniecki, M. N. Wright, P. Biecek. *survex: an R package for explaining machine learning survival models*. **Bioinformatics**, Volume 39, Issue 12, btad723, 2023.

```

@article{spytek2023survex,

    author  = {Mikołaj Spytek and Mateusz Krzyziński and Sophie Hanna Langbein and 

               Hubert Baniecki and Marvin N Wright and Przemysław Biecek},

    title   = {survex: an {R} package for explaining machine learning survival models},

    journal = {Bioinformatics},

    volume  = {39},

    number  = {12},

    pages   = {btad723},

    year    = {2023},

    month   = {12},

    doi     = {10.1093/bioinformatics/btad723}

}

```

## Applications of `survex`

- H. Baniecki, B. Sobieski, P. Bombiński, P. Szatkowski, P. Biecek. [Hospital Length of Stay Prediction Based on Multi-modal Data towards Trustworthy Human-AI Collaboration in Radiomics](https://arxiv.org/abs/2303.09817). *International Conference on Artificial Intelligence in Medicine*, 2023.

- W. Chen, B. Zhou, C. Y. Jeon, F. Xie, Y.-C. Lin, R. K. Butler, Y. Zhou, T. Q. Luong, E. Lustigova, J. R. Pisegna, B. U. Wu. [Machine learning versus regression for prediction of sporadic pancreatic cancer](https://doi.org/10.1016/j.pan.2023.04.009). *Pancreatology*, 2023.

- M. Nachit, Y. Horsmans, R. M. Summers, I. A. Leclercq, P. J. Pickhardt. [AI-based CT Body Composition Identifies Myosteatosis as Key Mortality Predictor in Asymptomatic Adults](https://doi.org/10.1148/radiol.222008). *Radiology*, 2023.

- R. Passera, S. Zompi, J. Gill, A. Busca. [Explainable Machine Learning (XAI) for Survival in Bone Marrow Transplantation Trials: A Technical Report](https://doi.org/10.3390/biomedinformatics3030048). *BioMedInformatics*, 2023.

- P. Donizy, M. Spytek, M. Krzyziński, K. Kotowski, A. Markiewicz, B. Romanowska-Dixon, P. Biecek, M. P. Hoang. [Ki67 is a better marker than PRAME in risk stratification of BAP1-positive and BAP1-loss uveal melanomas](http://dx.doi.org/10.1136/bjo-2023-323816). *British Journal of Ophthalmology*, 2023.

- X. Qi, Y. Ge, A. Yang, Y. Liu, Q. Wang, G. Wu. [Potential value of mitochondrial regulatory pathways in the clinical application of clear cell renal cell carcinoma: a machine learning-based study](https://doi.org/10.1007/s00432-023-05393-8). *Journal of Cancer Research and Clinical Oncology*, 2023.

- C. C. Lee, S. Y. Su, S. F. Sung. [Machine learning-based survival analysis approaches for predicting the risk of pneumonia post-stroke discharge](https://doi.org/10.1016/j.ijmedinf.2024.105422). *International Journal of Medical Informatics*, 2024.

- P. Wang, X. Qian, W. Jiang, H. Wang, Y. Wang, Y. Zhou, Y. Zhang, Y. Huang, X. Zhai. [Cord Blood Transplantation for Very Early‑Onset Inflammatory Bowel Disease Caused by Interleukin‑10 Receptor Deficiency](https://doi.org/10.1007/s10875-024-01669-x). *Journal of Clinical Immunology*, 2024.

- E. Ruiz, J. Honles, R. Fernández, K. Uribe, J. P. Cerapio, K. Cancino, J. Contreras-Mancilla, S. Casavilca-Zambrano, F. Berrospi, P. Pineau, S. Bertani. [A preoperative risk score based on early recurrence for estimating outcomes after resection of hepatocellular carcinoma in the non-cirrhotic liver](https://doi.org/10.1016/j.hpb.2024.02.010). *HPB*, 2024.

- Share it with us!

## Related work

- H. Ishwaran, U. B. Kogalur, E. H. Blackstone, M. S. Lauer. [Random survival forests](https://doi.org/10.1214/08-AOAS169). *Annals of Applied Statistics*, 2008.

- A. Grudziąż, A. Gosiewska, P. Biecek. [survxai: an R package for structure-agnostic explanations of survival models](https://doi.org/10.21105/joss.00961). *Journal of Open Source Software*, 2018.

- M. S. Kovalev, L. V. Utkin, E. M. Kasimov. [SurvLIME: A method for explaining machine learning survival models](https://doi.org/10.1016/j.knosys.2020.106164). *Knowledge-Based Systems*, 2020.

- R. Sonabend, F. J. Király, A. Bender, B. Bischl, M. Lang. [mlr3proba: an R package for machine learning in survival analysis](https://doi.org/10.1093/bioinformatics/btab039). *Bioinformatics*, 2021.

- E. Hvitfeldt, H. Frick. [censored: 'parsnip' Engines for Survival Models](https://github.com/tidymodels/censored). *CRAN v0.1.0*, 2022.

- M. Krzyziński, M. Spytek, H. Baniecki, P. Biecek. [SurvSHAP(t): Time-dependent explanations of machine learning survival models](https://doi.org/10.1016/j.knosys.2022.110234). *Knowledge-Based Systems*, 2023.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/ModelOriented/survex

Awesome Lists containing this project

README