https://github.com/zapiano/sashe.jl

A Julia package for sensitivity analysis wih Shapley effects.
https://github.com/zapiano/sashe.jl

sensitivity-analysis shapley-effect shapley-values statistics

Last synced: 7 months ago
JSON representation

A Julia package for sensitivity analysis wih Shapley effects.

Host: GitHub
URL: https://github.com/zapiano/sashe.jl
Owner: Zapiano
License: mit
Created: 2025-06-29T09:14:19.000Z (11 months ago)
Default Branch: main
Last Pushed: 2025-10-20T22:47:13.000Z (7 months ago)
Last Synced: 2025-10-21T11:59:21.837Z (7 months ago)
Topics: sensitivity-analysis, shapley-effect, shapley-values, statistics
Language: Julia
Homepage:
Size: 51.8 KB
Stars: 4
Watchers: 1
Forks: 0
Open Issues: 2
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

          # SAShE.jl

[![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.16777876.svg)](https://doi.org/10.5281/zenodo.16777876)

This package performs a Sensitivity Analysis using Shapley Effects given a model in the form of a function, referred to here as `my_model`, that accepts a vector of factors `X`. The approach implemented here was presented in [1]. If the user is using `Distributed` and has added some procs with `addprocs`, `SAShE.solve` will be run using multiple cores.

The "examples/ishigami.jl" script can be used to compare the result of this implementation with the one from the paper for the Ishigami function.

# Quick start

Assuming a function `my_model` that  accepts a vector of factors `X`

```julia

my_model(X::Vector{Float64}) = X[1] + X[2]^2 + X[3]^3

```

First create two separate sample DataFrames with the same shape:

```julia

using DataFrames

n_factors = 3

n_samples = 1000

X1 = DataFrame(rand(n_samples, n_factors), :auto)

X2 = DataFrame(rand(n_samples, n_factors), :auto)

```

Then create a `SAShE.Problem` instance and solve it:

```julia

using SAShE

sa_problem = SAShE.Problem(my_model, X1, X2)

Φ, Φ², Yₙ = SAShE.solve(sa_problem)

```

The three objects returned are:

- `Φₙ` : The contribution that each sample gives to the each factor's Shapley Effect expected value. The Shapley Effect for each factor, `Φ`, can be calculated by summing all columns of each row of `Φₙ` or simply `SAShE.shapley_effects(Φₙ)`;

- `Φ²ₙ` : The contribution that each sample gives to each Shapley Effect squared expected valued (`E[Φ²]`). This can be used to calculate the [confidence intervals](#shapley-effects-and-confidence-intervals);

- `Yₙ` : The value of the model calculated for each sample on `X1`. Besides being used in the Shapley Effects computation, the variance of `Yₙ` can be compared to the sum of the estimated Shapley Effects `Φ`. If the algorithm has converged, the sum of all Shapley Effects should approach the model's variance.

## Shapley effects and confidence intervals

The function `shapley_effects` returns each factor's Shapley Effect. If used with the second argument `Φ²ₙ`, it also returns the lower and upper bounds of each factor's Shapley Effect confidence interval.

```julia

# Vector of Shapley Effects for each factor

Φ = SAShE.shapley_effects(Φₙ)

# Or with confidence intervals

Φ, Φ_lb , Φ_ub = SAShE.shapley_effects(Φₙ, Φ²ₙ)

```

The confidence intervals and margin of errors can also be accessed directly via:

```julia

# Margin of error

SAShE.margin_of_error(Φₙ, Φ²ₙ)

# Confidence interval

SAShE.confint(Φₙ, Φ²ₙ)

```

# Reference

1. Goda, T. (2021). A simple algorithm for global sensitivity analysis with Shapley effects. Reliability Engineering & System Safety, 213, 107702.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/zapiano/sashe.jl

Awesome Lists containing this project

README