https://github.com/MarcelRobeer/ContrastiveExplanation

Contrastive Explanation (Foil Trees), developed at TNO/Utrecht University
https://github.com/MarcelRobeer/ContrastiveExplanation

contrastive-explanations explainable-ai foil interpretability machine-learning model-agnostic

Last synced: 11 months ago
JSON representation

Contrastive Explanation (Foil Trees), developed at TNO/Utrecht University

Host: GitHub
URL: https://github.com/MarcelRobeer/ContrastiveExplanation
Owner: MarcelRobeer
License: bsd-3-clause
Created: 2018-07-09T13:44:54.000Z (over 7 years ago)
Default Branch: master
Last Pushed: 2023-01-31T12:03:11.000Z (about 3 years ago)
Last Synced: 2024-04-22T12:32:52.308Z (almost 2 years ago)
Topics: contrastive-explanations, explainable-ai, foil, interpretability, machine-learning, model-agnostic
Language: Python
Homepage:
Size: 133 KB
Stars: 44
Watchers: 4
Forks: 6
Open Issues: 2
Metadata Files:
- Readme: README.md
- License: LICENSE
- Citation: CITATION.cff

Awesome Lists containing this project

awesome-production-machine-learning - ContrastiveExplanation (Foil Trees) - Python script for model agnostic contrastive/counterfactual explanations for machine learning. Accompanying code for the paper ["Contrastive Explanations with Local Foil Trees"](https://arxiv.org/abs/1806.07470). (Explaining Black Box Models and Datasets)
awesome-production-machine-learning - ContrastiveExplanation (Foil Trees) - Python script for model agnostic contrastive/counterfactual explanations for machine learning. Accompanying code for the paper ["Contrastive Explanations with Local Foil Trees"](https://arxiv.org/abs/1806.07470). (Explaining Black Box Models and Datasets)
awesome-python-data-science - Contrastive Explanation - Contrastive Explanation (Foil Trees). <img height="20" src="img/sklearn_big.png" alt="sklearn"> (Model Explanation / NLP)
awesome-machine-learning-interpretability - ContrastiveExplanation - Foil Trees - ![](https://img.shields.io/github/stars/MarcelRobeer/ContrastiveExplanation?style=social) | "provides an explanation for why an instance had the current outcome (fact) rather than a targeted outcome of interest (foil). These counterfactual explanations limit the explanation to the features relevant in distinguishing fact from foil, thereby disregarding irrelevant features.” | (Technical Resources / Open Source/Access Responsible AI Software Packages)
awesome-python-data-science - Contrastive Explanation - Contrastive Explanation (Foil Trees). <img height="20" src="img/sklearn_big.png" alt="sklearn"> (Model Explanation / Others)
awesome-counterfactual-explanations - https://github.com/MarcelRobeer/ContrastiveExplanation - learn, ![](https://img.shields.io/github/stars/MarcelRobeer/ContrastiveExplanation?style=social) (Github Repos)
awesome-utrecht-university - ContrastiveExplanation - Contrastive Explanation (Foil Trees), developed at TNO/Utrecht University (Projects / Research code)

README

          :warning: **WARNING**: This version of Foil Trees is no longer actively maintained. For a re-implementation see [explabox.readthedocs.io](https://explabox.readthedocs.io/en/latest/api/explabox.explain.text.html\#explabox.explain.text.Explainer.explain_prediction).

---

# Contrastive Explanation (Foil Trees)

> **Contrastive and counterfactual explanations for machine learning (ML)**

> 

> Marcel Robeer (2018-2020), *TNO/Utrecht University*

![Travis (.org)](https://img.shields.io/travis/MarcelRobeer/ContrastiveExplanation?style=flat-square) ![License](https://img.shields.io/github/license/marcelrobeer/ContrastiveExplanation?style=flat-square) ![Python version](https://img.shields.io/badge/python-3.6%20%7C%203.7%20%7C%203.8%20%7C%203.9-blue?style=flat-square)

#### Contents

1. [Introduction](#introduction)

2. [Publications: citing this package](#publications)

3. [Example usage](#example)

4. [Documentation: choices for problem explanation](#documentation)

5. [License](#license)



## Introduction 

Contrastive Explanation provides an explanation for why an instance had the current outcome (*fact*) rather than a targeted outcome of interest (*foil*). These *counterfactual* explanations limit the explanation to the features relevant in distinguishing fact from foil, thereby disregarding irrelevant features. The idea of contrastive explanations is captured in this Python package `ContrastiveExplanation`. Example facts and foils are:

Machine Learning (ML) type | Problem | Explainable AI (XAI) question | Fact | Foil

---|---|---|---|---

Classification | Determine type of animal | *Why is this instance a cat rather than a dog?* | Cat | Dog

Regression analysis | Predict students' grade | *Why is the predicted grade for this student 6.5 rather than higher?* | 6.5 | More than 6.5 

Clustering | Find similar flowers | *Why is this flower in cluster 1 rather than cluster 4?* | Cluster 1 | Cluster 4

## Publications 

One scientific paper was published on Contrastive Explanation / Foil Trees:

* J. van der Waa, M. Robeer, J. van Diggelen, M. Brinkhuis, and M. Neerincx, "Contrastive Explanations with Local Foil Trees", in _2018 Workshop on Human Interpretability in Machine Learning (WHI 2018)_, 2018, pp. 41-47. \[Online\]. Available: [http://arxiv.org/abs/1806.07470](http://arxiv.org/abs/1806.07470)

It was developed as part of a Master's Thesis at Utrecht University / TNO:

*  M. Robeer, "Contrastive Explanation for Machine Learning", MSc Thesis, Utrecht University, 2018. \[Online\]. Available: [https://dspace.library.uu.nl/handle/1874/368081](https://dspace.library.uu.nl/handle/1874/368081)

#### Citing this package

```

@inproceedings{vanderwaa2018,

  title={{Contrastive Explanations with Local Foil Trees}},

  author={van der Waa, Jasper and Robeer, Marcel and van Diggelen, Jurriaan and Brinkhuis, Matthieu and Neerincx, Mark},

  booktitle={2018 Workshop on Human Interpretability in Machine Learning (WHI)},

  year={2018}

}

```



## Example usage  

As a simple example, let us explain a Random Forest classifier that determine the type of flower in the well-known *Iris flower classification* problem. The data set comprises 150 instances, each one of three types of flowers (setosa, versicolor and virginica). For each instance, the data set includes four features (sepal length, sepal width, petal length, petal width) and the goal is to determine which type of flower (class) each instance is.

#### Steps

First, train the 'black-box' model to explain

```python

from sklearn import datasets, model_selection, ensemble

seed = 1

# Train black-box model on Iris data

data = datasets.load_iris()

train, test, y_train, y_test = model_selection.train_test_split(data.data, 

                                                                data.target, 

                                                                train_size=0.80, 

                                                                random_state=seed)

model = ensemble.RandomForestClassifier(random_state=seed)

model.fit(train, y_train)

```

Next, perform contrastive explanation on the first test instance (`test[0]`) by wrapping the tabular data in a `DomainMapper`, and then using method `ContrastiveExplanation.explain_instance_domain()`

```python

# Contrastive explanation

import contrastive_explanation as ce

dm = ce.domain_mappers.DomainMapperTabular(train, 

                                           feature_names=data.feature_names,

					   contrast_names=data.target_names)

exp = ce.ContrastiveExplanation(dm, verbose=True)

sample = test[0]

exp.explain_instance_domain(model.predict_proba, sample)

```

[OUT] *"The model predicted 'setosa' instead of 'versicolor' because 'sepal length (cm) <= 6.517 and petal width (cm) <= 0.868'"*

The predicted class using the `RandomForestClassifier` was 'setosa', while the second most probable class 'versicolor' may have been expected instead. The difference of why the current instance was classified 'setosa' is because its sepal length is less than or equal to 6.517 centimers and its petal width is less than or equal to 0.868 centimers. In other words, if the instance would keep all feature values the same, but change its sepal width to more than 6.517 centimers and its petal width to more than 0.868 centimers, the black-box classifier would have changed the outcome to 'versicolor'.

#### More examples

For more examples, check out the attached [Notebook](https://nbviewer.jupyter.org/github/MarcelRobeer/ContrastiveExplanation/blob/master/Contrastive%20explanation%20-%20example%20usage.ipynb).



## Documentation

Several choices can be made to tailor the explanation to your type of explanation problem.

### Choices for problem explanation 

##### FactFoil

Used for determining the current outcome (fact) and the outcome of interest (foil), based on a `foil_method` (e.g. second most probable class, random class, greater than the current outcome). Foils can also be manually selected by using the `foil=...` optional argument of the `ContrastiveExplanation.explain_instance_domain()` method.

FactFoil | Description | foil_method

---------|-------------|---

`FactFoilClassification` (*default*) | Determine fact and foil for classification/unsupervised learning | `second`, `random`

`FactFoilRegression` | Determine fact and foil for regression analysis | `greater`, `smaller`

##### Explanators

Method for forming the explanation, either using a Foil Tree (`TreeExplanator`) as described in the [paper](http://arxiv.org/abs/1806.07470), or using a prototype (`PointExplanator`, not fully implemented). As multiple explanations hold, one can choose the `foil_strategy` as either 'closest' (shortest explanation), 'size' (move the current outcome to the area containing most samples of the foil outcome), 'impurity' (most informative foil area), or 'random' (random foil area)

Explanator | Description | foil_strategy

-----------|-------------|---

`TreeExplanator` (*default*) | __Foil Tree__: Explain using a decision tree  | `closest`, `size`, `impurity`, `random`

`PointExplanator` | Explain with a representatitive point (prototype) of the foil class | `closest`, `medoid`, `random`

##### Domain Mappers

For handling the different types of data:

- Tabular (rows and columns)

- Images (rudimentary support)

Maps to a general format that the explanator can form the explanation in, and then maps the explanation back into this format. Ensures meaningful feature names.

DomainMapper | Description

-------------|-------------

`DomainMapperTabular` | Tabular data (columns with feature names, rows)

`DomainMapperPandas` | Uses a `pandas` dataframe to create a `DomainMapperTabular`, while automatically inferring feature names

`DomainMapperImage` | Image data



## License

ContrastiveExplanation is [BSD-3 Licensed](https://github.com/MarcelRobeer/ContrastiveExplanation/blob/master/LICENSE).

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/MarcelRobeer/ContrastiveExplanation

Awesome Lists containing this project

README