https://github.com/utkukose/gemex

GEMEX is a novel, model-agnostic Explainable AI (XAI) library grounded in Riemannian information geometry and differential geometry. It treats a trained model as defining a statistical manifold equipped with the Fisher Information Metric and derives all explanations from the intrinsic geometry of that manifold.
https://github.com/utkukose/gemex
differential-geometry explainable-ai explainable-artificial-intelligence explainable-machine-learning explainable-ml fisher-information-metric gemex manifold model-agnostic model-agnostic-explanations riemannian-geometry riemannian-information-geometry xai
Last synced: about 2 months ago
JSON representation
Host: GitHub
URL: https://github.com/utkukose/gemex
Owner: utkukose
License: other
Created: 2026-04-03T15:00:53.000Z (about 2 months ago)
Default Branch: main
Last Pushed: 2026-04-04T15:41:04.000Z (about 2 months ago)
Last Synced: 2026-04-04T18:31:00.267Z (about 2 months ago)
Topics: differential-geometry, explainable-ai, explainable-artificial-intelligence, explainable-machine-learning, explainable-ml, fisher-information-metric, gemex, manifold, model-agnostic, model-agnostic-explanations, riemannian-geometry, riemannian-information-geometry, xai
Language: Python
Homepage:
Size: 8.19 MB
Stars: 0
Watchers: 0
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Citation: CITATION.cff
Awesome Lists containing this project

README

          




# GEMEX

### Geodesic Entropic Manifold Explainability

*The first XAI library grounded in Riemannian information geometry*

[![Python 3.8+](https://img.shields.io/badge/python-3.8%2B-blue.svg)](https://python.org)

[![License: MIT](https://img.shields.io/badge/license-MIT-green.svg)](LICENSE)

[![Version](https://img.shields.io/badge/version-1.2.1-orange.svg)](pyproject.toml)

[![ORCID](https://img.shields.io/badge/ORCID-0000--0002--9652--6415-a6ce39.svg)](https://orcid.org/0000-0002-9652-6415)

[![Submitted to: IEEE HORA 2026](https://img.shields.io/badge/IEEE-HORA%202026-blue.svg)](#publication)

**Prof. Dr. Utku Kose**

Suleyman Demirel University, Turkey  ·  University of North Dakota, USA

 ·  VelTech University, India  ·  Universidad Panamericana, Mexico

[utkukose@gmail.com](mailto:utkukose@gmail.com)  ·  [www.utkukose.com](http://www.utkukose.com)



---

**GEMEX** explains *any* machine learning model — tabular data, time series, or

images — by measuring the **true curved geometry** of its prediction surface,

rather than approximating it as flat.

```bash

pip install gemex

```

---

## Quick start

```python

from gemex import Explainer, GemexConfig

cfg = GemexConfig(n_geodesic_steps=20, n_reference_samples=60,

                  interaction_order=2)

exp = Explainer(model, data_type='tabular',

                feature_names=feature_names,

                class_names=['No','Yes'], config=cfg)

result = exp.explain(x_instance, X_reference=X_train)

print(result.summary())

print(result.top_features(5))

print(result.top_interactions(3))

print(f"Ricci: {result.manifold_curvature:.4f}  FIM: {result.fim_quality}")

# All 13 visualisation types — dark and light themes

result.plot("gsf_bar",             theme="dark")

result.plot("force",               theme="dark")

result.plot("waterfall",           theme="dark")

result.plot("heatmap",             theme="dark")

result.plot("beeswarm",            theme="dark", batch_results=batch)

result.plot("network",             theme="dark")

result.plot("curvature",           theme="dark")

result.plot("attention_heatmap",   theme="dark")

result.plot("attention_dwell",     theme="dark")

result.plot("attention_vs_effect", theme="dark")

result.plot("bias",                theme="dark")

result.plot("image_trio",          theme="dark")  # image data only

result.plot("triplet_hypergraph",  theme="dark")  # interaction_order=3

```

---

## All 13 visualisations — what they show and how to read them

### 1 · GSF Attribution Bar Chart (`gsf_bar`)







**What it shows:** Each bar is a signed **Geodesic Sensitivity Field (GSF)** score

— how strongly each feature pushes the model's prediction toward or against the

predicted class, measured by integrating directional sensitivity along the curved

geodesic path on the statistical manifold.

**How to read it:**

- 🟢 **Green bar** → feature *supports* the predicted class. Longer = stronger.

- 🔴 **Red/orange bar** → feature *opposes* the predicted class.

- **Error bars** → curvature-weighted geometric uncertainty. Width reflects how

  much the manifold curves in that feature's direction. Wide bars mean the model's

  surface is geometrically complex near this feature — a confidence indicator

  *exclusive to GEMEX* (unavailable in SHAP or LIME).

- Features are **sorted by absolute importance** (most important at bottom).

- Unlike SHAP values, GSF scores are **reparametrisation-invariant**: rescaling a

  feature (e.g. cm → metres) does not change its attribution.

---

### 2 · Force Plot (`force`)







**What it shows:** A waterfall-style push/pull visualisation showing how the

prediction probability is built up from a baseline value through the contributions

of each feature.

**How to read it:**

- The plot reads left-to-right from baseline probability to final prediction.

- 🟢 **Green segments** push the probability upward (toward the predicted class).

- 🔴 **Red segments** pull it downward.

- Feature order follows actual attribution magnitude — the largest movers appear first.

- The final bar position shows the predicted probability.

- Useful for explaining a single prediction to a non-technical stakeholder: "your

  prediction starts at the average of 0.5 and these factors moved it to 0.73."

---

### 3 · Waterfall Chart (`waterfall`)

**What it shows:** Cumulative attribution from baseline to prediction, shown as

a stacked bar where each feature's contribution is added sequentially.

**How to read it:**

- Start at the left (baseline f(x₀)) and read right to the final prediction f(x).

- Each step shows the marginal contribution of one feature along the geodesic.

- Unlike the force plot, waterfall shows the exact numerical accumulation at each step.

- Useful for verifying completeness: the cumulative sum should approach f(x) − f(x₀).

---

### 4 · Feature × Instance Heatmap (`heatmap`)

**What it shows:** A 2D heatmap where each row is one instance and each column

is one feature. Cell colour encodes the signed GSF attribution.

**How to read it:**

- **Bright warm colours** → strong positive attribution (feature supports prediction).

- **Cool/dark colours** → negative or near-zero attribution.

- **Consistent bright columns** → features the model reliably uses across instances.

- **Variable columns** → feature influence is instance-specific.

- Most useful with 10+ instances to reveal global patterns and outliers.

---

### 5 · Beeswarm Distribution (`beeswarm`)







**What it shows:** A global view of attribution distributions across a batch of

instances — the GEMEX equivalent of SHAP's beeswarm summary plot.

**How to read it:**

- Each dot is one instance. Dots are spread vertically to avoid overlap.

- **Horizontal position** → GSF attribution value (positive right, negative left).

- **Dot colour** → feature value (warm = high, cool = low) using the original scale.

- Wide horizontal spread = feature has variable importance across instances.

- Narrow cluster near zero = feature rarely matters regardless of its value.

- Requires `batch_results=batch` argument. Run `exp.explain_batch(X_test[:50])` first.

---

### 6 · Feature Interaction Network (`network`)







**What it shows:** A graph of pairwise feature interactions measured via

**Parallel Transport Interaction (PTI)** — the holonomy angle accumulated when

parallel-transporting a feature's attribution vector around a closed loop on the

statistical manifold.

**How to read it:**

- **Node size** → absolute GSF importance. Larger = more important.

- 🟢 **Green node** → feature supports the prediction.

- 🔴 **Red node** → feature opposes the prediction.

- **Edge thickness** → strength of the nonlinear interaction. Thick = strong

  co-dependency that no additive method (SHAP, LIME) can capture.

- 🟡 **Yellow edge** → amplifying interaction (features reinforce each other).

- 🟣 **Purple edge** → suppressing interaction (features partially cancel).

- **Dashed ring** → Bias Trap risk (geodesic over-attends this feature).

- **"HUB" label** → high geometric interaction with many other features.

---

### 7 · Geodesic Arc-Length Profile (`curvature`)







**What it shows:** Two panels: (left) how the geodesic path length evolves from

baseline to instance for multiple instances; (right) the distribution of Ricci

scalars across a batch.

**How to read it:**

- The **dashed diagonal** = perfectly flat manifold (what SHAP and Integrated

  Gradients assume). Deviations reveal genuine curvature.

- **Deviation above diagonal** → model surface curves upward; high sensitivity region.

- **Sharp steps** → decision boundary crossings (visible in GBM/Random Forest).

  GEMEX follows the actual path over these; SHAP walks straight through them.

- **Right panel (Ricci histogram)** → higher = more curved decision boundary.

  High Ricci = prediction in a geometrically complex region requiring careful interpretation.

  *This metric does not exist in SHAP, LIME, or GradCAM.*

---

### 8 · Feature Attention Heatmap (`attention_heatmap`)







**What it shows:** The **Feature Attention Sequence (FAS)** — how long the

geodesic path "dwells" near each feature's axis during the journey from baseline

to the explained instance. A form of geometric attention map.

**How to read it:**

- **Bright yellow cell** → geodesic spends a long time moving in this feature's

  direction — the model is geometrically "attentive" to it along the path.

- **Dark cell** → resolved quickly; feature's influence is concentrated and local.

- **Rows** = instances; **columns** = features.

- Stable bright columns (same across rows) = features the model consistently uses.

- Variable columns = instance-specific dependencies.

- Unusually bright cells that do not correspond to high GSF scores may indicate

  Bias Trap risk (see plot 11).

---

### 9 · Attention Dwell Bar Chart (`attention_dwell`)

**What it shows:** Per-feature geodesic dwell time as a simple bar chart for one

instance — a more readable version of one row from the heatmap.

**How to read it:**

- Taller bar = geodesic spends more time near this feature axis.

- Compare to the GSF bar chart: features with high dwell but low GSF are potential

  Bias Trap candidates — the model "thinks about" them a lot but doesn't assign them

  much final importance.

- Useful for understanding the model's internal geometric reasoning process.

---

### 10 · Attention vs Effect Scatter (`attention_vs_effect`)

**What it shows:** A scatter plot of geodesic dwell time (x-axis) vs absolute

GSF attribution (y-axis) for all features in one instance.

**How to read it:**

- **Top-right quadrant** → high dwell, high effect: features the model both

  attends to and acts on. These are the most geometrically important features.

- **Top-left quadrant** → high effect, low dwell: the model resolves these features

  quickly but strongly. Sharp, local decision boundaries.

- **Bottom-right quadrant** → high dwell, low effect: **potential Bias Trap**.

  The model spends geometric attention here but produces little final output.

- **Bottom-left quadrant** → low dwell, low effect: unimportant features.

---

### 11 · Bias Trap Detection (`bias`)







**What it shows:** Features where the model has developed **geometric bias** —

the geodesic path spends disproportionately long near a feature's axis relative

to its actual effect on the prediction. A novel GEMEX-exclusive diagnostic.

**How to read it:**

- **High risk bar** → dwell time is disproportionate to effect. The model may be

  over-relying on this feature in a geometrically unstable way.

  *Flag for review — especially if the feature is a sensitive attribute (age, sex, ethnicity).*

- **Low risk** → dwell and effect are proportionate. Attribution is geometrically stable.

- This plot has no equivalent in any other XAI library. The concept draws from the

  Riemannian geometry of the model's statistical manifold and cannot be computed

  from SHAP values, LIME coefficients, or gradient maps.

---

### 12 · GeodesicCAM / ManifoldSeg / PerturbFlow (`image_trio`)

**What it shows:** Three-panel image explanation for `data_type='image'`:

- **Panel 1 (Original)** — the input image.

- **Panel 2 (GeodesicCAM)** — GSF attribution upsampled to pixel space. Bright

  regions = model most sensitive here along the geodesic path. *Model-agnostic

  equivalent of GradCAM — no CNN access required.*

- **Panel 3 (ManifoldSeg)** — iso-information regions: areas where the model's

  prediction sensitivity is approximately equal. Similar to a level-set of the

  probability function on the manifold.

- **Panel 4 (PerturbFlow)** — geodesic gradient field: arrows show how pixel

  changes would propagate on the manifold surface.

**How to read it:** Bright GeodesicCAM regions are the most discriminative for

the prediction. ManifoldSeg boundaries separate regions of qualitatively different

model sensitivity. PerturbFlow arrows show the local direction of maximum

prediction change.

---

### 13 · Triplet Hypergraph (`triplet_hypergraph`)

**What it shows:** Three-way feature interactions computed via the

**Riemannian Curvature Triplet (RCT)** — a visualisation of the Riemann curvature

tensor projected onto feature triplets. Requires `interaction_order=3`.

**How to read it:**

- Each triangle represents a feature triplet (i, j → k).

- **Gold triangle** → synergistic triplet (positive curvature): the three features

  together produce more than their pairwise sum.

- **Purple triangle** → antagonistic triplet (negative curvature): features partially

  cancel each other's three-way contribution.

- Triangle opacity encodes magnitude.

- Node size encodes absolute GSF importance.

- *GEMEX is the only XAI library that computes three-way feature interactions.*

---

## Validated comparative results

> **These results are to appear in a peer-reviewed study**. Currently submitted to IEEE HORA 2026.

> Three datasets × three model families (GBM, MLP, DeepMLP) × 5 random seeds.

### Key finding 1 — Stability: GEMEX is 6–8× more consistent than SHAP

Across all three datasets and all model families, GEMEX produces the most

consistent explanations for similar inputs (Lipschitz stability metric, lower is

better):

| Dataset | GEMEX | SHAP | Ratio |

|---|---|---|---|

| Pima Diabetes (GBM) | **0.145** | 0.891 | 6.1× |

| Heart Disease (GBM) | **0.139** | 1.064 | 7.7× |

| Breast Cancer (GBM) | **0.304** | 2.397 | 7.9× |

In clinical settings this matters directly — a clinician comparing two similar

patients should not receive contradictory attribution outputs from the same method.

### Key finding 2 — Monotonicity improves with model smoothness

GEMEX monotonicity (directional accuracy of attributions) improves substantially

on smooth neural network models, reaching 0.538 (MLP) and 0.587 (DeepMLP) on

Heart Disease vs 0.125 on GBM. This reflects the geodesic integrator's superior

performance on smooth probability surfaces where the path does not cross

piecewise-constant boundaries.

### Key finding 3 — Ricci scalar tracks model geometry

The Ricci scalar decreases as model depth increases on Heart Disease:

GBM 0.563 → MLP 0.355 → DeepMLP 0.310. This reflects the well-known

tendency of deeper networks toward flatter probability landscapes, and GEMEX

is the only XAI tool that captures this as a quantitative geometric signal.

### Understanding the evaluation metrics

| Metric | ↑/↓ | What it measures | Reference |

|---|---|---|---|

| **Faithfulness** | ↑ higher | Spearman correlation between attribution rank and prediction drop when features removed in that order | Alvarez-Melis & Jaakkola (2018, NeurIPS)¹ |

| **Monotonicity** | ↑ higher | Fraction of features where attribution sign matches the direction of prediction change when that feature is perturbed | Luss et al. (2019, KDD)² |

| **Completeness error** | ↓ lower | |Σ attributions − (f(x) − f(baseline))| — how far attributions deviate from the actual prediction change | Sundararajan et al. (2017, ICML)³ |

| **Stability** | ↓ lower | Lipschitz ratio: attribution distance / input distance, averaged over random pairs — lower = more consistent explanations for similar inputs | Alvarez-Melis & Jaakkola (2018, NeurIPS)¹ |

| **Ricci scalar** | — | Intrinsic curvature of the model's statistical manifold. Higher = more curved decision boundary. **GEMEX-exclusive — no equivalent in other methods.** | Amari & Nagaoka (2000)⁴ |

*¹ Alvarez-Melis, D. & Jaakkola, T.S. (2018). Towards Robust Interpretability with Self-Explaining Neural Networks. NeurIPS 31.*

*² Luss, R., Chen, P.Y., Dhurandhar, A., et al. (2019). Generating Contrastive Explanations with Monotonic Attribute Functions. arXiv:1905.12698. Published KDD 2021.*

*³ Sundararajan, M., Taly, A. & Yan, Q. (2017). Axiomatic Attribution for Deep Networks. ICML 2017, PMLR 70:3319–3328.*

*⁴ Amari, S. & Nagaoka, H. (2000). Methods of Information Geometry. AMS.*

### Stability showcase







### Ricci scalar — exclusive to GEMEX







### Complete metric comparison







---

## Model-agnostic image XAI

GEMEX treats CNN image classifiers as genuine black boxes. It requires only

a probability output function — no layer hooks, no gradient access, no

architectural knowledge. The examples below cover greyscale X-ray,

greyscale CT organs, and **3-channel RGB** blood cell microscopy.

### Chest X-ray — PneumoniaMNIST







GEMEX (GeodesicCAM) identifies bilateral lower lobe infiltrates in this pneumonia

case using only `predict_proba()`. GradCAM requires full internal CNN access.

Deletion AUC: GEMEX=0.835, GradCAM=0.939, GradCAM++=0.708.

### Blood cell microscopy — BloodMNIST (RGB, 8 classes)







BloodMNIST is a **3-channel RGB dataset** (blood cell microscopy, 8 cell types).

For the Basophil class, GEMEX highlights the cell body as the dominant region

(Ricci=0.545, FIM=good) — matching GradCAM and GradCAM++ — using only

output probabilities, with no CNN layer access.

### Abdominal CT organs — OrganAMNIST (11 classes)







GEMEX achieves mean Deletion AUC of 0.335 across 11 organ classes, outperforming

GradCAM (0.291) and Saliency (0.234). Mean Ricci scalar: 0.637±0.100 (all FIM

quality ratings: good).

> **Important distinction:** GradCAM treats CNNs as white boxes — it hooks into

> convolutional layer activations and backpropagates gradients. It cannot explain

> proprietary APIs, non-CNN models, or any model where internal layers are

> inaccessible. GEMEX explains the same CNN as a genuine black box.

## What no other XAI method provides

| Capability | GEMEX | SHAP | LIME | GradCAM |

|---|:---:|:---:|:---:|:---:|

| Reparametrisation-invariant attribution | ✓ | ✗ | ✗ | ✗ |

| Holonomy pairwise interactions (PTI) | ✓ | ✗ | ✗ | ✗ |

| Three-way Riemann curvature (RCT) | ✓ | ✗ | ✗ | ✗ |

| Geometric uncertainty per feature | ✓ | ✗ | ✗ | ✗ |

| Manifold curvature (Ricci scalar) | ✓ | ✗ | ✗ | ✗ |

| Bias Trap Detection (BTD) | ✓ | ✗ | ✗ | ✗ |

| Feature Attention Sequence (FAS) | ✓ | ✗ | ✗ | ✗ |

| Black-box image XAI (greyscale + RGB) | ✓ | ✗ | partial | ✗ |

| Tabular + time series + image | ✓ | partial | partial | ✗ |

---

## Supported data types

```python

# Tabular (medical, financial, scientific)

exp = Explainer(model, data_type='tabular', ...)

# Time series (ECG, HAR, sensor signals)

exp = Explainer(model, data_type='timeseries', ...)

# Image — patch-based, genuine black box (no architecture access)

cfg = GemexConfig(image_patch_size=4)   # 28×28 → 7×7=49 patches, ~6× faster

exp = Explainer(model, data_type='image', ..., config=cfg)

```

---

## Installation

```bash

pip install gemex                  # core

pip install gemex[torch]           # + PyTorch CNN support

pip install medmnist               # MedMNIST medical imaging

pip install gemex[full]            # all backends

```

---

## Examples

| # | Script | Data type | Datasets |

|---|---|---|---|

| 01 | `01_tabular_heart_diabetes.py` | Tabular | Cleveland Heart Disease, Pima Diabetes (data sets in csv files, also downloadable)|

| 02 | `02_comparative_study.py` | Tabular | Heart + Diabetes + Breast Cancer vs SHAP/LIME/ELI5 (GBM, MLP, DeepMLP) |

| 03 | `03_timeseries_ecg.py` | Time series | ECG5000 (real data included in ECG5000/ folder) |

| 04 | `04_ablation_study.py` | Tabular | Per-component ablation with Wilcoxon tests |

| 05 | `05_statistical_comparison.py` | Tabular | Multi-seed study with bootstrap CI |

| 06 | `06_pneumoniamnist.py` | Image | PneumoniaMNIST — GEMEX vs GradCAM vs GradCAM++ |

| 07 | `07_pathmnist.py` | Image | PathMNIST — Colorectal cancer tissue (9 classes) |

| 08 | `08_dermamnist.py` | Image | DermaMNIST — Skin lesions HAM10000 (7 classes) |

| 09 | `09_organamnist.py` | Image | OrganAMNIST — Abdominal CT organs (11 classes) |

| 10 | `10_bloodmnist.py` | Image | BloodMNIST — Blood cell microscopy (8 classes) |

---

## Configuration reference

| Parameter | Default | Description |

|---|---|---|

| `n_geodesic_steps` | 40 | 4th-order Runge-Kutta (RK4) integration steps along geodesic |

| `n_reference_samples` | 80 | Background distribution sample size |

| `fim_epsilon_auto` | True | Auto-expand step size for tree/GBM models |

| `fim_local_n` | 16 | Neighbourhood perturbation count |

| `interaction_order` | 2 | 1=attribution only · 2=+PTI holonomy · 3=+RCT triplets |

| `image_patch_size` | 1 | 1=pixel · 4=7×7 patches (~6× faster, stronger Ricci) |

| `model_type` | 'auto' | 'auto' · 'tree' · 'smooth' |

| `gsf_normalise` | False | Force sum(GSF) = f(x) − f(baseline) |

---

## Current limitations and design trade-offs

Like any XAI framework, GEMEX involves trade-offs between geometric richness,

computational cost, and axiomatic coverage. The points below are known limitations

in v1.2.1 that are being actively addressed in future versions.

- **Speed vs geometric richness:** Tracing a geodesic requires more model calls

  per instance than straight-line perturbation methods such as SHAP or LIME.

  This is a fundamental trade-off of the manifold approach, not a fixable bug.

  See [speed tips](#speed-tips) for settings that bring the cost down significantly.

  GPU parallelisation of the RK4 integrator is planned for v1.3.0.

- **Tree model FIM quality:** GBM, XGBoost and Random Forest produce flat

  probability regions between split boundaries, making FIM gradient estimation

  harder. GEMEX handles this automatically with adaptive step-size expansion,

  though results are less reliable than on smooth models. Always check

  `result.fim_quality` — a `'poor'` rating signals that Ricci scalar values

  for that instance should be treated with caution. Improved tree-specific FIM

  estimation is on the roadmap.

- **Completeness is a diagnostic, not a constraint:** GEMEX attributions are

  not forced to sum to f(x) − f(baseline). Ablation testing confirmed that

  adding this constraint destabilises the geodesic. Completeness error is best

  treated as a descriptive metric rather than a quality gate. Future versions

  may explore optional soft-completeness modes.

- **High-dimensional Ricci estimation:** Beyond roughly 100 input features,

  FIM neighbourhood sampling can become sparse and Ricci may return zero for

  some instances. Increasing `fim_local_n` and `fim_local_sigma` mitigates

  this in most cases. A more scalable high-dimensional FIM estimator is

  planned for v1.3.0.

---



## Speed tips

GEMEX trades computation for geometric richness. Here are the main levers:

| Goal | Setting | Effect |

|---|---|---|

| **Quick exploration** | `n_geodesic_steps=8, n_reference_samples=20` | ~5× faster, slightly less accurate geodesic |

| **Attribution only** | `interaction_order=1` | Skip PTI/RCT computation, saves ~30% |

| **Image data** | `image_patch_size=4` | 784 pixels → 49 patches, ~6× faster |

| **Tree models** | `model_type='tree'` | Starts at larger epsilon, avoids wasted zero-gradient calls |

| **Production** | `n_geodesic_steps=12, n_reference_samples=30, interaction_order=1` | ~0.25 s/instance |

```python

# Fast settings for exploration

cfg_fast = GemexConfig(

    n_geodesic_steps    = 8,

    n_reference_samples = 20,

    interaction_order   = 1,   # attribution only — no interactions

    verbose             = False

)

# Recommended settings for publication-quality results

cfg_full = GemexConfig(

    n_geodesic_steps    = 20,

    n_reference_samples = 60,

    interaction_order   = 2,   # + PTI pairwise interactions

    fim_local_n         = 16,

    fim_local_sigma     = 0.10,

)

```

**FIM quality check:** Always inspect `result.fim_quality` after explaining.

If it returns `'poor'`, increase `fim_local_n` or `fim_local_sigma`, or switch

to `model_type='tree'` for tree-based models.

```python

result = exp.explain(x, X_reference=X_train)

if result.fim_quality == 'poor':

    print("Warning: FIM poorly estimated. Try increasing fim_local_n.")

    print(f"  Ricci scalar may be unreliable: {result.manifold_curvature:.4f}")

```

**Batch explanation:** Use `explain_batch` rather than looping for efficiency:

```python

# Preferred — single reference computation

batch = exp.explain_batch(X_test[:50], X_reference=X_train)

fig = batch[0].plot('beeswarm', batch_results=batch, theme='dark')

```

---



## Publication

Submitted:

> **Kose, U. (2026). GEMEX: Model-Agnostic XAI via Geodesic Entropic Manifold Analysis.**

> *8th International Congress on Human-Computer Interaction, Optimization and Robotic

> Applications (HORA 2026), May 21–23, 2026. IEEE indexed.*

The peer-reviewed paper presents full experimental validation: 5-seed comparative

study (GBM, MLP, DeepMLP) across three medical datasets, ablation analysis,

ECG5000 time series evaluation, and image XAI comparison with GradCAM and

GradCAM++ on PneumoniaMNIST, OrganAMNIST and BloodMNIST (RGB).

```bibtex

@inproceedings{kose_gemex_2026,

  author    = {Kose, Utku},

  title     = {GEMEX: Model-Agnostic XAI via Geodesic Entropic Manifold Analysis},

  booktitle = {8th International Congress on Human-Computer Interaction,

               Optimization and Robotic Applications (HORA 2026)},

  year      = {2026},

  month     = {May},

  note      = {Submitted. IEEE indexed. ORCID: 0000-0002-9652-6415}

}

```

---

## References

1. Amari, S. & Nagaoka, H. (2000). *Methods of Information Geometry.* American Mathematical Society.

2. Rao, C.R. (1945). Information and accuracy in statistical estimation. *Bulletin of the Calcutta Mathematical Society*, 37, 81–91.

3. Kobayashi, S. & Nomizu, K. (1963). *Foundations of Differential Geometry.* Interscience.

4. **Alvarez-Melis, D. & Jaakkola, T.S. (2018).** Towards Robust Interpretability with Self-Explaining Neural Networks. *Advances in Neural Information Processing Systems 31 (NeurIPS 2018),* 7786–7795.

5. **Luss, R., Chen, P.Y., Dhurandhar, A., Sattigeri, P., Zhang, Y., Shanmugam, K. & Tu, C.C. (2019).** Generating Contrastive Explanations with Monotonic Attribute Functions. arXiv:1905.12698. Published *KDD 2021,* 1139–1149.

6. **Sundararajan, M., Taly, A. & Yan, Q. (2017).** Axiomatic Attribution for Deep Networks. *Proceedings of the 34th ICML,* PMLR 70:3319–3328.

7. Lundberg, S.M. & Lee, S.I. (2017). A unified approach to interpreting model predictions. *NeurIPS 2017,* 4765–4774.

8. Ribeiro, M.T., Singh, S. & Guestrin, C. (2016). "Why should I trust you?". *KDD 2016,* 1135–1144.

9. Yang, J. et al. (2023). MedMNIST v2. *Scientific Data,* 10(1), 41. https://doi.org/10.1038/s41597-022-01721-8

10. Chattopadhay, A. et al. (2018). Grad-CAM++. *WACV 2018,* 839–847. https://doi.org/10.1109/WACV.2018.00097 

---

## License & development note

MIT License · Copyright © 2026 Prof. Dr. Utku Kose

GEMEX was developed through a human-AI collaboration. The theoretical framework,

mathematical foundations, algorithmic design decisions, and research directions

are the original intellectual contribution of Prof. Dr. Utku Kose.

AI-assisted tools were used in implementation and documentation phases.
ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/utkukose/gemex

Awesome Lists containing this project

README