Projects in Awesome Lists tagged with interpretability

https://github.com/shap/shap

A game theoretic approach to explain the output of any machine learning model.

deep-learning explainability gradient-boosting interpretability machine-learning shap shapley

Last synced: 15 Apr 2025

https://github.com/jacobgil/pytorch-grad-cam

Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

class-activation-maps computer-vision deep-learning explainable-ai explainable-ml grad-cam image-classification interpretability interpretable-ai interpretable-deep-learning machine-learning object-detection pytorch score-cam vision-transformers visualizations xai

Last synced: 15 Apr 2025

https://github.com/interpretml/interpret

Fit interpretable models. Explain blackbox machine learning.

ai artificial-intelligence bias blackbox differential-privacy explainability explainable-ai explainable-ml gradient-boosting iml interpretability interpretable-ai interpretable-machine-learning interpretable-ml interpretml machine-learning scikit-learn transparency xai

Last synced: 19 Apr 2025

https://github.com/pytorch/captum

Model interpretability and understanding for PyTorch

feature-attribution feature-importance interpretability interpretable-ai interpretable-ml

Last synced: 15 Apr 2025

https://github.com/tensorflow/lucid

A collection of infrastructure and tools for research in neural network interpretability.

colab interpretability jupyter-notebook machine-learning tensorflow visualization

Last synced: 19 Jan 2025

https://github.com/stellargraph/stellargraph

StellarGraph - Machine Learning on Graphs

data-science deep-learning gcn geometric-deep-learning graph-analysis graph-convolutional-networks graph-data graph-machine-learning graph-neural-networks graphs heterogeneous-networks interpretability link-prediction machine-learning machine-learning-algorithms networkx python saliency-map stellargraph-library

Last synced: 10 Apr 2025

https://github.com/maif/shapash

🔅 Shapash: User-friendly Explainability and Interpretability to Develop Reliable and Transparent Machine Learning Models

ethical-artificial-intelligence explainability explainable-ml interpretability lime machine-learning python shap transparency

Last synced: 08 Apr 2025

https://github.com/seldonio/alibi

Algorithms for explaining machine learning models

counterfactual explanations interpretability machine-learning xai

Last synced: 09 Apr 2025

https://github.com/frgfm/torch-cam

Class activation maps for your PyTorch models (CAM, Grad-CAM, Grad-CAM++, Smooth Grad-CAM++, Score-CAM, SS-CAM, IS-CAM, XGrad-CAM, Layer-CAM)

activation-maps class-activation-map cnn deep-learning grad-cam gradcam gradcam-plus-plus interpretability interpretable-deep-learning python pytorch saliency-map score-cam smoothgrad

Last synced: 09 Apr 2025

https://github.com/google-deepmind/penzai

A JAX research toolkit for building, editing, and visualizing neural networks.

fine-tuning interpretability jax neural-networks visualization

Last synced: 10 Apr 2025

https://github.com/ramprs/grad-cam

[ICCV 2017] Torch code for Grad-CAM

convolutional-neural-networks deep-learning grad-cam heatmap iccv17 interpretability visual-explanation

Last synced: 08 Apr 2025

https://github.com/microsoft/responsible-ai-toolbox

Responsible AI Toolbox is a suite of tools providing model and data exploration and assessment user interfaces and libraries that enable a better understanding of AI systems. These interfaces and libraries empower developers and stakeholders of AI systems to develop and monitor AI more responsibly, and take better data-driven actions.

data-analysis data-science data-visualization error-analysis explainability explainable-ai explainable-ml fairness fairness-ai fairness-ml interpretability jupyter machine-learning machinelearning ml responsible-ai ui visualization widget widgets

Last synced: 09 Apr 2025

https://github.com/csinva/imodels

Interpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-compatible).

ai artificial-intelligence bayesian-rule-list data-science explainable-ai explainable-ml imodels interpretability machine-learning ml optimal-classification-tree python rule-learning rulefit rules scikit-learn statistics supervised-learning

Last synced: 09 Apr 2025

https://modeloriented.github.io/DALEX/

moDel Agnostic Language for Exploration and eXplanation

black-box dalex data-science explainable-ai explainable-artificial-intelligence explainable-ml explanations explanatory-model-analysis fairness iml interpretability interpretable-machine-learning machine-learning model-visualization predictive-modeling responsible-ai responsible-ml xai

Last synced: 20 Nov 2024

https://github.com/modeloriented/dalex

moDel Agnostic Language for Exploration and eXplanation

black-box dalex data-science explainable-ai explainable-artificial-intelligence explainable-ml explanations explanatory-model-analysis fairness iml interpretability interpretable-machine-learning machine-learning model-visualization predictive-modeling responsible-ai responsible-ml xai

Last synced: 09 Apr 2025

https://github.com/cdpierse/transformers-interpret

Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.

captum computer-vision deep-learning explainable-ai interpretability machine-learning model-explainability natural-language-processing neural-network nlp transformers transformers-model

Last synced: 14 Apr 2025

https://github.com/ethicalml/xai

XAI - An eXplainability toolbox for machine learning

ai artificial-intelligence bias bias-evaluation downsampling evaluation explainability explainable-ai explainable-ml feature-importance imbalance interpretability machine-learning machine-learning-explainability ml upsampling xai xai-library

Last synced: 07 Apr 2025

https://github.com/stanfordnlp/pyreft

ReFT: Representation Finetuning for Language Models

interpretability reft representation-finetuning

Last synced: 10 Apr 2025

https://github.com/sicara/tf-explain

Interpretability Methods for tf.keras models with TensorFlow 2.x

deep-learning interpretability keras machine-learning tensorflow tf2 visualization

Last synced: 11 Apr 2025

https://github.com/hila-chefer/transformer-mm-explainability

[ICCV 2021 Oral] Official PyTorch implementation of Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-based network. Includes examples for DETR and VQA.

clip detr explainability explainable-ai interpretability lxmert transformer transformers visualbert visualization vqa

Last synced: 12 Apr 2025

https://github.com/shubhomoydas/ad_examples

A collection of anomaly detection methods (iid/point-based, graph and time series) including active learning for anomaly detection/discovery, bayesian rule-mining, description for diversity/explanation/interpretability. Analysis of incorporating label feedback with ensemble and tree-based detectors. Includes adversarial attacks with Graph Convolutional Network.

active-learning adversarial-attacks anogan anomaly-detection autoencoder concept-drift ensemble-learning explaination gan generative-adversarial-network graph-convolutional-networks interpretability lstm nettack rnn streaming time-series timeseries trees unsuperivsed

Last synced: 16 Mar 2025

https://github.com/pbiecek/xai_resources

Interesting resources related to XAI (Explainable Artificial Intelligence)

interpretability interpretable-machine-learning xai

Last synced: 13 Apr 2025

https://github.com/kundajelab/deeplift

Public-facing DeepLIFT repo

deeplift guided-backpropagation integrated-gradients interpretability interpretable-deep-learning saliency-map sensitivity-analysis

Last synced: 19 Apr 2025

https://github.com/MisaOgura/flashtorch

Visualization toolkit for neural networks in PyTorch!

cnn deep-learning explainability interpretability machine-learning neural-networks pytorch visualization

Last synced: 27 Mar 2025

https://github.com/stanfordnlp/pyvene

Stanford NLP Python library for understanding and improving PyTorch models via interventions

activation-intervention activation-patching interpretability intervention mechanistic-interpretability

Last synced: 12 Apr 2025

https://github.com/jphall663/interpretable_machine_learning_with_python

Examples of techniques for training interpretable ML models, explaining ML models, and debugging ML models for accuracy, discrimination, and security.

accountability data-mining data-science decision-tree fairness fatml gradient-boosting-machine h2o iml interpretability interpretable interpretable-ai interpretable-machine-learning interpretable-ml lime machine-learning machine-learning-interpretability python transparency xai

Last synced: 12 Apr 2025

https://github.com/tensorflow/decision-forests

A collection of state-of-the-art algorithms for the training, serving and interpretation of Decision Forest models in Keras.

decision-forest decision-trees gradient-boosting interpretability keras machine-learning ml python random-forest tensorflow

Last synced: 10 Apr 2025

https://github.com/deel-ai/xplique

👋 Xplique is a Neural Networks Explainability Toolbox

explainable-ai explainable-ml interpretability xai

Last synced: 04 Apr 2025

https://github.com/tensorflow/tcav

Code for the TCAV ML interpretability project

interpretability machine-learning tcav

Last synced: 08 Apr 2025

https://github.com/alvinwan/neural-backed-decision-trees

Making decision trees competitive with neural networks on CIFAR10, CIFAR100, TinyImagenet200, Imagenet

cifar10 cifar100 decision-trees explainability image-classification imagenet interpretability neural-backed-decision-trees neural-networks pretrained-models pretrained-weights pytorch tiny-imagenet

Last synced: 09 Apr 2025

https://github.com/understandable-machine-intelligence-lab/quantus

Quantus is an eXplainable AI toolkit for responsible evaluation of neural network explanations

deep-learning explainable-ai interpretability machine-learning pytorch quantification-evaluation-methods reproducibility tensorflow xai

Last synced: 12 Apr 2025

https://github.com/kmeng01/rome

Locating and editing factual associations in GPT (NeurIPS 2022)

gpt interpretability pytorch transformers

Last synced: 26 Mar 2025

https://github.com/google/yggdrasil-decision-forests

A library to train, evaluate, interpret, and productionize decision forest models such as Random Forest and Gradient Boosted Decision Trees.

cart cli cpp decision-forest decision-trees distributed-computing go gradient-boosting interpretability javascript machine-learning ml pypi python random-forest tensorflow

Last synced: 11 Apr 2025

https://github.com/linkedin/FastTreeSHAP

Fast SHAP value computation for interpreting tree-based models

explainable-ai interpretability lightgbm machine-learning random-forest shap xgboost

Last synced: 27 Nov 2024

https://github.com/bcg-x-official/facet

Human-explainable AI.

data-analytics data-science explainable-ai hyperparameter-tuning interpretability machine-learning model-selection python shap-vector-decomposition simulation statistics

Last synced: 08 Apr 2025

https://github.com/h2oai/mli-resources

H2O.ai Machine Learning Interpretability Resources

accountability data-mining data-science explainable-ml fairness fatml h2o iml interpretability interpretable-ai interpretable-machine-learning interpretable-ml jupyter-notebooks machine-learning machine-learning-interpretability mli python transparency xai xgboost

Last synced: 05 Apr 2025

https://github.com/explainX/explainx

Explainable AI framework for data scientists. Explain & debug any blackbox machine learning model with a single line of code. We are looking for co-authors to take this project forward.

aws-sagemaker bias blackbox explainability explainable-ai explainable-artificial-intelligence explainable-ml explainx interpretability interpretable-ai interpretable-machine-learning machine-learning machine-learning-interpretability scikit-learn transparency xai

Last synced: 04 Apr 2025

https://github.com/inseq-team/inseq

Interpretability for sequence generation models 🐛 🔍

attribution-methods captum deep-learning explainable-ai generative-ai huggingface interpretability language-generation language-model large-language-models natural-language-processing sequence-to-sequence transformers

Last synced: 26 Mar 2025

https://github.com/xmed-lab/CLIP_Surgery

CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks

clip explainability interpretability multilabel multimodal open-vocabulary sam segment-anything segmentation vision-transformer

Last synced: 16 Mar 2025

https://github.com/ndif-team/nnsight

The nnsight package enables interpreting and manipulating the internals of deep learned models.

interpretability machine-learning neural-networks python pytorch

Last synced: 26 Feb 2025

https://github.com/pratyushasharma/laser

The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction

gpt-j interpretability laser llama2 llm llms model-compression transformers

Last synced: 05 Apr 2025

https://github.com/sergioburdisso/pyss3

A Python package implementing a new interpretable machine learning model for text classification (with visualization tools for Explainable AI)

artificial-intelligence data-mining document-categorization document-classification early-classification explainable-artificial-intelligence interpretability interpretable-machine-learning interpretable-ml machine-learning machine-learning-algorithms multilabel-classification natural-language-processing nlp sentence-classification ss3-classifier text-classification text-labeling text-mining xai

Last synced: 14 Apr 2025

https://github.com/modeloriented/modelstudio

📍 Interactive Studio for Explanatory Model Analysis

ai explainable explainable-ai explainable-machine-learning explanatory-model-analysis human iml interactive interactivity interpretability interpretable interpretable-machine-learning learning machine model model-visualization r visualization xai

Last synced: 04 Apr 2025

https://github.com/hbaniecki/adversarial-explainable-ai

💡 Adversarial attacks on explanations and how to defend them

adversarial adversarial-attacks adversarial-examples adversarial-machine-learning attacks counterfactual deep defense evaluation explainability explainable-ai iml interpretability interpretable interpretable-machine-learning model responsible-ai robustness security xai

Last synced: 25 Mar 2025

https://github.com/joaolages/diffusers-interpret

Diffusers-Interpret 🤗🧨🕵️‍♀️: Model explainability for 🤗 Diffusers. Get explanations for your generated images.

computer-vision deep-learning diffusers diffusion explainable-ai image-generation interpretability model-explainability primary-attributions pytorch text2image transformers

Last synced: 05 Apr 2025

https://github.com/iancovert/sage

For calculating global feature importance using Shapley values.

explainability interpretability machine-learning shapley

Last synced: 26 Mar 2025

https://github.com/stevekgyang/mentalllama

This repository introduces MentaLLaMA, the first open-source instruction-following large language model for interpretable mental health analysis.

chatgpt gpt4 interpretability language-model large-language-models llama2 mental-health natural-language-processing natural-language-understanding social-media

Last synced: 09 Apr 2025

https://github.com/AI4LIFE-GROUP/OpenXAI

OpenXAI : Towards a Transparent Evaluation of Model Explanations

benchmark explainability explainable-ai interpretability leaderboard reproducibility

Last synced: 11 Nov 2024

https://github.com/chr5tphr/zennit

Zennit is a high-level framework in Python using PyTorch for explaining/exploring neural networks using attribution methods like LRP.

attribution deep-learning explainability explainable-ai feature-attribution interpretability interpretable-ai interpretable-ml lrp machine-learning python pytorch xai

Last synced: 09 Apr 2025

https://github.com/pralab/secml

A Python library for Secure and Explainable Machine Learning

adversarial-machine-learning algorithms artificial-intelligence attack-algorithms cleverhans evasion-attacks explainable-machine-learning foolbox interpretability machine-learning matplotlib neural-networks poisoning-attacks python python-library pytorch secml security sparse-data tensorflow

Last synced: 21 Apr 2025

https://github.com/jrieke/cnn-interpretability

🏥 Visualizing Convolutional Networks for MRI-based Diagnosis of Alzheimer’s Disease

alzheimer-disease-prediction alzheimers-disease cnn convolutional-neural-networks deep-learning interpretability interpretable-machine-learning machine-learning medical-imaging mri visualization-methods

Last synced: 14 Apr 2025

https://github.com/austinrochford/pycebox

⬛ Python Individual Conditional Expectation Plot Toolbox

interpretability machine-learning

Last synced: 10 Apr 2025

https://github.com/graph-com/gsat

[ICML 2022] Graph Stochastic Attention (GSAT) for interpretable and generalizable graph learning.

deep-learning graph-neural-networks interpretability interpretable-machine-learning pytorch xai

Last synced: 03 Dec 2024

https://github.com/pietrobarbiero/pytorch_explain

PyTorch Explain: Interpretable Deep Learning in Python.

deep-learning entropy explainability explainable-ai interpretability interpretable-ai interpretable-deep-learning interpretable-machine-learning lens logic machine-learning neural-network python pytorch sympy

Last synced: 06 Apr 2025

https://github.com/google-research/reverse-engineering-neural-networks

A collection of tools for reverse engineering neural networks.

deep-learning interpretability machine-learning

Last synced: 07 Apr 2025

https://github.com/EleutherAI/knowledge-neurons

A library for finding knowledge neurons in pretrained transformer models.

interpretability transformers

Last synced: 15 Nov 2024

https://github.com/poloclub/timbertrek

Explore and compare 1K+ accurate decision trees in your browser!

decision-tree interactive-visualizations interpretability rashomon visualization

Last synced: 15 Nov 2024

https://github.com/vanderschaarlab/autoprognosis

A system for automating the design of predictive modeling pipelines tailored for clinical prognosis.

automl healthcare interpretability survival-analysis

Last synced: 12 Apr 2025

https://github.com/mahmoodlab/survpath

Modeling Dense Multimodal Interactions Between Biological Pathways and Histology for Survival Prediction - CVPR 2024

histology-transcriptomics interpretability mahmoodlab pathology pathology-genomics pathology-representation pathways survpath

Last synced: 05 Apr 2025

https://github.com/csinva/hierarchical-dnn-interpretations

Using / reproducing ACD from the paper "Hierarchical interpretations for neural network predictions" 🧠 (ICLR 2019)

acd ai artificial-intelligence convolutional-neural-networks data-science deep-learning deep-neural-networks explainability explainable-ai feature-importance iclr interpretability interpretation jupyter-notebook machine-learning ml neural-network python pytorch statistics

Last synced: 12 Apr 2025

https://github.com/laura-rieger/deep-explanation-penalization

Code for using CDEP from the paper "Interpretations are useful: penalizing explanations to align neural networks with prior knowledge" https://arxiv.org/abs/1909.13584

ai artificial-intelligence cdep convolutional-neural-network data-science deep-learning explainability explainable-ai fairness fairness-ml feature-importance interpretability interpretable-deep-learning jupyter-notebook machine-learning ml neural-network python pytorch recurrent-neural-network

Last synced: 15 Nov 2024

https://github.com/kennethenevoldsen/asent

Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.

interpretability natural-language-processing nlp python3 sentiment-analysis spacy spacy-extensions

Last synced: 06 Apr 2025

https://github.com/interpretml/gam-changer

Editing machine learning models to reflect human knowledge and values

interpretability machine-learning visualization

Last synced: 10 Nov 2024

https://github.com/fredhohman/summit

🏔️ Summit: Scaling Deep Learning Interpretability by Visualizing Activation and Attribution Summarizations

deep-learning deep-learning-visualization interactive-interface interactive-visualization interpretability

Last synced: 11 Apr 2025

https://github.com/julia-xai/explainableai.jl

Explainable AI in Julia.

attribution-methods explainable-ai feature-attribution interpretability interpretable-ai julia lrp xai

Last synced: 05 Apr 2025

https://github.com/jasonjmcghee/livelove

Love2D LSP (VS Code / Neovim / Zed / etc.) extension for live coding and live variable tracking

interpretability language-server-protocol live live-coding love2d lsp neovim-plugin nvim-plugin observability vscode-extension zed-extension

Last synced: 14 Apr 2025

https://github.com/pbiecek/breakDown

Model-agnostic breakDown plots

data-science iml interpretability machine-learning visual-explanations xai

Last synced: 11 Nov 2024

https://github.com/M-Nauta/ProtoTree

ProtoTrees: Neural Prototype Trees for Interpretable Fine-grained Image Recognition, published at CVPR2021

computer-vision cvpr2021 decision-trees deep-neural-networks explainability explainable-ai explainable-ml fine-grained-classification fine-grained-visual-categorization interpretability interpretable-deep-learning interpretable-machine-learning pytorch

Last synced: 15 Nov 2024

https://github.com/snehankekre/streamlit-shap

streamlit-shap provides a wrapper to display SHAP plots in Streamlit.

explainability interpretability machine-learning shap shapley streamlit streamlit-component

Last synced: 19 Dec 2024

https://github.com/alstonlo/torch-influence

A simple PyTorch implementation of influence functions.

deep-learning influence-functions interpretability machine-learning

Last synced: 17 Dec 2024

https://github.com/ModelOriented/iBreakDown

Break Down with interactions for local explanations (SHAP, BreakDown, iBreakDown)

breakdown iml interpretability shapley xai

Last synced: 14 Mar 2025

https://github.com/mertyg/debug-mistakes-cce

Meaningfully debugging model mistakes with conceptual counterfactual explanations. ICML 2022

concepts counterfactual-explanations explanations interpretability

Last synced: 19 Nov 2024

https://github.com/csinva/imodelsx

Scikit-learn-friendly library to interpret and prompt-engineer text datasets using large language models.

ai deep-learning explainability huggingface interpretability language-model machine-learning ml natural-language-processing natural-language-understanding neural-network pytorch scikit-learn text text-classification transformer-models xai

Last synced: 26 Feb 2025

https://github.com/fat-forensics/fat-forensics

Modular Python Toolbox for Fairness, Accountability and Transparency Forensics

accountability explainability explainable-ai fairness interpretability interpretable-ai machine-learning transparency

Last synced: 27 Mar 2025

https://github.com/nyuvis/explanation_explorer

A user interface to interpret machine learning models.

interpretability machine-learning visual-interface visualization-application

Last synced: 03 Apr 2025

https://github.com/arabiaweather/athena

Automatic equation building and curve fitting. Runs on TensorFlow. Built for academia and research.

academia curve-fitting equation-solver interpretability machine-learning optimization research-tool simulation-framework symbolic-computation symbolic-regression tensorflow

Last synced: 25 Mar 2025

https://github.com/pfnet-research/bayesgrad

BayesGrad: Explaining Predictions of Graph Convolutional Networks

chainer chemistry deep-learning graph-convolutional-networks interpretability neural-network python saliency

Last synced: 13 Apr 2025

https://github.com/chirag-agarwall/VOG

Estimating Example Difficulty using Variance of Gradients

atypical-examples deep-learning explainability human-in-the-loop-auditing interpretability

Last synced: 15 Nov 2024

https://github.com/taufeeque9/codebook-features

Sparse and discrete interpretability tool for neural networks

codebook features interpretability language-model mechanistic-interpretability transformers

Last synced: 13 Apr 2025

https://github.com/aredier/trelawney

General Interpretability Package

graphics interpretability machine-learning python

Last synced: 27 Nov 2024

https://github.com/ramprs/neuron-importance-zsl

[ECCV 2018] Code for Choose Your Neuron: Incorporating Domain Knowledge Through Neuron Importance

grad-cam interpretability neuron-importance zero-shot-learning

Last synced: 19 Nov 2024

https://github.com/mertyg/post-hoc-cbm

Code for the paper "Post-hoc Concept Bottleneck Models". Spotlight @ ICLR 2023

concept-based-explanations concept-based-models concepts explainability interpretability

Last synced: 19 Nov 2024

https://github.com/microsoft/automated-explanations

Generating and validating natural-language explanations.

artificial-intelligence automated-interpretability data-science explanation fmri fmri-data-analysis gpt gpt4 huggingface interpretability language-model large-language-models machine-learning mechanistic-interpretability neuroscience xai

Last synced: 10 Feb 2025