Projects in Awesome Lists tagged with interpretability
A curated list of projects in awesome lists tagged with interpretability .
https://github.com/shap/shap
A game theoretic approach to explain the output of any machine learning model.
deep-learning explainability gradient-boosting interpretability machine-learning shap shapley
Last synced: 15 Apr 2025
https://github.com/slundberg/shap
A game theoretic approach to explain the output of any machine learning model.
deep-learning explainability gradient-boosting interpretability machine-learning shap shapley
Last synced: 16 Nov 2024
https://github.com/jacobgil/pytorch-grad-cam
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
class-activation-maps computer-vision deep-learning explainable-ai explainable-ml grad-cam image-classification interpretability interpretable-ai interpretable-deep-learning machine-learning object-detection pytorch score-cam vision-transformers visualizations xai
Last synced: 15 Apr 2025
https://github.com/interpretml/interpret
Fit interpretable models. Explain blackbox machine learning.
ai artificial-intelligence bias blackbox differential-privacy explainability explainable-ai explainable-ml gradient-boosting iml interpretability interpretable-ai interpretable-machine-learning interpretable-ml interpretml machine-learning scikit-learn transparency xai
Last synced: 19 Apr 2025
https://github.com/pytorch/captum
Model interpretability and understanding for PyTorch
feature-attribution feature-importance interpretability interpretable-ai interpretable-ml
Last synced: 15 Apr 2025
https://github.com/tensorflow/lucid
A collection of infrastructure and tools for research in neural network interpretability.
colab interpretability jupyter-notebook machine-learning tensorflow visualization
Last synced: 19 Jan 2025
https://github.com/stellargraph/stellargraph
StellarGraph - Machine Learning on Graphs
data-science deep-learning gcn geometric-deep-learning graph-analysis graph-convolutional-networks graph-data graph-machine-learning graph-neural-networks graphs heterogeneous-networks interpretability link-prediction machine-learning machine-learning-algorithms networkx python saliency-map stellargraph-library
Last synced: 10 Apr 2025
https://github.com/maif/shapash
🔅 Shapash: User-friendly Explainability and Interpretability to Develop Reliable and Transparent Machine Learning Models
ethical-artificial-intelligence explainability explainable-ml interpretability lime machine-learning python shap transparency
Last synced: 08 Apr 2025
https://github.com/MAIF/shapash
🔅 Shapash: User-friendly Explainability and Interpretability to Develop Reliable and Transparent Machine Learning Models
ethical-artificial-intelligence explainability explainable-ml interpretability lime machine-learning python shap transparency
Last synced: 26 Mar 2025
https://github.com/seldonio/alibi
Algorithms for explaining machine learning models
counterfactual explanations interpretability machine-learning xai
Last synced: 09 Apr 2025
https://github.com/SeldonIO/alibi
Algorithms for explaining machine learning models
counterfactual explanations interpretability machine-learning xai
Last synced: 27 Mar 2025
https://github.com/frgfm/torch-cam
Class activation maps for your PyTorch models (CAM, Grad-CAM, Grad-CAM++, Smooth Grad-CAM++, Score-CAM, SS-CAM, IS-CAM, XGrad-CAM, Layer-CAM)
activation-maps class-activation-map cnn deep-learning grad-cam gradcam gradcam-plus-plus interpretability interpretable-deep-learning python pytorch saliency-map score-cam smoothgrad
Last synced: 09 Apr 2025
https://github.com/google-deepmind/penzai
A JAX research toolkit for building, editing, and visualizing neural networks.
fine-tuning interpretability jax neural-networks visualization
Last synced: 10 Apr 2025
https://github.com/ramprs/grad-cam
[ICCV 2017] Torch code for Grad-CAM
convolutional-neural-networks deep-learning grad-cam heatmap iccv17 interpretability visual-explanation
Last synced: 08 Apr 2025
https://github.com/microsoft/responsible-ai-toolbox
Responsible AI Toolbox is a suite of tools providing model and data exploration and assessment user interfaces and libraries that enable a better understanding of AI systems. These interfaces and libraries empower developers and stakeholders of AI systems to develop and monitor AI more responsibly, and take better data-driven actions.
data-analysis data-science data-visualization error-analysis explainability explainable-ai explainable-ml fairness fairness-ai fairness-ml interpretability jupyter machine-learning machinelearning ml responsible-ai ui visualization widget widgets
Last synced: 09 Apr 2025
https://github.com/csinva/imodels
Interpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-compatible).
ai artificial-intelligence bayesian-rule-list data-science explainable-ai explainable-ml imodels interpretability machine-learning ml optimal-classification-tree python rule-learning rulefit rules scikit-learn statistics supervised-learning
Last synced: 09 Apr 2025
https://modeloriented.github.io/DALEX/
moDel Agnostic Language for Exploration and eXplanation
black-box dalex data-science explainable-ai explainable-artificial-intelligence explainable-ml explanations explanatory-model-analysis fairness iml interpretability interpretable-machine-learning machine-learning model-visualization predictive-modeling responsible-ai responsible-ml xai
Last synced: 20 Nov 2024
https://github.com/modeloriented/dalex
moDel Agnostic Language for Exploration and eXplanation
black-box dalex data-science explainable-ai explainable-artificial-intelligence explainable-ml explanations explanatory-model-analysis fairness iml interpretability interpretable-machine-learning machine-learning model-visualization predictive-modeling responsible-ai responsible-ml xai
Last synced: 09 Apr 2025
https://github.com/ModelOriented/DALEX
moDel Agnostic Language for Exploration and eXplanation
black-box dalex data-science explainable-ai explainable-artificial-intelligence explainable-ml explanations explanatory-model-analysis fairness iml interpretability interpretable-machine-learning machine-learning model-visualization predictive-modeling responsible-ai responsible-ml xai
Last synced: 14 Mar 2025
https://github.com/cdpierse/transformers-interpret
Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.
captum computer-vision deep-learning explainable-ai interpretability machine-learning model-explainability natural-language-processing neural-network nlp transformers transformers-model
Last synced: 14 Apr 2025
https://github.com/ethicalml/xai
XAI - An eXplainability toolbox for machine learning
ai artificial-intelligence bias bias-evaluation downsampling evaluation explainability explainable-ai explainable-ml feature-importance imbalance interpretability machine-learning machine-learning-explainability ml upsampling xai xai-library
Last synced: 07 Apr 2025
https://github.com/EthicalML/xai
XAI - An eXplainability toolbox for machine learning
ai artificial-intelligence bias bias-evaluation downsampling evaluation explainability explainable-ai explainable-ml feature-importance imbalance interpretability machine-learning machine-learning-explainability ml upsampling xai xai-library
Last synced: 14 Mar 2025
https://github.com/stanfordnlp/pyreft
ReFT: Representation Finetuning for Language Models
interpretability reft representation-finetuning
Last synced: 10 Apr 2025
https://github.com/sicara/tf-explain
Interpretability Methods for tf.keras models with Tensorflow 2.x
deep-learning interpretability keras machine-learning tensorflow tf2 visualization
Last synced: 11 Apr 2025
https://github.com/hila-chefer/transformer-mm-explainability
[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-based network. Including examples for DETR, VQA.
clip detr explainability explainable-ai interpretability lxmert transformer transformers visualbert visualization vqa
Last synced: 12 Apr 2025
https://github.com/shubhomoydas/ad_examples
A collection of anomaly detection methods (iid/point-based, graph and time series) including active learning for anomaly detection/discovery, bayesian rule-mining, description for diversity/explanation/interpretability. Analysis of incorporating label feedback with ensemble and tree-based detectors. Includes adversarial attacks with Graph Convolutional Network.
active-learning adversarial-attacks anogan anomaly-detection autoencoder concept-drift ensemble-learning explaination gan generative-adversarial-network graph-convolutional-networks interpretability lstm nettack rnn streaming time-series timeseries trees unsuperivsed
Last synced: 16 Mar 2025
https://github.com/hila-chefer/Transformer-MM-Explainability
[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-based network. Including examples for DETR, VQA.
clip detr explainability explainable-ai interpretability lxmert transformer transformers visualbert visualization vqa
Last synced: 03 Apr 2025
https://github.com/pbiecek/xai_resources
Interesting resources related to XAI (Explainable Artificial Intelligence)
interpretability interpretable-machine-learning xai
Last synced: 13 Apr 2025
https://github.com/kundajelab/deeplift
Public facing deeplift repo
deeplift guided-backpropagation integrated-gradients interpretability interpretable-deep-learning saliency-map sensitivity-analysis
Last synced: 19 Apr 2025
https://github.com/MisaOgura/flashtorch
Visualization toolkit for neural networks in PyTorch! Demo -->
cnn deep-learning explainability interpretability machine-learning neural-networks pytorch visualization
Last synced: 27 Mar 2025
https://github.com/stanfordnlp/pyvene
Stanford NLP Python library for understanding and improving PyTorch models via interventions
activation-intervention activation-patching interpretability intervention mechanistic-interpretability
Last synced: 12 Apr 2025
https://github.com/jphall663/interpretable_machine_learning_with_python
Examples of techniques for training interpretable ML models, explaining ML models, and debugging ML models for accuracy, discrimination, and security.
accountability data-mining data-science decision-tree fairness fatml gradient-boosting-machine h2o iml interpretability interpretable interpretable-ai interpretable-machine-learning interpretable-ml lime machine-learning machine-learning-interpretability python transparency xai
Last synced: 12 Apr 2025
https://github.com/tensorflow/decision-forests
A collection of state-of-the-art algorithms for the training, serving and interpretation of Decision Forest models in Keras.
decision-forest decision-trees gradient-boosting interpretability keras machine-learning ml python random-forest tensorflow
Last synced: 10 Apr 2025
https://github.com/deel-ai/xplique
👋 Xplique is a Neural Networks Explainability Toolbox
explainable-ai explainable-ml interpretability xai
Last synced: 04 Apr 2025
https://github.com/tensorflow/tcav
Code for the TCAV ML interpretability project
interpretability machine-learning tcav
Last synced: 08 Apr 2025
https://github.com/alvinwan/neural-backed-decision-trees
Making decision trees competitive with neural networks on CIFAR10, CIFAR100, TinyImagenet200, Imagenet
cifar10 cifar100 decision-trees explainability image-classification imagenet interpretability neural-backed-decision-trees neural-networks pretrained-models pretrained-weights pytorch tiny-imagenet
Last synced: 09 Apr 2025
https://github.com/understandable-machine-intelligence-lab/quantus
Quantus is an eXplainable AI toolkit for responsible evaluation of neural network explanations
deep-learning explainable-ai interpretability machine-learning pytorch quantification-evaluation-methods reproducibility tensorflow xai
Last synced: 12 Apr 2025
https://github.com/kmeng01/rome
Locating and editing factual associations in GPT (NeurIPS 2022)
gpt interpretability pytorch transformers
Last synced: 26 Mar 2025
https://github.com/google/yggdrasil-decision-forests
A library to train, evaluate, interpret, and productionize decision forest models such as Random Forest and Gradient Boosted Decision Trees.
cart cli cpp decision-forest decision-trees distributed-computing go gradient-boosting interpretability javascript machine-learning ml pypi python random-forest tensorflow
Last synced: 11 Apr 2025
https://github.com/linkedin/FastTreeSHAP
Fast SHAP value computation for interpreting tree-based models
explainable-ai interpretability lightgbm machine-learning random-forest shap xgboost
Last synced: 27 Nov 2024
https://github.com/linkedin/fasttreeshap
Fast SHAP value computation for interpreting tree-based models
explainable-ai interpretability lightgbm machine-learning random-forest shap xgboost
Last synced: 13 Nov 2024
https://github.com/understandable-machine-intelligence-lab/Quantus
Quantus is an eXplainable AI toolkit for responsible evaluation of neural network explanations
deep-learning explainable-ai interpretability machine-learning pytorch quantification-evaluation-methods reproducibility tensorflow xai
Last synced: 15 Nov 2024
https://github.com/bcg-x-official/facet
Human-explainable AI.
data-analytics data-science explainable-ai hyperparameter-tuning interpretability machine-learning model-selection python shap-vector-decomposition simulation statistics
Last synced: 08 Apr 2025
https://github.com/BCG-X-Official/facet
Human-explainable AI.
data-analytics data-science explainable-ai hyperparameter-tuning interpretability machine-learning model-selection python shap-vector-decomposition simulation statistics
Last synced: 15 Nov 2024
https://github.com/h2oai/mli-resources
H2O.ai Machine Learning Interpretability Resources
accountability data-mining data-science explainable-ml fairness fatml h2o iml interpretability interpretable-ai interpretable-machine-learning interpretable-ml jupyter-notebooks machine-learning machine-learning-interpretability mli python transparency xai xgboost
Last synced: 05 Apr 2025
https://github.com/explainX/explainx
Explainable AI framework for data scientists. Explain & debug any blackbox machine learning model with a single line of code. We are looking for co-authors to take this project forward. Reach out @ [email protected]
aws-sagemaker bias blackbox explainability explainable-ai explainable-artificial-intelligence explainable-ml explainx interpretability interpretable-ai interpretable-machine-learning machine-learning machine-learning-interpretability scikit-learn transparency xai
Last synced: 04 Apr 2025
https://github.com/explainx/explainx
Explainable AI framework for data scientists. Explain & debug any blackbox machine learning model with a single line of code. We are looking for co-authors to take this project forward. Reach out @ [email protected]
aws-sagemaker bias blackbox explainability explainable-ai explainable-artificial-intelligence explainable-ml explainx interpretability interpretable-ai interpretable-machine-learning machine-learning machine-learning-interpretability scikit-learn transparency xai
Last synced: 12 Apr 2025
https://github.com/inseq-team/inseq
Interpretability for sequence generation models 🐛 🔍
attribution-methods captum deep-learning explainable-ai generative-ai huggingface interpretability language-generation language-model large-language-models natural-language-processing sequence-to-sequence transformers
Last synced: 26 Mar 2025
https://github.com/xmed-lab/CLIP_Surgery
CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks
clip explainability interpretability multilabel multimodal open-vocabulary sam segment-anything segmentation vision-transformer
Last synced: 16 Mar 2025
https://github.com/ndif-team/nnsight
The nnsight package enables interpreting and manipulating the internals of deep learned models.
interpretability machine-learning neural-networks python pytorch
Last synced: 26 Feb 2025
https://github.com/pratyushasharma/laser
The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction
gpt-j interpretability laser llama2 llm llms model-compression transformers
Last synced: 05 Apr 2025
https://github.com/sergioburdisso/pyss3
A Python package implementing a new interpretable machine learning model for text classification (with visualization tools for Explainable AI :octocat:)
artificial-intelligence data-mining document-categorization document-classification early-classification explainable-artificial-intelligence interpretability interpretable-machine-learning interpretable-ml machine-learning machine-learning-algorithms multilabel-classification natural-language-processing nlp sentence-classification ss3-classifier text-classification text-labeling text-mining xai
Last synced: 14 Apr 2025
https://github.com/modeloriented/modelstudio
📍 Interactive Studio for Explanatory Model Analysis
ai explainable explainable-ai explainable-machine-learning explanatory-model-analysis human iml interactive interactivity interpretability interpretable interpretable-machine-learning learning machine model model-visualization r visualization xai
Last synced: 04 Apr 2025
https://github.com/ModelOriented/modelStudio
📍 Interactive Studio for Explanatory Model Analysis
ai explainable explainable-ai explainable-machine-learning explanatory-model-analysis human iml interactive interactivity interpretability interpretable interpretable-machine-learning learning machine model model-visualization r visualization xai
Last synced: 17 Nov 2024
https://github.com/hbaniecki/adversarial-explainable-ai
💡 Adversarial attacks on explanations and how to defend them
adversarial adversarial-attacks adversarial-examples adversarial-machine-learning attacks counterfactual deep defense evaluation explainability explainable-ai iml interpretability interpretable interpretable-machine-learning model responsible-ai robustness security xai
Last synced: 25 Mar 2025
https://github.com/joaolages/diffusers-interpret
Diffusers-Interpret 🤗🧨🕵️♀️: Model explainability for 🤗 Diffusers. Get explanations for your generated images.
computer-vision deep-learning diffusers diffusion explainable-ai image-generation interpretability model-explainability primary-attributions pytorch text2image transformers
Last synced: 05 Apr 2025
https://github.com/iancovert/sage
For calculating global feature importance using Shapley values.
explainability interpretability machine-learning shapley
Last synced: 26 Mar 2025
https://github.com/stevekgyang/mentalllama
This repository introduces MentaLLaMA, the first open-source instruction following large language model for interpretable mental health analysis.
chatgpt gpt4 interpretability language-model large-language-models llama2 mental-health natural-language-processing natural-language-understanding social-media
Last synced: 09 Apr 2025
https://github.com/AI4LIFE-GROUP/OpenXAI
OpenXAI : Towards a Transparent Evaluation of Model Explanations
benchmark explainability explainable-ai interpretability leaderboard reproducibility
Last synced: 11 Nov 2024
https://github.com/chr5tphr/zennit
Zennit is a high-level framework in Python using PyTorch for explaining/exploring neural networks using attribution methods like LRP.
attribution deep-learning explainability explainable-ai feature-attribution interpretability interpretable-ai interpretable-ml lrp machine-learning python pytorch xai
Last synced: 09 Apr 2025
https://github.com/SteveKGYang/MentalLLaMA
This repository introduces MentaLLaMA, the first open-source instruction following large language model for interpretable mental health analysis.
chatgpt gpt4 interpretability language-model large-language-models llama2 mental-health natural-language-processing natural-language-understanding social-media
Last synced: 01 Apr 2025
https://github.com/pralab/secml
A Python library for Secure and Explainable Machine Learning
adversarial-machine-learning algorithms artificial-intelligence attack-algorithms cleverhans evasion-attacks explainable-machine-learning foolbox interpretability machine-learning matplotlib neural-networks poisoning-attacks python python-library pytorch secml security sparse-data tensorflow
Last synced: 21 Apr 2025
https://github.com/jrieke/cnn-interpretability
🏥 Visualizing Convolutional Networks for MRI-based Diagnosis of Alzheimer’s Disease
alzheimer-disease-prediction alzheimers-disease cnn convolutional-neural-networks deep-learning interpretability interpretable-machine-learning machine-learning medical-imaging mri visualization-methods
Last synced: 14 Apr 2025
https://github.com/austinrochford/pycebox
⬛ Python Individual Conditional Expectation Plot Toolbox
interpretability machine-learning
Last synced: 10 Apr 2025
https://github.com/AustinRochford/PyCEbox
⬛ Python Individual Conditional Expectation Plot Toolbox
interpretability machine-learning
Last synced: 20 Apr 2025
https://github.com/graph-com/gsat
[ICML 2022] Graph Stochastic Attention (GSAT) for interpretable and generalizable graph learning.
deep-learning graph-neural-networks interpretability interpretable-machine-learning pytorch xai
Last synced: 03 Dec 2024
https://github.com/pietrobarbiero/pytorch_explain
PyTorch Explain: Interpretable Deep Learning in Python.
deep-learning entropy explainability explainable-ai interpretability interpretable-ai interpretable-deep-learning interpretable-machine-learning lens logic machine-learning neural-network python pytorch sympy
Last synced: 06 Apr 2025
https://github.com/google-research/reverse-engineering-neural-networks
A collection of tools for reverse engineering neural networks.
deep-learning interpretability machine-learning
Last synced: 07 Apr 2025
https://github.com/Graph-COM/GSAT
[ICML 2022] Graph Stochastic Attention (GSAT) for interpretable and generalizable graph learning.
deep-learning graph-neural-networks interpretability interpretable-machine-learning pytorch xai
Last synced: 28 Nov 2024
https://github.com/EleutherAI/knowledge-neurons
A library for finding knowledge neurons in pretrained transformer models.
Last synced: 15 Nov 2024
https://github.com/eleutherai/knowledge-neurons
A library for finding knowledge neurons in pretrained transformer models.
Last synced: 26 Dec 2024
https://github.com/poloclub/timbertrek
Explore and compare 1K+ accurate decision trees in your browser!
decision-tree interactive-visualizations interpretability rashomon visualization
Last synced: 15 Nov 2024
https://github.com/vanderschaarlab/autoprognosis
A system for automating the design of predictive modeling pipelines tailored for clinical prognosis.
automl healthcare interpretability survival-analysis
Last synced: 12 Apr 2025
https://github.com/mahmoodlab/survpath
Modeling Dense Multimodal Interactions Between Biological Pathways and Histology for Survival Prediction - CVPR 2024
histology-transcriptomics interpretability mahmoodlab pathology pathology-genomics pathology-representation pathways survpath
Last synced: 05 Apr 2025
https://github.com/csinva/hierarchical-dnn-interpretations
Using / reproducing ACD from the paper "Hierarchical interpretations for neural network predictions" 🧠 (ICLR 2019)
acd ai artificial-intelligence convolutional-neural-networks data-science deep-learning deep-neural-networks explainability explainable-ai feature-importance iclr interpretability interpretation jupyter-notebook machine-learning ml neural-network python pytorch statistics
Last synced: 12 Apr 2025
https://github.com/laura-rieger/deep-explanation-penalization
Code for using CDEP from the paper "Interpretations are useful: penalizing explanations to align neural networks with prior knowledge" https://arxiv.org/abs/1909.13584
ai artificial-intelligence cdep convolutional-neural-network data-science deep-learning explainability explainable-ai fairness fairness-ml feature-importance interpretability interpretable-deep-learning jupyter-notebook machine-learning ml neural-network python pytorch recurrent-neural-network
Last synced: 15 Nov 2024
https://github.com/kennethenevoldsen/asent
Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.
interpretability natural-language-processing nlp python3 sentiment-analysis spacy spacy-extensions
Last synced: 06 Apr 2025
https://github.com/interpretml/gam-changer
Editing machine learning models to reflect human knowledge and values
interpretability machine-learning visualization
Last synced: 10 Nov 2024
https://github.com/fredhohman/summit
🏔️ Summit: Scaling Deep Learning Interpretability by Visualizing Activation and Attribution Summarizations
deep-learning deep-learning-visualization interactive-interface interactive-visualization interpretability
Last synced: 11 Apr 2025
https://github.com/julia-xai/explainableai.jl
Explainable AI in Julia.
attribution-methods explainable-ai feature-attribution interpretability interpretable-ai julia lrp xai
Last synced: 05 Apr 2025
https://github.com/jasonjmcghee/livelove
Love2D LSP (VS Code / Neovim / Zed / etc.) extension for live coding and live variable tracking
interpretability language-server-protocol live live-coding love2d lsp neovim-plugin nvim-plugin observability vscode-extension zed-extension
Last synced: 14 Apr 2025
https://github.com/pbiecek/breakDown
Model Agnostics breakDown plots
data-science iml interpretability machine-learning visual-explanations xai
Last synced: 11 Nov 2024
https://github.com/pbiecek/breakdown
Model Agnostics breakDown plots
data-science iml interpretability machine-learning visual-explanations xai
Last synced: 09 Apr 2025
https://github.com/M-Nauta/ProtoTree
ProtoTrees: Neural Prototype Trees for Interpretable Fine-grained Image Recognition, published at CVPR2021
computer-vision cvpr2021 decision-trees deep-neural-networks explainability explainable-ai explainable-ml fine-grained-classification fine-grained-visual-categorization interpretability interpretable-deep-learning interpretable-machine-learning pytorch
Last synced: 15 Nov 2024
https://github.com/snehankekre/streamlit-shap
streamlit-shap provides a wrapper to display SHAP plots in Streamlit.
explainability interpretability machine-learning shap shapley streamlit streamlit-component
Last synced: 19 Dec 2024
https://github.com/alstonlo/torch-influence
A simple PyTorch implementation of influence functions.
deep-learning influence-functions interpretability machine-learning
Last synced: 17 Dec 2024
https://github.com/ModelOriented/iBreakDown
Break Down with interactions for local explanations (SHAP, BreakDown, iBreakDown)
breakdown iml interpretability shapley xai
Last synced: 14 Mar 2025
https://github.com/mertyg/debug-mistakes-cce
Meaningfully debugging model mistakes with conceptual counterfactual explanations. ICML 2022
concepts counterfactual-explanations explanations interpretability
Last synced: 19 Nov 2024
https://github.com/csinva/imodelsx
Scikit-learn friendly library to interpret, and prompt-engineer text datasets using large language models.
ai deep-learning explainability huggingface interpretability language-model machine-learning ml natural-language-processing natural-language-understanding neural-network pytorch scikit-learn text text-classification transformer-models xai
Last synced: 26 Feb 2025
https://github.com/csinva/imodelsX
Scikit-learn friendly library to interpret, and prompt-engineer text datasets using large language models.
ai deep-learning explainability huggingface interpretability language-model machine-learning ml natural-language-processing natural-language-understanding neural-network pytorch scikit-learn text text-classification transformer-models xai
Last synced: 13 Nov 2024
https://github.com/fat-forensics/fat-forensics
Modular Python Toolbox for Fairness, Accountability and Transparency Forensics
accountability explainability explainable-ai fairness interpretability interpretable-ai machine-learning transparency
Last synced: 27 Mar 2025
https://github.com/nyuvis/explanation_explorer
A user interface to interpret machine learning models.
interpretability machine-learning visual-interface visualization-application
Last synced: 03 Apr 2025
https://github.com/arabiaweather/athena
Automatic equation building and curve fitting. Runs on Tensorflow. Built for academia and research.
academia curve-fitting equation-solver interpretability machine-learning optimization research-tool simulation-framework symbolic-computation symbolic-regression tensorflow
Last synced: 25 Mar 2025
https://github.com/pfnet-research/bayesgrad
BayesGrad: Explaining Predictions of Graph Convolutional Networks
chainer chemistry deep-learning graph-convolutional-networks interpretability neural-network python saliency
Last synced: 13 Apr 2025
https://github.com/chirag-agarwall/VOG
Estimating Example Difficulty using Variance of Gradients
atypical-examples deep-learning explainability human-in-the-loop-auditing interpretability
Last synced: 15 Nov 2024
https://github.com/taufeeque9/codebook-features
Sparse and discrete interpretability tool for neural networks
codebook features interpretability language-model mechanistic-interpretability transformers
Last synced: 13 Apr 2025
https://github.com/aredier/trelawney
General Interpretability Package
graphics interpretability machine-learning python
Last synced: 27 Nov 2024
https://github.com/ramprs/neuron-importance-zsl
[ECCV 2018] code for Choose Your Neuron: Incorporating Domain Knowledge Through Neuron Importance
grad-cam interpretability neuron-importance zero-shot-learning
Last synced: 19 Nov 2024
https://github.com/mertyg/post-hoc-cbm
Code for the paper "Post-hoc Concept Bottleneck Models". Spotlight @ ICLR 2023
concept-based-explanations concept-based-models concepts explainability interpretability
Last synced: 19 Nov 2024
https://github.com/microsoft/automated-explanations
Generating and validating natural-language explanations.
artificial-intelligence automated-interpretability data-science explanation fmri fmri-data-analysis gpt gpt4 huggingface interpretability language-model large-language-models machine-learning mechanistic-interpretability neuroscience xai
Last synced: 10 Feb 2025