Projects in Awesome Lists tagged with llm-interpretability
A curated list of projects in awesome lists tagged with llm-interpretability.
https://github.com/paulpauls/llama3_interpretability_sae
A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and fully reproducible.
feature-extraction feature-steering llama3 llm-interpretability open-research pytorch sparse-autoencoder
Last synced: 12 Jul 2025
https://github.com/basics-lab/spectral-explain
Fast XAI with interactions at large scale. SPEX can help you understand the output of your LLM, even if you have a long context!
explainability explainable-ai llm-interpretability shap sparse-transformer xai
Last synced: 17 Mar 2026