An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with sparse-autoencoder

A curated list of projects in awesome lists tagged with sparse-autoencoder.

https://github.com/paulpauls/llama3_interpretability_sae

A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and fully reproducible.

feature-extraction feature-steering llama3 llm-interpretability open-research pytorch sparse-autoencoder

Last synced: 12 Jul 2025

https://github.com/explanare/ravel

Evaluate interpretability methods on localizing and disentangling concepts in LLMs.

causal-intervention disentangled-representations interpretability intervention probing sparse-autoencoder

Last synced: 10 Mar 2026

https://github.com/recombee/compressae

Sparse Embedding Compression for Scalable Retrieval in Recommender Systems

embedding-compression recommender-systems sae similarity-search sparse-autoencoder

Last synced: 04 Apr 2026

https://github.com/butanium/tiny-activation-dashboard

A tiny, easily hackable implementation of a feature dashboard.

feature-dashboard feature-visualization sparse-autoencoder sparse-autoencoders

Last synced: 03 May 2026

https://github.com/seonglae/emgsd-hermes

Steering GPT2-EMGSD to be less biased and generating stereotyped text with vanilla GPT2, without fine-tuning or prompt engineering.

bias-correction bias-mitigation emgsd gpt2 sparse-autoencoder steering-vector stereotype

Last synced: 19 Feb 2026

https://github.com/hinanohart/circuitbench

Integrated mechanistic interpretability and sparse autoencoder framework for hybrid SSM-Attention models (Mamba-2, Hymba, RWKV-7). v0.1.2 alpha: real forward-pass intervention and mean-ablation patching shipped, with CPU smoke tests; GPU/real adapters in v0.2.

alignment hymba interpretability mamba mamba-2 mechanistic-interpretability pytorch rwkv sae sparse-autoencoder ssm state-space-model transformer-alternatives

Last synced: 15 Jun 2026