Projects in Awesome Lists tagged with sparse-autoencoders
A curated list of projects in awesome lists tagged with sparse-autoencoders.
https://github.com/ysh329/Chinese-UFLDL-Tutorial
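[UNMAINTAINED] A Chinese-language tutorial on unsupervised feature learning and deep learning, translated from the new version of the UFLDL Tutorial. Newcomers are advised to study Stanford's CS231n course instead, which also has a version with Chinese subtitles on NetEase Cloud Classroom.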
convolutional-neural-networks exercise sparse-autoencoders supervised-neural-network taught-learning unsupervised-learning
Last synced: 07 May 2025
https://github.com/dmis-lab/monet
[ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers
iclr iclr2025 interpretability large-language-models mixture-of-experts sparse-autoencoders
Last synced: 05 May 2025
https://github.com/openmoss/language-model-saes
For OpenMOSS Mechanistic Interpretability Team's Sparse Autoencoder (SAE) research.
interpretability mechanistic-interpretability sparse-autoencoders sparse-dictionary
Last synced: 12 Apr 2026
https://github.com/dynamical-inference/patchsae
Implementation of PatchSAE as presented in "Sparse autoencoders reveal selective remapping of visual concepts during adaptation"
explainable-ai pytorch sae sparse-autoencoders xai
Last synced: 11 Jan 2026
https://github.com/butanium/tiny-activation-dashboard
A tiny, easily hackable implementation of a feature dashboard.
feature-dashboard feature-visualization sparse-autoencoder sparse-autoencoders
Last synced: 03 May 2026
https://github.com/255bits/sae-evolver
Use evolution with sparse autoencoders
evolutionary-algorithms python sparse-autoencoders
Last synced: 02 Sep 2025
https://github.com/mikolajszawerda/music-sae
Sparse Autoencoders (SAEs) for unsupervised music representation learning.
machine-learning music musicgen rave sparse-autoencoders yue
Last synced: 28 Oct 2025
https://github.com/jwuphysics/euclid-galaxy-morphology-saes
Studying (self-)supervised representations of Euclid galaxy imaging via SAEs
astronomy computer-vision galaxies mechanistic-interpretability sparse-autoencoders
Last synced: 16 Jan 2026
https://github.com/designer-coderajay/glassbox-mech
Open-source EU AI Act Annex IV compliance toolkit. Mechanistic interpretability + circuit discovery for transformers. One function call generates a court-ready evidence package.
alignment annex-iv attribution-patching black-box-testing circuit-discovery compliance-audit eu-ai-act explainability fastapi gpt2 llm-compliance logit-lens mcp mechanistic-interpretability pytorch regulatory-compliance sae sparse-autoencoders transformer-circuits transformerlens
Last synced: 04 Jun 2026
https://github.com/decoderesearch/synth-sae-bench-experiments
Code for the paper "SynthSAEBench: Evaluating Sparse Autoencoders on Scalable Realistic Synthetic Data"
ai interpretability machine-learning sparse-autoencoders
Last synced: 05 Mar 2026
https://github.com/lennart-finke/classifier-interp
Training Sparse Autoencoders on Prompt-Guard
ai-safety jailbreak sae sparse-autoencoders
Last synced: 14 Feb 2026
https://github.com/martinkersner/autoencoder-meetup
Presentation about Autoencoders for Seoul AI Meetup on July 8, 2017.
autoencoder sparse-autoencoders tensorflow variational-autoencoders
Last synced: 29 Apr 2026