Projects in Awesome Lists tagged with ai-interpretability
A curated list of projects in awesome lists tagged with ai-interpretability.
https://github.com/alextmjugador/redwoodresearch-interp-docker
📦 Redwood Research's transformer interpretability tools, conveniently packaged in a Docker container for simple and reproducible deployments.
ai ai-interpretability ai-safety docker redwood-research
Last synced: 21 Mar 2025
https://github.com/wondermongering/linguisticperturber
Probing linguistic robustness in transformers: a quantum-inspired approach to AI interpretability
adversarial-examples ai-interpretability ai-safety computational-linguistics language-model-analysis machine-learning natural-language-processing perturbation-analysis probabilistic-models transformer-models word-embeddings
Last synced: 16 Mar 2025