Projects in Awesome Lists tagged with behavioral-evaluation
A curated list of projects in awesome lists tagged with behavioral-evaluation .
https://github.com/solomonb14d3/knowledge-fidelity
Behavioral auditing toolkit for LLMs: rho-audit measures factual accuracy, bias, sycophancy, toxicity, and reasoning via teacher-forced confidence probes. SVD compression with knowledge preservation. Steering vectors for runtime behavioral control. 12-model merge audit across SLERP/TIES/DARE-TIES/Linear.
activation-engineering behavioral-evaluation bias-detection confidence interpretability llm-compression mergekit model-auditing model-merging pytorch rho-audit steering-vectors svd sycophancy transformers truthfulness
Last synced: 27 Feb 2026