Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/CarperAI/Polygraph

RLHF Mechanistic Interpretability and Deception

Last synced: 4 months ago


README

# Polygraph
RLHF Mechanistic Interpretability and Deception