Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/CarperAI/Polygraph

RLHF Mechanistic Interpretability and Deception
https://github.com/CarperAI/Polygraph

Last synced: 17 days ago
JSON representation

RLHF Mechanistic Interpretability and Deception

Awesome Lists containing this project

README

        

# Polygraph
RLHF Mechanistic Interpretability and Deception