An open API service indexing awesome lists of open source software.

https://github.com/CarperAI/Polygraph

RLHF Mechanistic Interpretability and Deception
https://github.com/CarperAI/Polygraph

Last synced: 22 days ago
JSON representation

RLHF Mechanistic Interpretability and Deception

Awesome Lists containing this project

README

        

# Polygraph
RLHF Mechanistic Interpretability and Deception