https://github.com/humancompatibleai/interpreting-rewards

Experiments in applying interpretability techniques to learned reward functions.

Topics: deep-reinforcement-learning, interpretability, reward-learning

Last synced: 11 months ago
