https://github.com/humancompatibleai/learning-from-human-preferences
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
- Host: GitHub
- URL: https://github.com/humancompatibleai/learning-from-human-preferences
- Owner: HumanCompatibleAI
- License: MIT
- Created: 2019-11-13T22:57:33.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2021-07-27T19:46:00.000Z (almost 5 years ago)
- Last Synced: 2025-05-30T03:40:44.911Z (about 1 year ago)
- Language: Python
- Homepage:
- Size: 522 KB
- Stars: 31
- Watchers: 5
- Forks: 7
- Open Issues: 0