https://github.com/jeremy-collins/robot-rlhf
Robot Learning from Human Feedback. Inspired by advancements in NLP, we train a robot policy via reinforcement learning using a reward function learned exclusively from human preferences.
https://github.com/jeremy-collins/robot-rlhf
alignment chatgpt reinforcement-learning rlhf robotics
Last synced: 22 days ago
JSON representation
Robot Learning from Human Feedback. Inspired by advancements in NLP, we train a robot policy via reinforcement learning using a reward function learned exclusively from human preferences.
- Host: GitHub
- URL: https://github.com/jeremy-collins/robot-rlhf
- Owner: jeremy-collins
- Created: 2023-04-16T02:39:45.000Z (about 2 years ago)
- Default Branch: main
- Last Pushed: 2023-04-16T02:42:49.000Z (about 2 years ago)
- Last Synced: 2024-08-01T21:47:45.553Z (9 months ago)
- Topics: alignment, chatgpt, reinforcement-learning, rlhf, robotics
- Language: Python
- Homepage:
- Size: 348 KB
- Stars: 5
- Watchers: 2
- Forks: 0
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- awesome-human-in-the-loop - Github - jeremy-collins/robot-rlhf
README
# robot-rlhf
Robot Learning through Human Feedback. Inspired by advancements in NLP, we train a robot policy via reinforcement learning using a reward function learned exclusively from human preferences.