Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/jeremy-collins/robot-rlhf
Robot Learning from Human Feedback. Inspired by advancements in NLP, we train a robot policy via reinforcement learning using a reward function learned exclusively from human preferences.
Last synced: 3 months ago
- Host: GitHub
- URL: https://github.com/jeremy-collins/robot-rlhf
- Owner: jeremy-collins
- Created: 2023-04-16T02:39:45.000Z (almost 2 years ago)
- Default Branch: main
- Last Pushed: 2023-04-16T02:42:49.000Z (almost 2 years ago)
- Last Synced: 2024-08-01T21:47:45.553Z (6 months ago)
- Topics: alignment, chatgpt, reinforcement-learning, rlhf, robotics
- Language: Python
- Homepage:
- Size: 348 KB
- Stars: 5
- Watchers: 2
- Forks: 0
- Open Issues: 1
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- awesome-human-in-the-loop - GitHub - jeremy-collins/robot-rlhf
README
# robot-rlhf
Robot Learning through Human Feedback. Inspired by advancements in NLP, we train a robot policy via reinforcement learning using a reward function learned exclusively from human preferences.
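
The README does not show how the reward function is learned, so the following is only a minimal sketch of one common approach, not the repository's code. It assumes a Bradley-Terry preference model over pairs of trajectory segments and a linear reward on per-step features. All names here (`segment_return`, `train_reward`, `make_synthetic_prefs`) and the synthetic "human" labels are hypothetical and exist only for illustration.

```python
import math
import random


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def segment_return(weights, segment):
    """Sum of linear rewards w . x over every step's feature vector."""
    return sum(sum(w * x for w, x in zip(weights, step)) for step in segment)


def preference_prob(weights, seg_a, seg_b):
    """Bradley-Terry probability that seg_a is preferred over seg_b."""
    return sigmoid(segment_return(weights, seg_a) - segment_return(weights, seg_b))


def train_reward(prefs, dim, lr=0.1, epochs=200):
    """Fit reward weights by SGD on the cross-entropy of preference labels.

    prefs is a list of (seg_a, seg_b, label), with label 1.0 if seg_a was
    preferred and 0.0 otherwise.
    """
    weights = [0.0] * dim
    for _ in range(epochs):
        for seg_a, seg_b, label in prefs:
            p = preference_prob(weights, seg_a, seg_b)
            for i in range(dim):
                # Difference of summed features is the gradient of the logit.
                feat_diff = sum(s[i] for s in seg_a) - sum(s[i] for s in seg_b)
                weights[i] -= lr * (p - label) * feat_diff
    return weights


def make_synthetic_prefs(n_pairs, dim, true_weights, steps=5, seed=0):
    """Assumption: stand-in for human labels, generated from hidden weights."""
    rng = random.Random(seed)

    def random_segment():
        return [[rng.uniform(-1.0, 1.0) for _ in range(dim)] for _ in range(steps)]

    prefs = []
    for _ in range(n_pairs):
        a, b = random_segment(), random_segment()
        label = 1.0 if segment_return(true_weights, a) > segment_return(true_weights, b) else 0.0
        prefs.append((a, b, label))
    return prefs


def accuracy(weights, prefs):
    """Fraction of pairs where the learned reward agrees with the label."""
    hits = sum((preference_prob(weights, a, b) > 0.5) == (label == 1.0) for a, b, label in prefs)
    return hits / len(prefs)


prefs = make_synthetic_prefs(n_pairs=50, dim=2, true_weights=[1.0, -1.0])
learned = train_reward(prefs, dim=2)
print("learned weights:", learned)
print("agreement with labels:", accuracy(learned, prefs))
```

In a full pipeline the learned reward would replace the environment reward when training the policy with a reinforcement learning algorithm; that step is omitted here because the README gives no details about it.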