An open API service indexing awesome lists of open source software.

https://github.com/jeremy-collins/robot-rlhf

Robot Learning from Human Feedback. Inspired by advancements in NLP, we train a robot policy via reinforcement learning using a reward function learned exclusively from human preferences.
https://github.com/jeremy-collins/robot-rlhf

alignment chatgpt reinforcement-learning rlhf robotics

Last synced: 22 days ago
JSON representation

Robot Learning from Human Feedback. Inspired by advancements in NLP, we train a robot policy via reinforcement learning using a reward function learned exclusively from human preferences.

Awesome Lists containing this project

README

        

# robot-rlhf
Robot Learning through Human Feedback. Inspired by advancements in NLP, we train a robot policy via reinforcement learning using a reward function learned exclusively from human preferences.