Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/jeremy-collins/robot-rlhf

Robot Learning from Human Feedback. Inspired by advancements in NLP, we train a robot policy via reinforcement learning using a reward function learned exclusively from human preferences.

alignment chatgpt reinforcement-learning rlhf robotics

Last synced: 4 months ago

README

# robot-rlhf
Robot Learning through Human Feedback. Inspired by advancements in NLP, we train a robot policy via reinforcement learning using a reward function learned exclusively from human preferences.
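
As a rough illustration of what "a reward function learned exclusively from human preferences" can look like, the sketch below fits a linear reward to pairwise trajectory comparisons using a Bradley-Terry model, a common choice in preference-based RL. It is not taken from this repository: the function names, the linear reward form, the 2-D state features, and the simulated annotator are all assumptions made for the example. The RL step that would then train a policy against the learned reward is omitted.

```python
import math
import random


def sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def trajectory_return(weights, trajectory):
    """Sum of per-step rewards r(s) = w . s (hypothetical linear reward)."""
    return sum(sum(w * x for w, x in zip(weights, state)) for state in trajectory)


def preference_probability(weights, traj_a, traj_b):
    """Bradley-Terry probability that traj_a is preferred over traj_b."""
    return sigmoid(trajectory_return(weights, traj_a) - trajectory_return(weights, traj_b))


def fit_reward(preferences, dim, lr=0.1, epochs=200):
    """Fit reward weights by stochastic gradient ascent on the preference log-likelihood.

    preferences: list of (preferred_trajectory, rejected_trajectory) pairs.
    """
    weights = [0.0] * dim
    for _ in range(epochs):
        for preferred, rejected in preferences:
            p = preference_probability(weights, preferred, rejected)
            for i in range(dim):
                # d/dw_i log p = (1 - p) * (feature sum of preferred - feature sum of rejected)
                feat_diff = sum(s[i] for s in preferred) - sum(s[i] for s in rejected)
                weights[i] += lr * (1.0 - p) * feat_diff
    return weights


def random_trajectory(length=5, dim=2):
    """Random trajectory of `length` states, each a `dim`-dimensional feature vector."""
    return [[random.uniform(-1.0, 1.0) for _ in range(dim)] for _ in range(length)]


def simulated_human_prefers(traj_a, traj_b):
    """Stand-in for a human label: prefers the larger total of feature 0 (assumption)."""
    return sum(s[0] for s in traj_a) > sum(s[0] for s in traj_b)


random.seed(0)
pairs = []
for _ in range(50):
    a, b = random_trajectory(), random_trajectory()
    pairs.append((a, b) if simulated_human_prefers(a, b) else (b, a))

learned_weights = fit_reward(pairs, dim=2)

# Two clearly different trajectories to query the learned reward with.
good = [[1.0, 0.0]] * 5
bad = [[-1.0, 0.0]] * 5
p_good_over_bad = preference_probability(learned_weights, good, bad)
```

Here `learned_weights[0]` ends up positive because every labeled pair pushes it in the direction the simulated annotator favors, so `p_good_over_bad` is above 0.5. In a real setup the labels would come from people comparing robot rollouts, and the linear reward would typically be replaced by a neural network.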