https://github.com/jeremy-collins/robot-rlhf

Robot Learning from Human Feedback. Inspired by advancements in NLP, we train a robot policy via reinforcement learning using a reward function learned exclusively from human preferences.
https://github.com/jeremy-collins/robot-rlhf

alignment chatgpt reinforcement-learning rlhf robotics

Last synced: 3 months ago
JSON representation

Robot Learning from Human Feedback. Inspired by advancements in NLP, we train a robot policy via reinforcement learning using a reward function learned exclusively from human preferences.

Host: GitHub
URL: https://github.com/jeremy-collins/robot-rlhf
Owner: jeremy-collins
Created: 2023-04-16T02:39:45.000Z (about 2 years ago)
Default Branch: main
Last Pushed: 2023-04-16T02:42:49.000Z (about 2 years ago)
Last Synced: 2024-08-01T21:47:45.553Z (11 months ago)
Topics: alignment, chatgpt, reinforcement-learning, rlhf, robotics
Language: Python
Homepage:
Size: 348 KB
Stars: 5
Watchers: 2
Forks: 0
Open Issues: 1
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

awesome-human-in-the-loop - Github - jeremy-collins/robot-rlhf

README

        # robot-rlhf

Robot Learning through Human Feedback. Inspired by advancements in NLP, we train a robot policy via reinforcement learning using a reward function learned exclusively from human preferences.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/jeremy-collins/robot-rlhf

Awesome Lists containing this project

README