https://github.com/vicgalle/zero-shot-reward-models
ZYN: Zero-Shot Reward Models with Yes-No Questions
https://github.com/vicgalle/zero-shot-reward-models
llm reinforcement-learning reward-models rlaif rlhf trlx zero-shot
Last synced: 4 months ago
JSON representation
ZYN: Zero-Shot Reward Models with Yes-No Questions
- Host: GitHub
- URL: https://github.com/vicgalle/zero-shot-reward-models
- Owner: vicgalle
- License: mit
- Created: 2023-03-03T19:50:41.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2023-08-15T14:12:24.000Z (almost 2 years ago)
- Last Synced: 2024-05-01T15:58:30.339Z (about 1 year ago)
- Topics: llm, reinforcement-learning, reward-models, rlaif, rlhf, trlx, zero-shot
- Language: Python
- Homepage:
- Size: 1.3 MB
- Stars: 31
- Watchers: 2
- Forks: 7
- Open Issues: 1