https://github.com/yfzhang114/r1_reward

✨✨R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning

Host: GitHub
URL: https://github.com/yfzhang114/r1_reward
Owner: yfzhang114
License: apache-2.0
Created: 2025-05-05T16:16:47.000Z
Default Branch: main
Last Pushed: 2025-05-05T16:57:54.000Z
Last Synced: 2025-05-05T18:02:36.474Z
Language: Python
Homepage:
Size: 1.85 MB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
