https://github.com/yfzhang114/r1_reward
✨✨R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning
https://github.com/yfzhang114/r1_reward
Last synced: about 2 months ago
JSON representation
✨✨R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning
- Host: GitHub
- URL: https://github.com/yfzhang114/r1_reward
- Owner: yfzhang114
- License: apache-2.0
- Created: 2025-05-05T16:16:47.000Z (about 2 months ago)
- Default Branch: main
- Last Pushed: 2025-05-05T16:57:54.000Z (about 2 months ago)
- Last Synced: 2025-05-05T18:02:36.474Z (about 2 months ago)
- Language: Python
- Homepage:
- Size: 1.85 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0