An open API service indexing awesome lists of open source software.

https://github.com/yfzhang114/r1_reward

✨✨R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning
https://github.com/yfzhang114/r1_reward

Last synced: about 2 months ago
JSON representation

✨✨R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning

Awesome Lists containing this project