An open API service indexing awesome lists of open source software.

https://github.com/Corleno/KEPO

KEPO: Knowledge-Enhanced Preference Optimization for Reinforcement Learning with Reasoning
https://github.com/Corleno/KEPO

Last synced: 15 days ago
JSON representation

KEPO: Knowledge-Enhanced Preference Optimization for Reinforcement Learning with Reasoning

Awesome Lists containing this project

README

          

KEPO: Knowledge-Enhanced Preference Optimization for Reinforcement Learning with Reasoning

This code is built upon the **Med-R1** repo, it supports GKD, GRPO and KEPO algorithns on sota models such as Qwen-3VL. The dataset mainly focus on the multimodal medical data.