An open API service indexing awesome lists of open source software.

https://github.com/nevercase/qwen-grpo

This is a reproduction of deepseek thinking based on qwen and grpo trainer.
https://github.com/nevercase/qwen-grpo

Last synced: 12 months ago
JSON representation

This is a reproduction of deepseek thinking based on qwen and grpo trainer.

Awesome Lists containing this project

README

          

# qwen-grpo
This is a reproduction of deepseek thinking based on qwen and grpo trainer.