Projects in Awesome Lists tagged with nanopi-r1
A curated list of projects in awesome lists tagged with nanopi-r1.
https://github.com/fool824/open-r1
Fully open reproduction of DeepSeek-R1
bananapi china cran deepseek llm nanopi nanopi-r1 ollama-gui openwrt r r1 sbc sdn-switch topre
Last synced: 02 Mar 2025
https://github.com/mikesterner87/nano-r1
This project demonstrates fine-tuning the Qwen2.5-3B-Instruct model with GRPO (Group Relative Policy Optimization) on the GSM8K dataset.
adapters build grpo huggingface nanopi nanopi-r1 nanopi-r1s openwrt python safetensors text-generation-inference transformer trl unsloth
Last synced: 10 Apr 2025