https://github.com/agentica-project/deepscaler

Last synced: 8 months ago

Democratizing Reinforcement Learning for LLMs

Awesome-RL-for-LRMs - deepscaler
awesome-llm-and-aigc - DeepScaleR: Democratizing Reinforcement Learning for LLMs. [www.agentica-project.com](https://www.agentica-project.com/). ["DeepScaleR: Surpassing O1-Preview with a 1.5B Model by Scaling RL"](https://pretty-radio-b75.notion.site/DeepScaleR-Surpassing-O1-Preview-with-a-1-5B-Model-by-Scaling-RL-19681902c1468005bed8ca303013a4e2) (Summary)
Awesome-Efficient-Reasoning-Models - DeepScaleR (Full list / Build SLM with Strong Reasoning Ability)
StarryDivineSky - agentica-project/deepscaler

ecosyste.ms