An open API service indexing awesome lists of open source software.

https://github.com/zyuanlim/awesome-open-reasoning

A curated list of awesome open-source and open-weight language models or methods focused on reasoning capabilities.
https://github.com/zyuanlim/awesome-open-reasoning

List: awesome-open-reasoning

artificial-intelligence awesome awesome-list chain-of-thought cot inference-time-compute language-models large-reasoning-models llm machine-learning nlp reasoning reasoning-language-models test-time-compute transformers

Last synced: 5 months ago
JSON representation

A curated list of awesome open-source and open-weight language models or methods focused on reasoning capabilities.

Awesome Lists containing this project

README

        

# Awesome Open Reasoning LLMs[![Awesome](https://awesome.re/badge.svg)](https://awesome.re)

A curated list of awesome open-source and open-weight language models or methods focused on reasoning capabilities.

> Open-weight models are models that have been released with their weights and can be used for inference. Open-source models are models that have been released with their weights and the source code and/or data used to train them.

## Contents

- [Awesome Open Reasoning LLMs](#awesome-open-reasoning-llms)
- [Contents](#contents)
- [LLMs](#llms)
- [Contributing](#contributing)
- [License](#license)

## LLMs

| Model / Method | Code Repository | Hugging Face | Blog / Announcement | Paper |
| --------------------------------------------------------------------------------- | --------------------------------------------------------- | -------------------------------------------------------------------------------------------------------- | --------------------------------------------------------------------------------------------------------------------------- | ----------------------------------------- |
| Entropix: Entropy Based Sampling and Parallel CoT Decoding | [GitHub](https://github.com/xjdr-alt/entropix) | | | |
| Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions | [GitHub](https://github.com/AIDC-AI/Marco-o1) | [Model](https://huggingface.co/AIDC-AI/Marco-o1) | | [Paper](https://arxiv.org/abs/2411.14405) |
| OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models | [GitHub](https://github.com/openreasoner/openr) | [Collection](https://huggingface.co/openreasoner) | | [Paper](https://arxiv.org/abs/2410.09671) |
| PRIME: Process Reinforcement Through Implicit Rewards | [GitHub](https://github.com/PRIME-RL/PRIME) | [Collection](https://huggingface.co/PRIME-RL) | [Blog](https://curvy-check-498.notion.site/Process-Reinforcement-through-Implicit-Rewards-15f4fcb9c42180f1b498cc9b2eaf896f) | |
| QVQ-72B-Preview | | [Model](https://huggingface.co/Qwen/QVQ-72B-Preview) | [Blog](https://qwenlm.github.io/blog/qvq-72b-preview/) | |
| QwQ-32B-Preview | | [Model](https://huggingface.co/Qwen/QwQ-32B-Preview) | [Blog](https://qwenlm.github.io/blog/qwq-32b-preview/) | |
| Scaling Test-Time Compute with Open Models | [GitHub](https://github.com/huggingface/search-and-learn) | [Discussion](https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute/discussions) | [Blog](https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute) | |
| SmallThinker-3B-Preview | | [Model](https://huggingface.co/PowerInfer/SmallThinker-3B-Preview) | [Reddit](https://www.reddit.com/r/LocalLLaMA/comments/1hpop3y/introducing_smallthinker3bpreview_an_o1like/) | |
| Steiner | | [Collection](https://huggingface.co/collections/peakji/steiner-preview-6712c6987110ce932a44e9a6) | [Blog](https://medium.com/@peakji/a-small-step-towards-reproducing-openai-o1-b9a756a00855) | |
| UwU | | [Model](https://huggingface.co/qingy2024/UwU-7B-Instruct) | [Reddit](https://www.reddit.com/r/LocalLLaMA/comments/1hudfsf/uwu_7b_instruct/) | |

## Contributing

Please read [CONTRIBUTING.md](CONTRIBUTING.md) for details on submitting pull requests.

## License

This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.