https://github.com/zyuanlim/awesome-open-reasoning

A curated list of awesome open-source and open-weight language models or methods focused on reasoning capabilities.
https://github.com/zyuanlim/awesome-open-reasoning

List: awesome-open-reasoning

artificial-intelligence awesome awesome-list chain-of-thought cot inference-time-compute language-models large-reasoning-models llm machine-learning nlp reasoning reasoning-language-models test-time-compute transformers

Last synced: 5 months ago
JSON representation

A curated list of awesome open-source and open-weight language models or methods focused on reasoning capabilities.

Host: GitHub
URL: https://github.com/zyuanlim/awesome-open-reasoning
Owner: zyuanlim
License: mit
Created: 2025-01-04T15:44:44.000Z (5 months ago)
Default Branch: main
Last Pushed: 2025-01-06T13:12:43.000Z (5 months ago)
Last Synced: 2025-01-06T14:25:55.830Z (5 months ago)
Topics: artificial-intelligence, awesome, awesome-list, chain-of-thought, cot, inference-time-compute, language-models, large-reasoning-models, llm, machine-learning, nlp, reasoning, reasoning-language-models, test-time-compute, transformers
Homepage:
Size: 2.93 KB
Stars: 1
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE

Awesome Lists containing this project

README

        # Awesome Open Reasoning LLMs[![Awesome](https://awesome.re/badge.svg)](https://awesome.re)

A curated list of awesome open-source and open-weight language models or methods focused on reasoning capabilities.

> Open-weight models are models that have been released with their weights and can be used for inference. Open-source models are models that have been released with their weights and the source code and/or data used to train them.

## Contents

- [Awesome Open Reasoning LLMs](#awesome-open-reasoning-llms)

  - [Contents](#contents)

  - [LLMs](#llms)

  - [Contributing](#contributing)

  - [License](#license)

## LLMs

| Model / Method                                                                    | Code Repository                                           | Hugging Face                                                                                             | Blog / Announcement                                                                                                         | Paper                                     |

| --------------------------------------------------------------------------------- | --------------------------------------------------------- | -------------------------------------------------------------------------------------------------------- | --------------------------------------------------------------------------------------------------------------------------- | ----------------------------------------- |

| Entropix: Entropy Based Sampling and Parallel CoT Decoding                        | [GitHub](https://github.com/xjdr-alt/entropix)            |                                                                                                          |                                                                                                                             |                                           |

| Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions                  | [GitHub](https://github.com/AIDC-AI/Marco-o1)             | [Model](https://huggingface.co/AIDC-AI/Marco-o1)                                                         |                                                                                                                             | [Paper](https://arxiv.org/abs/2411.14405) |

| OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models | [GitHub](https://github.com/openreasoner/openr)           | [Collection](https://huggingface.co/openreasoner)                                                        |                                                                                                                             | [Paper](https://arxiv.org/abs/2410.09671) |

| PRIME: Process Reinforcement Through Implicit Rewards                             | [GitHub](https://github.com/PRIME-RL/PRIME)               | [Collection](https://huggingface.co/PRIME-RL)                                                            | [Blog](https://curvy-check-498.notion.site/Process-Reinforcement-through-Implicit-Rewards-15f4fcb9c42180f1b498cc9b2eaf896f) |                                           |

| QVQ-72B-Preview                                                                   |                                                           | [Model](https://huggingface.co/Qwen/QVQ-72B-Preview)                                                     | [Blog](https://qwenlm.github.io/blog/qvq-72b-preview/)                                                                      |                                           |

| QwQ-32B-Preview                                                                   |                                                           | [Model](https://huggingface.co/Qwen/QwQ-32B-Preview)                                                     | [Blog](https://qwenlm.github.io/blog/qwq-32b-preview/)                                                                      |                                           |

| Scaling Test-Time Compute with Open Models                                        | [GitHub](https://github.com/huggingface/search-and-learn) | [Discussion](https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute/discussions) | [Blog](https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute)                                      |                                           |

| SmallThinker-3B-Preview                                                           |                                                           | [Model](https://huggingface.co/PowerInfer/SmallThinker-3B-Preview)                                       | [Reddit](https://www.reddit.com/r/LocalLLaMA/comments/1hpop3y/introducing_smallthinker3bpreview_an_o1like/)                 |                                           |

| Steiner                                                                           |                                                           | [Collection](https://huggingface.co/collections/peakji/steiner-preview-6712c6987110ce932a44e9a6)         | [Blog](https://medium.com/@peakji/a-small-step-towards-reproducing-openai-o1-b9a756a00855)                                  |                                           |

| UwU                                                                               |                                                           | [Model](https://huggingface.co/qingy2024/UwU-7B-Instruct)                                                | [Reddit](https://www.reddit.com/r/LocalLLaMA/comments/1hudfsf/uwu_7b_instruct/)                                             |                                           |

## Contributing

Please read [CONTRIBUTING.md](CONTRIBUTING.md) for details on submitting pull requests.

## License

This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/zyuanlim/awesome-open-reasoning

Awesome Lists containing this project

README