https://github.com/jianzhnie/Open-R1
The open source implementation of ChatGPT, Alpaca, Vicuna and RLHF Pipeline. 从0开始实现一个ChatGPT.
https://github.com/jianzhnie/Open-R1
chatgpt gpt llama llm lora peft ppo rlhf stanford-alpaca
Last synced: 26 days ago
JSON representation
The open source implementation of ChatGPT, Alpaca, Vicuna and RLHF Pipeline. 从0开始实现一个ChatGPT.
- Host: GitHub
- URL: https://github.com/jianzhnie/Open-R1
- Owner: jianzhnie
- License: apache-2.0
- Created: 2023-03-08T03:56:10.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2024-05-22T08:54:48.000Z (over 1 year ago)
- Last Synced: 2024-10-29T10:00:38.257Z (about 1 year ago)
- Topics: chatgpt, gpt, llama, llm, lora, peft, ppo, rlhf, stanford-alpaca
- Language: Python
- Homepage: https://jianzhnie.github.io/llmtech
- Size: 751 KB
- Stars: 175
- Watchers: 12
- Forks: 33
- Open Issues: 2
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-human-in-the-loop - Github - jianzhnie/open-chatgpt
- StarryDivineSky - jianzhnie/Open-R1 - R1 是一个开源项目,旨在复现 DeepSeek-R1 模型。它提供了 DeepSeek-R1 的开源实现,方便研究者和开发者学习和使用该模型。该项目可能包含模型的架构、训练代码、预训练权重等资源,方便用户进行二次开发和定制。通过这个项目,用户可以深入了解 DeepSeek-R1 的工作原理,并将其应用于各种自然语言处理任务。具体实现细节和使用方法请参考项目文档和代码。该项目由 jianzhnie 发起并维护。 (A01_文本生成_文本对话 / 大语言对话模型及数据)
README
# Open-R1: An open-source implementation of DeepSeek-R1
[](https://github.com/jianzhnie/Open-R1/blob/main/LICENSE)
[](https://www.python.org/downloads/release/python-390/)
[](https://github.com/psf/black)
## Introduction
`Open-R1` is a open-source library that allows you to train a hyper-personalized DeepSeek-R1-like model using your own data and the least amount of compute possible.
## License
`Open-R1` is released under the Apache 2.0 license.
## Acknowledgements
We appreciate the work by many open-source contributors, especially:
- [Hugging Face](https://huggingface.co/)
- [Vicuna](https://github.com/lm-sys/FastChat/)