https://github.com/jianzhnie/Open-R1

The open source implementation of ChatGPT, Alpaca, Vicuna and RLHF Pipeline. 从0开始实现一个ChatGPT.
https://github.com/jianzhnie/Open-R1

chatgpt gpt llama llm lora peft ppo rlhf stanford-alpaca

Last synced: 8 months ago
JSON representation

The open source implementation of ChatGPT, Alpaca, Vicuna and RLHF Pipeline. 从0开始实现一个ChatGPT.

Host: GitHub
URL: https://github.com/jianzhnie/Open-R1
Owner: jianzhnie
License: apache-2.0
Created: 2023-03-08T03:56:10.000Z (over 3 years ago)
Default Branch: main
Last Pushed: 2024-05-22T08:54:48.000Z (about 2 years ago)
Last Synced: 2024-10-29T10:00:38.257Z (over 1 year ago)
Topics: chatgpt, gpt, llama, llm, lora, peft, ppo, rlhf, stanford-alpaca
Language: Python
Homepage: https://jianzhnie.github.io/llmtech
Size: 751 KB
Stars: 175
Watchers: 12
Forks: 33
Open Issues: 2
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

awesome-human-in-the-loop - Github - jianzhnie/open-chatgpt
StarryDivineSky - jianzhnie/Open-R1 - R1 是一个开源项目，旨在复现 DeepSeek-R1 模型。它提供了 DeepSeek-R1 的开源实现，方便研究者和开发者学习和使用该模型。该项目可能包含模型的架构、训练代码、预训练权重等资源，方便用户进行二次开发和定制。通过这个项目，用户可以深入了解 DeepSeek-R1 的工作原理，并将其应用于各种自然语言处理任务。具体实现细节和使用方法请参考项目文档和代码。该项目由 jianzhnie 发起并维护。 (A01_文本生成_文本对话 / 大语言对话模型及数据)

README

          # Open-R1: An open-source implementation of DeepSeek-R1

[![Code License](https://img.shields.io/badge/Code%20License-Apache_2.0-green.svg)](https://github.com/jianzhnie/Open-R1/blob/main/LICENSE)

[![Python 3.9+](https://img.shields.io/badge/python-3.9+-blue.svg)](https://www.python.org/downloads/release/python-390/)

[![Code style: black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/psf/black)

## Introduction

`Open-R1` is a open-source library that allows you to train a hyper-personalized DeepSeek-R1-like model using your own data and the least amount of compute possible.

## License

`Open-R1` is released under the Apache 2.0 license.

## Acknowledgements

We appreciate the work by many open-source contributors, especially:

- [Hugging Face](https://huggingface.co/)

- [Vicuna](https://github.com/lm-sys/FastChat/)

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/jianzhnie/Open-R1

Awesome Lists containing this project

README