https://github.com/seanzhang-zhichen/llama3-chinese

Llama3-Chinese是以Meta-Llama-3-8B为底座，使用 DORA + LORA+ 的训练方法，在50w高质量中文多轮SFT数据 + 10w英文多轮SFT数据 + 2000单轮自我认知数据训练而来的大模型。
https://github.com/seanzhang-zhichen/llama3-chinese

Last synced: about 1 month ago
JSON representation

Llama3-Chinese是以Meta-Llama-3-8B为底座，使用 DORA + LORA+ 的训练方法，在50w高质量中文多轮SFT数据 + 10w英文多轮SFT数据 + 2000单轮自我认知数据训练而来的大模型。

Host: GitHub
URL: https://github.com/seanzhang-zhichen/llama3-chinese
Owner: seanzhang-zhichen
License: apache-2.0
Created: 2024-04-19T03:47:03.000Z (about 1 year ago)
Default Branch: main
Last Pushed: 2024-04-23T10:03:06.000Z (about 1 year ago)
Last Synced: 2024-10-29T00:32:14.466Z (6 months ago)
Language: Python
Homepage:
Size: 515 KB
Stars: 288
Watchers: 6
Forks: 20
Open Issues: 3
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

StarryDivineSky - seanzhang-zhichen/llama3-chinese - Chinese是以Meta-Llama-3-8B为底座，使用 DORA + LORA+ 的训练方法，在50w高质量中文多轮SFT数据 + 10w英文多轮SFT数据 + 2000单轮自我认知数据训练而来的大模型。 (A01_文本生成_文本对话 / 大语言对话模型及数据)

README

        


    中文&nbsp ｜ &nbspEnglish



















  

    

 Llama3-Chinese 




      

        

      

      

        

      

      


      

        

      

      

        

      





## Introduce

**Llama3-Chinese** is a large model trained on 500k high-quality Chinese multi-turn SFT data, 100k English multi-turn SFT data, and 2k single-turn self-cognition data, using the training methods of [DORA](https://arxiv.org/pdf/2402.09353.pdf) and [LORA+](https://arxiv.org/pdf/2402.12354.pdf) based on **Meta-Llama-3-8B** as the base.

**Github:** [https://github.com/seanzhang-zhichen/llama3-chinese](https://github.com/seanzhang-zhichen/llama3-chinese)

![DEMO](./images/web_demo.png)

## Download Model

| Model             | Download  |

|:-------------------:|:-----------:|

| Meta-Llama-3-8B        |[ 🤗 HuggingFace](https://huggingface.co/meta-llama/Meta-Llama-3-8B) [  🤖 ModelScope](https://modelscope.cn/models/LLM-Research/Meta-Llama-3-8B)|

| Llama3-Chinese-Lora           |[ 🤗 HuggingFace](https://huggingface.co/zhichen/Llama3-Chinese-Lora) [  🤖 ModelScope](https://modelscope.cn/models/seanzhang/Llama3-Chinese-Lora)|

| Llama3-Chinese (merged model)           |[ 🤗 HuggingFace](https://huggingface.co/zhichen/Llama3-Chinese) [  🤖 ModelScope](https://modelscope.cn/models/seanzhang/Llama3-Chinese)|

## Merge LORA Model (Skippable)

1、Download [Meta-Llama-3-8B](https://modelscope.cn/models/LLM-Research/Meta-Llama-3-8B)

```bash

git clone https://www.modelscope.cn/LLM-Research/Meta-Llama-3-8B.git

```

2、Download [Llama3-Chinese-Lora](https://www.modelscope.cn/models/seanzhang/Llama3-Chinese-Lora)

**From ModelScope**

```bash

git lfs install

git clone https://www.modelscope.cn/seanzhang/Llama3-Chinese-Lora.git

```

**From HuggingFace**

```bash

git lfs install

git clone https://huggingface.co/zhichen/Llama3-Chinese-Lora

```

3、Merge Model

```bash

python merge_lora.py \

    --base_model path/to/Meta-Llama-3-8B \

    --lora_model path/to/lora/Llama3-Chinese-Lora  \

    --output_dir ./Llama3-Chinese

```

## Download Llama3-Chinese (Merged Model)

**From ModelScope**

```bash

git lfs install

git clone https://www.modelscope.cn/seanzhang/Llama3-Chinese.git

```

**From HuggingFace**

```bash

git lfs install

git clone https://huggingface.co/zhichen/Llama3-Chinese

```

## Inference

```python

from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "zhichen/Llama3-Chinese"

tokenizer = AutoTokenizer.from_pretrained(model_id)

model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto", device_map="auto")

messages = [

    {"role": "system", "content": "You are a helpful assistant."},

    {"role": "user", "content": "你好"},

]

input_ids = tokenizer.apply_chat_template(

    messages, add_generation_prompt=True, return_tensors="pt"

).to(model.device)

outputs = model.generate(

    input_ids,

    max_new_tokens=2048,

    do_sample=True,

    temperature=0.7,

    top_p=0.95,

)

response = outputs[0][input_ids.shape[-1]:]

print(tokenizer.decode(response, skip_special_tokens=True))

```

## CLI DEMO

```bash

python cli_demo.py --model_path zhichen/Llama3-Chinese

```

## WEB DEMO

```bash

python web_demo.py --model_path zhichen/Llama3-Chinese

```

## VLLM WEB DEMO

1、Use [vllm](https://github.com/vllm-project/vllm) deploy model

```bash

python -m vllm.entrypoints.openai.api_server --served-model-name Llama3-Chinese --model ./Llama3-Chinese(Replace it with your own merged model path)

```

2、This command is executed on the CLI

```bash

python vllm_web_demo.py --model Llama3-Chinese

```

## Train Dataset

[deepctrl-sft-data](https://modelscope.cn/datasets/deepctrl/deepctrl-sft-data)

## LICENSE

This project can only be used for research purposes, and the project developer shall not bear any harm or loss caused by the use of this project (including but not limited to data, models, codes, etc.). For details, please refer to [DISCLAIMER](https://github.com/seanzhang-zhichen/Llama3-Chinese/blob/main/DISCLAIMER)。

The License agreement of the Llama3-Chinese project code is the [Apache License 2.0](./LICENSE). The code is free for commercial use, and the model weights and data can only be used for research purposes. Please attach a link to Llama3-Chinese and the licensing agreement in the product description.

## Citation

If you used Llama3-Chinese in your research, cite it in the following format:

```latex

@misc{Llama3-Chinese,

  title={Llama3-Chinese},

  author={Zhichen Zhang, Xin LU, Long Chen},

  year={2024},

  howpublished={\url{https://github.com/seanzhang-zhichen/llama3-chinese}},

}

```

## Acknowledgement

[meta-llama/llama3](https://github.com/meta-llama/llama3)




[hiyouga/LLaMA-Factory](https://github.com/hiyouga/LLaMA-Factory)

## Star History

[![Star History Chart](https://api.star-history.com/svg?repos=seanzhang-zhichen/Llama3-Chinese&type=Date)](https://star-history.com/#seanzhang-zhichen/Llama3-Chinese&Date)

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/seanzhang-zhichen/llama3-chinese

Awesome Lists containing this project

README

Llama3-Chinese