https://github.com/WangRongsheng/MedQA-ChatGLM

🛰️ 基于真实医疗对话数据在ChatGLM上进行LoRA、P-Tuning V2、Freeze、RLHF等微调，我们的眼光不止于医疗问答
https://github.com/WangRongsheng/MedQA-ChatGLM

chatglm-6b chatgpt dataset fine-tuning freeze huggingface large-language-models llms lora medical rlhf transformer

Last synced: 9 months ago
JSON representation

🛰️ 基于真实医疗对话数据在ChatGLM上进行LoRA、P-Tuning V2、Freeze、RLHF等微调，我们的眼光不止于医疗问答

Host: GitHub
URL: https://github.com/WangRongsheng/MedQA-ChatGLM
Owner: WangRongsheng
Archived: true
Created: 2023-05-04T09:17:47.000Z (over 2 years ago)
Default Branch: main
Last Pushed: 2023-09-02T03:17:15.000Z (over 2 years ago)
Last Synced: 2024-05-22T08:04:33.753Z (over 1 year ago)
Topics: chatglm-6b, chatgpt, dataset, fine-tuning, freeze, huggingface, large-language-models, llms, lora, medical, rlhf, transformer
Language: Python
Homepage: https://www.wangrs.co/MedQA-ChatGLM/
Size: 20.7 MB
Stars: 283
Watchers: 5
Forks: 42
Open Issues: 3
Metadata Files:
- Readme: README.md
- Citation: CITATION.cff

Awesome Lists containing this project

StarryDivineSky - WangRongsheng/MedQA-ChatGLM - Tuning V2、Freeze、RLHF等微调 (A01_文本生成_文本对话 / 大语言对话模型及数据)
awesome-llm-and-aigc - MedQA-ChatGLM - ChatGLM?style=social"/> : 🛰️ 基于真实医疗对话数据在ChatGLM上进行LoRA、P-Tuning V2、Freeze、RLHF等微调，我们的眼光不止于医疗问答。[www.wangrs.co/MedQA-ChatGLM/](https://www.wangrs.co/MedQA-ChatGLM/). "MedQA-ChatGLM: A Medical QA Model Fine-tuned on ChatGLM Using Multiple fine-tuning Method and Real Medical QA Data". (Applications / 提示语（魔法）)

README

          > **Note**

> 

> **欢迎关注我们最新的工作：CareLlama (关怀羊驼)，它是一个医疗大语言模型，同时它集合了数十个公开可用的医疗微调数据集和开放可用的医疗大语言模型以促进医疗LLM快速发展：https://github.com/WangRongsheng/CareLlama**

# MedQA-ChatGLM ¹

![](./images/model.png)

¹ 使用的数据为[cMedQA2](https://github.com/zhangsheng93/cMedQA2)

# 资源

|项目|数据集|底座模型|

|:-|:-|:-|

|[ChatMed](https://github.com/michael-wzhu/ChatMed)|[Consult](https://huggingface.co/michaelwzhu/ChatMed-Consult) 包含50w+在线问诊+ChatGPT回复，TCM中医药诊疗数据集未公开|LLaMA-7B|

|[ChatDoctor](https://github.com/Kent0n-Li/ChatDoctor)|[HealthCareMagic-100k](https://drive.google.com/file/d/1lyfqIwlLSClhgrCutWuEe_IACNq6XNUt/view?usp=sharing) 包含100k+真实患者与医生对话数据集，[icliniq-10k](https://drive.google.com/file/d/1ZKbqgYqWc7DJHs3N9TQYQVPdDQmZaClA/view?usp=sharing) 包含10k+患者与医生对话数据集，[GenMedGPT-5k](https://drive.google.com/file/d/1nDTKZ3wZbZWTkFMBkxlamrzbNz0frugg/view?usp=sharing) 包含5k+由GPT生成的医患对话数据集|LLaMA-7B|

|[Med-ChatGLM](https://github.com/SCIR-HI/Med-ChatGLM)|[Huatuo-data](https://huggingface.co/datasets/wangrongsheng/Huatuo-data) 、[Huatuo-liver-cancer](https://huggingface.co/datasets/wangrongsheng/Huatuo-liver-cancer)|ChatGLM-6B|

|[Huatuo-Llama-Med-Chinese](https://github.com/SCIR-HI/Huatuo-Llama-Med-Chinese)|[Huatuo-data](https://huggingface.co/datasets/wangrongsheng/Huatuo-data) 、[Huatuo-liver-cancer](https://huggingface.co/datasets/wangrongsheng/Huatuo-liver-cancer)|LLaMA-7B|

|[DoctorGLM](https://github.com/xionghonglin/DoctorGLM)|[CMD.](https://huggingface.co/datasets/wangrongsheng/CMD-merged) 、[MedDialog](https://huggingface.co/datasets/wangrongsheng/MedDialog-1.1M) 、ChatDoctor项目数据集|ChatGLM-6B|

|[MedicalGPT-zh](https://github.com/MediaBrain-SJTU/MedicalGPT-zh)|数据未开源|ChatGLM-6B|

|[Dr.LLaMA](https://github.com/zguo0525/Dr.LLaMA)||LLaMA|

|[Medical_NLP](https://github.com/FreedomIntelligence/Medical_NLP) ²|-|-|

|[CMCQA](https://github.com/WENGSYX/CMCQA) ³|-|-|

|[QiZhenGPT](https://github.com/CMKRG/QiZhenGPT)|-|-|

|[LLM-Pretrain-FineTune](https://github.com/NLPxiaoxu/LLM-Pretrain-FineTune)|-|-|

|[PMC-LLaMA](https://github.com/chaoyi-wu/PMC-LLaMA)|-|LLaMA-7B|

|[BianQue](https://github.com/scutcyr/BianQue)|-|-|

|[medAlpaca](https://github.com/kbressem/medAlpaca)|-|LLaMA-7B|

|[MedicalGPT](https://github.com/shibing624/MedicalGPT)|-|-|

|[LLM-Pretrain-FineTune](https://github.com/X-jun-0130/LLM-Pretrain-FineTune)|-|-|

|[ShenNong-TCM-LLM](https://github.com/michael-wzhu/ShenNong-TCM-LLM)|-|-|

|[Sunsimiao](https://github.com/thomas-yanxin/Sunsimiao)|-|-|

|[CMLM-ZhongJing](https://github.com/pariskang/CMLM-ZhongJing)|-|-|

|[ZhongJing](https://github.com/SupritYoung/Zhongjing)|-|-|

|[Ming](https://github.com/MediaBrain-SJTU/MING)|-|-|

|[DISC-MedLLM](https://github.com/FudanDISC/DISC-MedLLM)|-|-|

- ² 为相关医学的大模型资源，请务必格外关注[FreedomIntelligence](https://github.com/FreedomIntelligence)

- ³ 来自中国医学对话问答网站春雨，在男科、耳科、妇产科等45个科室医学对话材料

- https://medical.chat-data.com/

- https://huggingface.co/datasets/shibing624/medical

# 使用

## 1. 安装环境

```python

pip install -r requirements.txt

```

## 2. 微调

### 2.1 LoRA

```python

CUDA_VISIBLE_DEVICES=0 python MedQA-ChatGLM/finetune.py \

                              --do_train \

                              --dataset merged-cMedQA \

                              --finetuning_type lora \

                              --output_dir ./med-lora \

                              --per_device_train_batch_size 32 \

                              --gradient_accumulation_steps 256 \

                              --lr_scheduler_type cosine \

                              --logging_steps 500 \

                              --save_steps 1000 \

                              --learning_rate 5e-5 \

                              --num_train_epochs 10.0 \

                              --fp16

```

### 2.2 Freeze微调

```python

CUDA_VISIBLE_DEVICES=0 python MedQA-ChatGLM/finetune.py \

                              --do_train \

                              --dataset merged-cMedQA \

                              --finetuning_type freeze \

                              --output_dir ./med-freeze \

                              --per_device_train_batch_size 32 \

                              --gradient_accumulation_steps 256 \

                              --lr_scheduler_type cosine \

                              --logging_steps 500 \

                              --save_steps 1000 \

                              --learning_rate 5e-5 \

                              --num_train_epochs 10.0 \

                              --fp16

```

### 2.3 P-Turning V2

```python

CUDA_VISIBLE_DEVICES=1 python MedQA-ChatGLM/finetune.py \

                              --do_train --dataset merged-cMedQA \

                              --finetuning_type p_tuning \

                              --output_dir ./med-p_tuning \

                              --per_device_train_batch_size 32 \

                              --gradient_accumulation_steps 256 \

                              --lr_scheduler_type cosine \

                              --logging_steps 500 \

                              --save_steps 1000 \

                              --learning_rate 5e-5 \

                              --num_train_epochs 10.0 \

                              --fp16

```

更多参数信息，可以查看[docs/参数详解.md](https://github.com/WangRongsheng/MedQA-ChatGLM/blob/main/docs/%E5%8F%82%E6%95%B0%E8%AF%A6%E8%A7%A3.md) .

多GPU分布式训练：

```python

# 配置分布式参数

accelerate config

# 分布式训练

accelerate launch src/finetune.py \

                  --do_train \

                  --dataset Huatuo,CMD,MedDialog,guanaco,cognition \

                  --finetuning_type lora \

                  --output_dir med-lora \

                  --per_device_train_batch_size 16 \

                  --gradient_accumulation_steps 4 \

                  --lr_scheduler_type cosine \

                  --logging_steps 10 \

                  --save_steps 1000 \

                  --learning_rate 5e-5 \

                  --num_train_epochs 3.0 \

                  --fp16 \

                  --ddp_find_unused_parameters False \ # 分布式训练时，LoRA微调需要添加防止报错

                  --plot_loss

```

## 3. 推理

### 3.1 可视化

```python

CUDA_VISIBLE_DEVICES=0 python MedQA-ChatGLM/web_demo.py \

                              --checkpoint_dir med-lora/

                                              (med-freez/)

                                              (med-p_tuning/)

```

### 3.2 命令行

```python

CUDA_VISIBLE_DEVICES=0 python MedQA-ChatGLM/infer.py \

                              --checkpoint_dir med-lora/

                                              (med-freez/)

                                              (med-p_tuning/)

```

## 4. 合并（可选）

合并模型：

```python

CUDA_VISIBLE_DEVICES=0 python MedQA-ChatGLM/export_weights.py \

                              --finetuning_weights_path ./med-lora \

                              --save_weights_path ./save_lora

```

加载合并模型：

```python

CUDA_VISIBLE_DEVICES=0 python MedQA-ChatGLM/load_export_weights.py \

                              --save_weights_path ./save_lora

```

# 结果

|微调方式|模型权重|训练时长|训练轮次|

|:-|:-|:-|:-|

|LoRA|[MedQA-ChatGLM-LoRA](https://huggingface.co/wangrongsheng/MedQA-ChatGLM-LoRA)|28h|10|

|P-Tuning V2|[MedQA-ChatGLM-PTuningV2](https://huggingface.co/wangrongsheng/MedQA-ChatGLM-PTuningV2)|27h|10|

|Freeze|[MedQA-ChatGLM-Freeze](https://huggingface.co/wangrongsheng/MedQA-ChatGLM-Freeze)|28h|10|

  训练设置

  
* 实验是在Linux系统，A100 (1X, 80GB)上进行的


# 免责声明

本项目相关资源仅供学术研究之用，严禁用于商业用途。使用涉及第三方代码的部分时，请严格遵循相应的开源协议。模型生成的内容受模型计算、随机性和量化精度损失等因素影响，本项目无法对其准确性作出保证。本项目数据集绝大部分由模型生成，即使符合某些医学事实，也不能被用作实际医学诊断的依据。对于模型输出的任何内容，本项目不承担任何法律责任，亦不对因使用相关资源和输出结果而可能产生的任何损失承担责任。

# 参考

1. https://github.com/zhangsheng93/cMedQA2

2. https://github.com/zhangsheng93/cMedQA

3. https://github.com/hiyouga/ChatGLM-Efficient-Tuning

4. https://github.com/jackaduma/ChatGLM-LoRA-RLHF-PyTorch

5. https://github.com/THUDM/ChatGLM-6B

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/WangRongsheng/MedQA-ChatGLM

Awesome Lists containing this project

README