Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/xionghonglin/DoctorGLM

基于ChatGLM-6B的中文问诊模型
https://github.com/xionghonglin/DoctorGLM

chatglm-6b

Last synced: 3 months ago
JSON representation

基于ChatGLM-6B的中文问诊模型

Host: GitHub
URL: https://github.com/xionghonglin/DoctorGLM
Owner: xionghonglin
Created: 2023-03-29T10:48:16.000Z (over 1 year ago)
Default Branch: main
Last Pushed: 2023-10-19T06:49:02.000Z (9 months ago)
Last Synced: 2024-01-22T14:29:31.907Z (5 months ago)
Topics: chatglm-6b
Language: Python
Homepage:
Size: 43.7 MB
Stars: 675
Watchers: 15
Forks: 68
Open Issues: 18
Metadata Files:
- Readme: README.md

Lists

awesome-LLMs-In-China - DoctorGLM
Awesome-Medical-Healthcare-Dataset-For-LLM - DoctorGLM - 6B的中文问诊模型，通过中文医疗对话数据集进行微调，实现了包括lora、p-tuningv2等微调及部署 | (Models / 英文)
Awesome-Domain-LLM - DoctorGLM
StarryDivineSky - xionghonglin/DoctorGLM - 6B的中文问诊模型 (文本生成、文本对话 / 类ChatGPT大语言对话模型及数据)

README

        


  

  


  

  


  [Project Page]

  [Arxiv]

  [最新博客]

  [在线体验]



# DoctorGLM

基于 ChatGLM-6B的中文问诊模型

## 最近更新

- (2023.4.18): P-Tuning & 多轮对话 & 模型可靠性提升

## 训练数据

| Dataset    | Department                | Language | Q&A  | Chat | Number | Syn. | Size  | Weight |

|------------|--------------------------|----------|------|------|--------|------|-------|-------|

| CMD.       | Surgical                 | CN       | ✔    | ×    | 116K   | ×    | 52MB  |       |

|            | Obstetrics and Gynecology| CN       | ✔    | ×    | 229K   | ×    | 78MB  |       |

|            | Pediatrics               | CN       | ✔    | ×    | 117K   | ×    | 47MB  |       |

|            | Internal Medicine        | CN       | ✔    | ×    | 307K   | ×    | 102MB |       |

|            | Andriatria               | CN       | ✔    | ×    | 113K   | ×    | 44MB  |       |

|            | Merged                   | CN       | ✔    | ×    | 1.9M   | ×    |       |Doctor_GLM/ckpt|

| MedDialog  | Multiple                 | CN&EN    | ✔    | ✔    | 3.4M   | ×    | 1.5GB |[ptuning_weight](https://pan.baidu.com/s/1Yf56egVGwI0XN2iOLcEGSQ?pwd=r4p0)   |

| ChatDoctor | Multiple                 | EN       | ✔    | ×    | 5.4K   | ✔    | 2.9MB |Coming soon    |

| HearlthcareMagic| Multiple            | EN       | ✔    | ×    | 200K   | ×    | 216MB |Coming soon    |

https://github.com/Toyhom/Chinese-medical-dialogue-data

## 使用

### lora

- 显存 >= 13G （未量化版本）

- pip install deep_training cpm_kernels icetk transformers>=4.26.1 

- torch >= 1.12.0 (icetk依赖cpu版torch, 建议先安装icetk后安装gpu版torch)

- lora的finetune代码来自 https://github.com/ssbuild/chatglm_finetuning

对于fp16模型，直接使用Doctor_GLM/chat_lora.ipynb，由于官方更新了chatglm的权重，我们将老版权重放在了

[old_pretrain_model](https://pan.baidu.com/s/1vuoBbOQVPJPAcurEfVRn7A?pwd=ahwc)

可以下载后解压到old_pretrain_model目录

量化的模型我们打了个包，使用方便，但是效果目前来看很成问题：INT4需要大约6G显存，INT8需要大约8G显存，在Doctor_GLM/chat_lora_quant.ipynb下使用

``` python

from load_quantization import load_int

tokenizer, model = load_int('DoctorGLM-6B-INT8-6merge-int8.pt',8)

response, history = model.chat(tokenizer,

                               "我爷爷高血压可以喝咖啡吗",

                               history=[],

                               max_length=2048)

print(response)

```

模型下载链接：

[INT4](https://pan.baidu.com/s/1nHQ1EQ2OBuWCyBZKBnBHYw?pwd=x6l4) [INT8](https://pan.baidu.com/s/1v2hWl1dPnh8xoJzxtpbugw?pwd=y4hu)

量化方法均为分层的线性量化。

目前量化模型的性能**仍有较大问题**，后期我们会对量化方法和模型进行更新

### p-tuningv2

官方提供了p-tuningv2的实现，新版本权重可以在hugging face上下载，也可以从我们的链接下载 [pretrain_model](https://pan.baidu.com/s/1WaG-NQeXVR7BNZs_zlUFmQ?pwd=h88g)   

p-tuningv2的权重在

[ptuning_weight](https://pan.baidu.com/s/1Yf56egVGwI0XN2iOLcEGSQ?pwd=r4p0) ， 下载后解压到ckpt/ptuningv2目录下, 然后使用Doctor_GLM/chat_ptuning_v2.ipynb，根据需要调整quantization_bit为4或8

## 模型在线部署

为了方便部署并随时调整模型生成回答时的参数，我们提供了基于 `Gradio` 库的部署代码，路径为 `Doctor_GLM/gradio.ipynb`。运行之后，访问本机的7860或者代码声明的其他端口即可以运行Demo，模型在生成回答时的参数可以由用户自由调控。若想让部署的模型可以被局域网之外的其他用户访问，需要将sharing设置为 `True`（默认为`False`）。部署之后运行效果如下所示：



  

  




## 最近更新

-  (2023.4.3) 初版的权重，来自LoRA SFT 1 epcoh

-  (2023.4.13) LoRA-INT4/8量化权重，以及我们实验发现LoRA一直会丢失对话能力，放弃该方式，转向P-Tuning

-  (2023.4.18) P-Tuning 多轮对话数据集训练的新权重和arxiv

## 即将到来的更新 

- [ ]  (2023.4.21) 对话中加入参考文献，模型上传到huggingface

第一次运行会下载chatGLM-6B权重, 如果已有chatGLM-6B权重可以将data_utils.py里的路径修改为自己的权重目录

## 结果示例



  

  




我们随机跑了100个结果，在 ./results目录下，两份json文件分别为由ChatGLM, DoctorGLM得到的结果，目前存在大量复读机。

## 引用

```

@article{xiong2023doctorglm,

  title={Doctorglm: Fine-tuning your chinese doctor is not a herculean task},

  author={Xiong, Honglin and Wang, Sheng and Zhu, Yitao and Zhao, Zihao and Liu, Yuxiao and Wang, Qian and Shen, Dinggang},

  journal={arXiv preprint arXiv:2304.01097},

  year={2023}

}

```