https://github.com/buaadreamer/chinese-llava-med
Large Chinese Language-and-Vision Assistant for BioMedicine (a Chinese medical multimodal LLM)
- Host: GitHub
- URL: https://github.com/buaadreamer/chinese-llava-med
- Owner: BUAADreamer
- License: apache-2.0
- Created: 2024-05-08T02:40:56.000Z (over 1 year ago)
- Default Branch: master
- Last Pushed: 2024-05-22T11:25:13.000Z (over 1 year ago)
- Topics: ai, chinese, gpt4v, huggingface-datasets, llama-factory, llava, medical, minigpt4, mllm, multimodal, qwen1-5, transformers
- Language: Python
- Size: 2.26 MB
- Stars: 93
- Watchers: 1
- Forks: 6
- Open Issues: 4
Metadata Files:
- Readme: README.md
- License: LICENSE
# Chinese-LLaVA-Med
## Benchmark
| Method | [llava-med-zh-eval](https://huggingface.co/datasets/BUAADreamer/llava-med-zh-eval) Qwen Score |
| :----------------------------------------------------------: | :----------------------------------------------------------: |
| GPT-4 Ground Truth | 68.26 |
| [LLaVA-1.5-7B](https://huggingface.co/llava-hf/llava-1.5-7b-hf) | 53.13 |
| [Chinese-LLaVA-Med-7B](https://huggingface.co/BUAADreamer/Chinese-LLaVA-Med-7B) | **58.78** |

## Demo
Analyzing a tissue section

Analyzing a chest X-ray

Analyzing a CT scan

## Training your own Medical MLLM
### Dataset
| Dataset | Description |
| :----------------------------------------------------------: | :-------------------------: |
| [llava-med-zh-instruct-60k](https://huggingface.co/datasets/BUAADreamer/llava-med-zh-instruct-60k) | 60k instruction-tuning samples |
| [llava-med-zh-eval](https://huggingface.co/datasets/BUAADreamer/llava-med-zh-eval) | 115 evaluation samples |
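Both datasets are hosted on the Hugging Face Hub, so they can be inspected with the `datasets` library before training. A minimal sketch, assuming the default `train` split name; the printed schema is whatever the dataset defines, so no column names are assumed here:

```python
# Peek at the instruction-tuning and evaluation datasets on the HF Hub.
from datasets import load_dataset

train_ds = load_dataset("BUAADreamer/llava-med-zh-instruct-60k", split="train")
eval_ds = load_dataset("BUAADreamer/llava-med-zh-eval", split="train")

print(train_ds)           # row count and column names
print(train_ds.features)  # full schema, including the image column
print(eval_ds[0])         # one evaluation sample
```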
### Environment

```shell
# install LLaMA-Factory
git clone https://github.com/hiyouga/LLaMA-Factory.git
cd LLaMA-Factory
pip install -e ".[torch,metrics]"
```
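As a quick check that the editable install worked, assuming the package is importable under the name `llamafactory` (which is what the install above provides):

```python
# Sanity check: the editable install should be importable.
import llamafactory

print(llamafactory.__file__)  # should point into your LLaMA-Factory checkout
```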
### Finetuning

We recommend `full` fine-tuning, but you can also use the `lora` YAML config.
```shell
# full finetuning
CUDA_VISIBLE_DEVICES=0,1 python -m torch.distributed.run \
--nproc_per_node 2 \
--nnodes 1 \
--standalone \
../LLaMA-Factory/src/train.py config/llava1_5_full_sft.yaml

# export
# set your own export_hub_model_id and hf_hub_token in config/llava1_5_full_sft_export.yaml
CUDA_VISIBLE_DEVICES=0 llamafactory-cli export config/llava1_5_full_sft_export.yaml
```
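If the export succeeded, the checkpoint should load as a plain Transformers LLaVA model. A minimal sanity check, assuming the export directory is `models/llava1_5-7b-med` (the path the evaluation step below expects) and that the exported checkpoint follows the llava-hf format:

```python
# Sanity-check the exported checkpoint before evaluation.
from transformers import AutoProcessor, LlavaForConditionalGeneration

export_dir = "models/llava1_5-7b-med"  # adjust if your export_dir differs
processor = AutoProcessor.from_pretrained(export_dir)
model = LlavaForConditionalGeneration.from_pretrained(export_dir)
print(model.config.model_type)  # expect "llava"
```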
### Evaluation

```shell
# generate output results
python3 evaluation/generate_eval_content.py --model_name_or_path models/llava1_5-7b-med

# eval by qwen-1.5-14b-chat
python3 evaluation/eval_qwen_score.py --input_path outputs/llava_med_zh_eval_llava1_5-7b-med.json
```
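`eval_qwen_score.py` uses Qwen1.5-14B-Chat as an LLM judge to produce the Qwen Score reported in the benchmark above. The repository's exact prompt and rating scale live in that script; the sketch below only illustrates the LLM-as-judge pattern, with a hypothetical rating prompt:

```python
# Illustrative LLM-as-judge loop (hypothetical prompt, not the repo's).
from transformers import AutoModelForCausalLM, AutoTokenizer

judge_id = "Qwen/Qwen1.5-14B-Chat"
tokenizer = AutoTokenizer.from_pretrained(judge_id)
model = AutoModelForCausalLM.from_pretrained(judge_id, device_map="auto")

def judge(question: str, reference: str, answer: str) -> str:
    # Hypothetical rating prompt; the actual one is defined in eval_qwen_score.py.
    prompt = (
        f"Question: {question}\nReference answer: {reference}\n"
        f"Model answer: {answer}\n"
        "Rate the model answer from 0 to 100. Reply with a number only."
    )
    messages = [{"role": "user", "content": prompt}]
    inputs = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True, return_tensors="pt"
    ).to(model.device)
    output = model.generate(inputs, max_new_tokens=8)
    return tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True)
```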
### Inference

```shell
# with final model
llamafactory-cli webchat config/llava1_5_full_sft_infer.yaml
```
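Besides the web UI, the released checkpoint can be queried programmatically with Transformers. A minimal sketch, assuming `BUAADreamer/Chinese-LLaVA-Med-7B` follows the llava-hf checkpoint layout and the standard LLaVA-1.5 prompt template; the image path is a placeholder:

```python
# Programmatic inference with the released model (llava-hf style checkpoint).
from PIL import Image
from transformers import AutoProcessor, LlavaForConditionalGeneration

model_id = "BUAADreamer/Chinese-LLaVA-Med-7B"
processor = AutoProcessor.from_pretrained(model_id)
model = LlavaForConditionalGeneration.from_pretrained(model_id, device_map="auto")

image = Image.open("chest_xray.png")  # placeholder path
prompt = "USER: <image>\n分析这张胸片 ASSISTANT:"  # "analyze this chest X-ray"
inputs = processor(text=prompt, images=image, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=256)
print(processor.decode(output[0], skip_special_tokens=True))
```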