# LiteMedSAM

A lightweight version of MedSAM for fast training and inference. The model was trained in the following two stages:

- Stage 1. Distill a lightweight image encoder `TinyViT` from the MedSAM image encoder `ViT` by constraining their image embedding outputs to match (a minimal sketch of this objective follows the list)
- Stage 2. Replace the MedSAM image encoder `ViT` with `TinyViT` and fine-tune the whole pipeline
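For illustration, here is a minimal sketch of the Stage 1 distillation objective, assuming hypothetical `teacher_encoder` (the MedSAM `ViT`) and `student_encoder` (`TinyViT`) modules that map the same batch of images to embeddings of identical shape; the actual training code may differ.

```python
import torch
import torch.nn.functional as F

def distillation_step(teacher_encoder, student_encoder, images, optimizer):
    """One Stage 1 step: match student image embeddings to the frozen teacher's."""
    with torch.no_grad():
        target = teacher_encoder(images)  # MedSAM ViT embeddings (teacher, frozen)
    pred = student_encoder(images)        # TinyViT embeddings (student, trainable)
    loss = F.mse_loss(pred, target)       # impose identical embedding outputs
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```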

# Trained model checkpoint

- [The best model](https://pan.baidu.com/s/11Cs1hOmGBaPWtf3BBvFo8w?pwd=1111) can be downloaded from Baidu, or from the release section of this repository.
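After downloading, you can quickly check that the checkpoint loads. This is a generic sketch; the file name is an assumption based on the training commands below, and the save format may differ:

```python
import torch

# "lite_medsam.pth" is an assumption; point this at the downloaded file.
ckpt = torch.load("lite_medsam.pth", map_location="cpu")
keys = list(ckpt.keys())
print(f"Checkpoint loaded with {len(keys)} top-level keys, e.g. {keys[:3]}")
```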

# Sanity test

- Run the following command for a sanity test.

```bash
python CVPR24_LiteMedSAM_infer_v2.py -i test_demo/imgs/ -o test_demo/segs
```
We have improved the original code to skip the model conversion step and shorten the test time.
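To confirm that every input image produced a segmentation, a quick check such as the following can be used; the paths match the command above, and the `.npz` extension is an assumption about the demo data format:

```python
from pathlib import Path

imgs = sorted(Path("test_demo/imgs").glob("*.npz"))  # sanity-test inputs
segs = sorted(Path("test_demo/segs").glob("*.npz"))  # outputs written by the script
print(f"{len(segs)}/{len(imgs)} images segmented")
missing = {p.name for p in imgs} - {p.name for p in segs}
if missing:
    print("Missing outputs:", sorted(missing))
```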

# Installation

The official code was tested with: `Ubuntu 20.04` | Python `3.10` | `CUDA 11.8` | `PyTorch 2.1.2`
We have additionally tested with: `CentOS 7.9` | Python `3.10.13` | `CUDA 12.2` | `PyTorch 2.1.2`

1. Create a virtual environment `conda create -n medsam python=3.10 -y` and activate it `conda activate medsam`
2. Install [PyTorch 2.x](https://pytorch.org/get-started/locally/)
3. `git clone -b LiteMedSAM https://github.com/bowang-lab/MedSAM/`
4. Enter the MedSAM folder `cd MedSAM` and run `pip install -e .`
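After step 4, a quick environment check (generic, not part of the official instructions) can confirm that PyTorch sees the GPU:

```python
import torch

print(torch.__version__)          # expect 2.1.2 per the tested configurations
print(torch.cuda.is_available())  # True if the CUDA build found a GPU
if torch.cuda.is_available():
    print(torch.version.cuda)     # expect 11.8 or 12.2
```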

# Model Training

## Data preprocessing
1. Download the Lite-MedSAM [checkpoint](https://drive.google.com/file/d/18Zed-TUTsmr2zc5CHUWd5Tu13nb6vq6z/view?usp=sharing) and place it in the current directory.
2. The dataset is split into training and test sets at a ratio of 4:1.
3. Support for converting training data from `npz` to `npy` format was added to the training script `train_one_gpu.py` (a standalone sketch of this conversion is shown below).
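For illustration, here is a minimal sketch of the kind of conversion step 3 describes, assuming each `npz` archive stores `imgs` and `gts` arrays (the key names are assumptions; `train_one_gpu.py` contains the actual implementation):

```python
import numpy as np
from pathlib import Path

def npz_to_npy(npz_dir, npy_dir):
    """Unpack each .npz archive into separate .npy files for faster loading."""
    npy_dir = Path(npy_dir)
    (npy_dir / "imgs").mkdir(parents=True, exist_ok=True)
    (npy_dir / "gts").mkdir(parents=True, exist_ok=True)
    for npz_path in sorted(Path(npz_dir).glob("*.npz")):
        data = np.load(npz_path)
        np.save(npy_dir / "imgs" / f"{npz_path.stem}.npy", data["imgs"])
        np.save(npy_dir / "gts" / f"{npz_path.stem}.npy", data["gts"])
```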

## Loss function
1. `ShapeDistLoss` is newly introduced.
2. `AutomaticWeightedLoss` is added to adaptively adjust the weight of each loss term.

The definitions of `ShapeDistLoss` and `AutomaticWeightedLoss` can be found in `loss_op.py`.
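For reference, `AutomaticWeightedLoss` is commonly implemented as homoscedastic-uncertainty weighting with one learnable parameter per loss term; the sketch below shows that common pattern, but the definition in `loss_op.py` is authoritative:

```python
import torch
import torch.nn as nn

class AutomaticWeightedLossSketch(nn.Module):
    """Common uncertainty-weighting pattern; not necessarily identical to loss_op.py."""
    def __init__(self, num_losses):
        super().__init__()
        self.params = nn.Parameter(torch.ones(num_losses))

    def forward(self, *losses):
        total = 0.0
        for p, loss in zip(self.params, losses):
            # 1/(2*p^2) scales each loss; log(1 + p^2) keeps p from growing unbounded
            total = total + 0.5 / (p ** 2) * loss + torch.log(1 + p ** 2)
        return total
```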

## Single GPU

To train Lite-MedSAM on a single GPU, run:
```bash
python train_one_gpu.py \
-data_root data/MedSAM_train \
-pretrained_checkpoint lite_medsam.pth \
-work_dir work_dir \
-num_workers 4 \
-batch_size 4 \
-num_epochs 10
```

To resume interrupted training from a checkpoint, run:
```bash
python train_one_gpu.py \
-data_root data/MedSAM_train \
-resume work_dir/medsam_lite_latest.pth \
-work_dir work_dir \
-num_workers 4 \
-batch_size 4 \
-num_epochs 10
```

For additional command line arguments, see `python train_one_gpu.py -h`.
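For context, resuming typically restores the model weights, optimizer state, and epoch counter from the saved checkpoint; the following is a generic sketch with assumed key names (see `train_one_gpu.py` for the actual logic):

```python
import torch

def resume_from(checkpoint_path, model, optimizer):
    """Restore training state; the checkpoint key names here are assumptions."""
    ckpt = torch.load(checkpoint_path, map_location="cpu")
    model.load_state_dict(ckpt["model"])
    optimizer.load_state_dict(ckpt["optimizer"])
    return ckpt.get("epoch", 0) + 1  # epoch to resume from
```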

# Acknowledgements
We thank the authors of [MobileSAM](https://github.com/ChaoningZhang/MobileSAM) and [TinyViT](https://github.com/microsoft/Cream/tree/main/TinyViT) for making their source code publicly available.