https://github.com/wysnzzzz/DIT


Host: GitHub
URL: https://github.com/wysnzzzz/DIT
Owner: wysnzzzz
Created: 2024-02-01T07:48:58.000Z (over 1 year ago)
Default Branch: main
Last Pushed: 2024-11-15T14:27:26.000Z (7 months ago)
Last Synced: 2024-11-15T15:32:01.628Z (7 months ago)
Language: Python
Size: 12 MB
Stars: 12
Watchers: 1
Forks: 2
Open Issues: 1
Metadata Files:
- Readme: README.md


# DIT

[![Python](https://img.shields.io/badge/python-blue.svg)](https://www.python.org/)

![PyTorch](https://img.shields.io/badge/pytorch-%237732a8)



	



This is the official implementation of "Deep Instruction Tuning for Segment Anything Model", which proposes two simple yet effective deep instruction tuning (DIT) methods for text-guided SAM.

## News

- **2024.07.16: Our work has been accepted as a poster at ACM MM 2024.**

## Installation

```
pip install -r requirements.txt
wget https://github.com/explosion/spacy-models/releases/download/en_vectors_web_lg-2.1.0/en_vectors_web_lg-2.1.0.tar.gz -O en_vectors_web_lg-2.1.0.tar.gz
pip install en_vectors_web_lg-2.1.0.tar.gz
```
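
As a quick sanity check after installation, the sketch below tests whether the word-vector package can be found by Python. It assumes the tarball installs a top-level package named `en_vectors_web_lg`, which is how spaCy model archives are normally packaged. `package_installed` is a hypothetical helper and is not part of this repository.

```python
import importlib.util


def package_installed(name: str) -> bool:
    """Return True if a top-level package called `name` can be located."""
    return importlib.util.find_spec(name) is not None


if __name__ == "__main__":
    # Assumed package name, inferred from the tarball installed above.
    pkg = "en_vectors_web_lg"
    status = "found" if package_installed(pkg) else "MISSING"
    print(f"{pkg}: {status}")
```

If the package is reported as missing, re-running the last `pip install` command in the same environment used for training is a reasonable first step.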

## Training and Evaluation 

1. Prepare your settings. To train a model, modify ``./config/config.yaml`` to set the options you want.

2. Train the model. Run ``train.py`` from the repository root to start training:

```
python train.py --config ./config/config.yaml
```

3. Test the model. Once training has finished, run ``test.py`` with the path to the saved weights:

```
python test.py --eval-weights ./logs/dit/1/weights/seg_best.pth
```

4. Training log. Logs are stored in the ``./logs`` directory and record the detailed training curve and per-epoch accuracy. To log visualizations as well, set ``LOG_IMAGE`` to ``True`` in ``config.yaml``.
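
When several runs have been trained, finding the checkpoint to pass to ``test.py`` can be scripted. The sketch below searches ``./logs`` for the most recently modified ``seg_best.pth`` and prints the matching test command. It assumes checkpoints are saved under a ``weights`` subdirectory, as in the example path above; the actual layout may differ. `latest_best_weights` and `build_test_command` are hypothetical helpers and are not part of this repository.

```python
from pathlib import Path
from typing import List, Optional


def latest_best_weights(log_root: str = "./logs",
                        name: str = "seg_best.pth") -> Optional[Path]:
    """Return the most recently modified `name` under any `weights` folder.

    Assumes a layout like ./logs/dit/1/weights/seg_best.pth (taken from the
    README example); returns None if nothing matches.
    """
    candidates = list(Path(log_root).glob(f"**/weights/{name}"))
    if not candidates:
        return None
    return max(candidates, key=lambda p: p.stat().st_mtime)


def build_test_command(weights: Path) -> List[str]:
    """Build the evaluation command shown in step 3 for a given checkpoint."""
    return ["python", "test.py", "--eval-weights", str(weights)]


if __name__ == "__main__":
    weights = latest_best_weights()
    if weights is None:
        print("No seg_best.pth found under ./logs; train a model first.")
    else:
        print(" ".join(build_test_command(weights)))
```

Selecting by modification time is only one possible choice; picking a specific run number by hand is equally valid.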

## Model Weights

By following the data preparation and training steps, you can reproduce, or improve on, the results reported in our paper. We provide model weights for RefCOCO, RefCOCO+, RefCOCOg and GRES.

1. RefCOCO [Download link](https://drive.google.com/file/d/11gVgwnWI8c0m54gZFJIEcyYMzN7u9lmF/view?usp=sharing)

| val | test A | test B |
| - | - | - |
| 76.2 | 77.85 | 73.53 |

2. RefCOCO+ [Download link](https://drive.google.com/file/d/1T3jYiR9BLDJxvThYulySrDv0S2qonq1Z/view?usp=sharing)

| val | test A | test B |
| - | - | - |
| 65.94 | 69.78 | 58.89 |

3. RefCOCOg [Download link](https://drive.google.com/file/d/1HObuOQLv97NB3eD2X4ss2XsBlAifTc_L/view?usp=sharing)

| val | test |
| - | - |
| 67.4 | 68.07 |

4. GRES [Download link](https://drive.google.com/file/d/1v9dcxKwOQM8i2NKXi4YfITJ9ZO00XVur/view?usp=sharing)

| val | test A | test B |
| - | - | - |
| 63.76 | 67.19 | 61.85 |
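
The download links above are Google Drive share links rather than direct file URLs. For scripted downloads, the sketch below extracts the file ID from a share link and builds the commonly used `uc?export=download&id=...` form. This is an assumption about Google Drive's URL scheme, not something this repository provides. Drive may also interpose a confirmation page for large files, so a dedicated tool such as `gdown` may still be needed to fetch the checkpoint. `drive_file_id` and `direct_download_url` are hypothetical helpers.

```python
import re
from typing import Optional

# Matches share links of the form https://drive.google.com/file/d/<ID>/view
_DRIVE_FILE_RE = re.compile(r"drive\.google\.com/file/d/([A-Za-z0-9_-]+)")


def drive_file_id(share_url: str) -> Optional[str]:
    """Extract the file ID from a Google Drive share link, or None."""
    match = _DRIVE_FILE_RE.search(share_url)
    return match.group(1) if match else None


def direct_download_url(share_url: str) -> str:
    """Build a direct-download style URL (assumed Drive URL scheme)."""
    file_id = drive_file_id(share_url)
    if file_id is None:
        raise ValueError(f"Not a Google Drive file link: {share_url}")
    return f"https://drive.google.com/uc?export=download&id={file_id}"


if __name__ == "__main__":
    # RefCOCO link copied from the list above.
    refcoco = ("https://drive.google.com/file/d/"
               "11gVgwnWI8c0m54gZFJIEcyYMzN7u9lmF/view?usp=sharing")
    print(direct_download_url(refcoco))
```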

## Citation

```
@inproceedings{huang2024deep,
	title={Deep Instruction Tuning for Segment Anything Model},
	author={Xiaorui Huang and Gen Luo and Chaoyang Zhu and Bo Tong and Yiyi Zhou and Xiaoshuai Sun and Rongrong Ji},
	booktitle={ACM Multimedia 2024},
	year={2024}
}
```

## Acknowledgement

Thanks a lot for the nicely organized code from the following repos:

- [Segment Anything](https://github.com/facebookresearch/segment-anything/)
