Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/Yangsenqiao/vida

[ICLR 2024] ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation
https://github.com/Yangsenqiao/vida

Last synced: about 1 month ago
JSON representation

[ICLR 2024] ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation

Awesome Lists containing this project

README

        

# ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation
[**ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation**](https://arxiv.org/abs/2306.04344)

Jiaming Liu*, Senqiao Yang*, Peidong Jia, Ming Lu, Yandong Guo, Wei Xue, Shanghang Zhang

## Overview

Since real-world machine systems are running in non-stationary and continually changing environments, Continual Test-Time Adaptation (CTTA) task is proposed to adapt the pre-trained model to continually changing target domains. Recently, existing methods mainly focus on model-based adaptation, which aims to leverage a self-training manner to extract the target domain knowledge. However, pseudo labels can be noisy and the updated model parameters are uncertain under dynamic data distributions, leading to error accumulation and catastrophic forgetting in the continual adaptation process. To tackle these challenges and maintain the model plasticity, we tactfully design a Visual Domain Adapter (ViDA) for CTTA, explicitly handling both domain-specific and domain-agnostic knowledge. Specifically, we first comprehensively explore the different domain representations of the adapters with trainable high and low-rank embedding space. Then we inject ViDAs into the pre-trained model, which leverages high-rank and low-rank prototypes to adapt the current domain distribution and maintain the continual domain-shared knowledge, respectively. To adapt to the various distribution shifts of each sample in target domains, we further propose a Homeostatic Knowledge Allotment (HKA) strategy, which adaptively merges knowledge from each ViDA with different rank prototypes. Extensive experiments conducted on four widely-used benchmarks demonstrate that our proposed method achieves state-of-the-art performance in both classification and segmentation CTTA tasks. In addition, our method can be regarded as a novel transfer paradigm and showcases promising results in zero-shot adaptation of foundation models to continual downstream tasks and distributions.

## Installation

Please create and activate the following conda envrionment.
```bash
# It may take several minutes for conda to solve the environment
conda update conda
conda env create -f environment.yml
conda activate vida
```

## Classification Experiments
### ImageNet-to-ImageNetC task
We release the code of the baseline method based on CNN and vit.
* **CNN** as the backbone
```bash
cd imagenet
bash ./bash/source_cnn.sh # Source model directly test on target domain
bash ./bash/tent_cnn.sh # Tent
bash ./bash/cotta_cnn.sh # CoTTA
```
* **ViT** as the backbone

Our source model is from timm, you can directly donwload it from the code.

And our source ViDA model is [here](https://drive.google.com/file/d/1-ft1sUROp6nb57ToLf4USifUGUguFlJF/view?usp=sharing)
```bash
cd imagenet
bash ./bash/source_vit.sh # Source model directly test on target domain
bash ./bash/tent_vit.sh # Tent
bash ./bash/cotta_vit.sh # CoTTA
bash ./bash/vida_vit.sh # ViDA
```

### Cifar10-to-Cifar10C task
Please load the source model from [here](https://drive.google.com/file/d/1pAoz4Wwos74DjWPQ5d-6ntyjQkmp9FPE/view?usp=sharing)

And our source ViDA model is [here](https://drive.google.com/file/d/13R0ZoHaUUXL4PYHWWIwl2NvEcU_3JJxU/view?usp=sharing)
```bash
cd cifar
bash ./bash/cifar10/source_vit.sh # Source model directly test on target domain
bash ./bash/cifar10/tent_vit.sh # Tent
bash ./bash/cifar10/cotta_vit.sh # CoTTA
bash ./bash/cifar10/vida_vit.sh # ViDA
```

### Cifar100-to-Cifar100C task
Please load the source model from [here](https://drive.google.com/file/d/1yRekkpkIdwX_LFsOh4Ba9ndaECnY-UC-/view?usp=sharing)

And our source ViDA model is [here](https://drive.google.com/file/d/14-i94U8pyzKIR-YAzCpTYIeGGc-HUvbM/view?usp=sharing)
```bash
cd cifar
bash ./bash/cifar100/source_vit.sh # Source model directly test on target domain
bash ./bash/cifar100/tent_vit.sh # Tent
bash ./bash/cifar100/cotta_vit.sh # CoTTA
bash ./bash/cifar100/vida_vit.sh # ViDA
```

For segmentation code, you can refer to [cotta](https://github.com/qinenergy/cotta) and [SVDP](https://github.com/Anonymous-012/SVDP). As for the source model, you can directly use Segformer trained on Cityscapes.
## Citation
Please cite our work if you find it useful.
```bibtex
@article{liu2023vida,
title={ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation},
author={Liu, Jiaming and Yang, Senqiao and Jia, Peidong and Lu, Ming and Guo, Yandong and Xue, Wei and Zhang, Shanghang},
journal={arXiv preprint arXiv:2306.04344},
year={2023}
}
```

## Acknowledgement
+ CoTTA code is heavily used. [official](https://github.com/qinenergy/cotta)
+ KATANA code is used for augmentation. [official](https://github.com/giladcohen/KATANA)
+ Robustbench [official](https://github.com/RobustBench/robustbench)

## Data links
+ ImageNet-C [Download](https://zenodo.org/record/2235448#.Yj2RO_co_mF)