Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
[WACV 2022] "Sandwich Batch Normalization: A Drop-In Replacement for Feature Distribution Heterogeneity" by Xinyu Gong, Wuyang Chen, Tianlong Chen and Zhangyang Wang
- Host: GitHub
- URL: https://github.com/vita-group/sandwich-batch-normalization
- Owner: VITA-Group
- License: mit
- Created: 2021-02-16T02:39:54.000Z (almost 4 years ago)
- Default Branch: main
- Last Pushed: 2021-12-29T09:45:55.000Z (almost 3 years ago)
- Last Synced: 2023-03-04T14:58:51.705Z (over 1 year ago)
- Topics: adversarial-training, batch-normalization, gan, nas, neural-architecture-search, normalization, pytorch, sabn, sandwich-batch-normalization, style-transfer
- Language: Python
- Homepage: https://github.com/VITA-Group/Sandwich-Batch-Normalization
- Size: 21.9 MB
- Stars: 48
- Watchers: 9
- Forks: 5
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# Sandwich Batch Normalization: A Drop-In Replacement for Feature Distribution Heterogeneity
[![MIT licensed](https://img.shields.io/badge/license-MIT-brightgreen.svg)](LICENSE)
Code for [Sandwich Batch Normalization: A Drop-In Replacement for Feature Distribution Heterogeneity](https://arxiv.org/abs/2102.11382).
## Introduction
We present Sandwich Batch Normalization (SaBN), a simple drop-in improvement to Batch Normalization (BN) that requires only a few lines of code changes.

![method](imgs/architect.png)
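For intuition, here is a minimal PyTorch sketch of the idea, following the paper's formulation: a shared ("sandwich") affine transform applied to the normalized feature, followed by one of several parallel, independently learned affine transforms selected by a condition index (e.g. a class label or an architecture branch). The module name and signature below are illustrative, not this repo's API:

```python
import torch
import torch.nn as nn

class SandwichBatchNorm2d(nn.Module):
    """Sketch of SaBN: shared BN + shared sandwich affine,
    then a per-condition affine chosen by an integer index."""

    def __init__(self, num_features, num_conditions):
        super().__init__()
        # Shared normalization; its built-in affine plays the
        # role of the shared "sandwich" affine.
        self.bn = nn.BatchNorm2d(num_features, affine=True)
        # Parallel independent affines (gamma_i, beta_i), one per condition.
        self.gammas = nn.Embedding(num_conditions, num_features)
        self.betas = nn.Embedding(num_conditions, num_features)
        nn.init.ones_(self.gammas.weight)
        nn.init.zeros_(self.betas.weight)

    def forward(self, x, cond):
        # x: (N, C, H, W); cond: (N,) long tensor of condition indices.
        h = self.bn(x)
        gamma = self.gammas(cond).unsqueeze(-1).unsqueeze(-1)  # (N, C, 1, 1)
        beta = self.betas(cond).unsqueeze(-1).unsqueeze(-1)
        return gamma * h + beta
```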
We demonstrate the effectiveness of SaBN as a drop-in replacement in four tasks:
1. **conditional image generation**,
2. **neural architecture search**,
3. **adversarial training**,
4. **arbitrary neural style transfer**.

## Usage
Check each task's folder for more information:
1. [GAN](https://github.com/VITA-Group/Sandwich-Batch-Normalization/blob/main/GAN)
2. [NAS](https://github.com/VITA-Group/Sandwich-Batch-Normalization/blob/main/NAS)
3. [Adv](https://github.com/VITA-Group/Sandwich-Batch-Normalization/blob/main/Adv)
4. [NST](https://github.com/VITA-Group/Sandwich-Batch-Normalization/blob/main/NST)

## Main Results
### 1. Conditional Image Generation
Using SaBN in the conditional image generation task yields an immediate performance boost. Evaluation results on CIFAR-10 are shown below:

| Model | Inception Score ↑ | FID ↓ |
|------------------|-----------------|--------------|
| AutoGAN | 8.43 | 10.51 |
| BigGAN | 8.91 | 8.57 |
| SNGAN | 8.76 | 10.18 |
| **AutoGAN-SaBN** (ours) | 8.72 (+0.29) | 9.11 (−1.40) |
| **BigGAN-SaBN** (ours) | 9.01 (+0.10) | 8.03 (−0.54) |
| **SNGAN-SaBN** (ours) | 8.89 (+0.13) | 8.97 (−1.21) |
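As a usage illustration, a class-conditional generator block can route the class label to the per-class affine. This hypothetical snippet reuses the `SandwichBatchNorm2d` sketch from the introduction and only loosely mirrors the GAN code in this repo:

```python
import torch

# Hypothetical generator block on CIFAR-10 (10 classes):
# the label picks the per-class affine inside SaBN.
sabn = SandwichBatchNorm2d(num_features=256, num_conditions=10)
x = torch.randn(8, 256, 8, 8)        # feature map from the previous layer
labels = torch.randint(0, 10, (8,))  # class labels
out = torch.relu(sabn(x, labels))    # normalize -> shared affine -> class affine
```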
Visual results on ImageNet (128×128 resolution):

SNGAN | SNGAN-SaBN (ours)
:-------------------------:|:-------------------------:
![SNGAN](imgs/sngan_imagenet.png) | ![SNGAN-SaBN](imgs/sngan_sabn_imagenet.png)

### 2. Neural Architecture Search
We adopted DARTS as the baseline search algorithm. Results on NAS-Bench-201 are presented below:

| Method | CIFAR-100 (top1) | ImageNet16-120 (top1) |
|-------------------|:----------------:|:----------------:|
| DARTS | 44.05 ± 7.47 | 36.47 ± 7.06 |
| DARTS-SaBN (ours) | **71.56 ± 1.39** | **45.85 ± 0.72** |
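To suggest how this could look in code, below is a hedged sketch of a DARTS-style mixed edge in which all candidate operations share one SaBN (shared statistics and sandwich affine) while each selects its own independent affine via its candidate index. `MixedEdge` is illustrative, assumes every candidate emits `num_features` channels, and is not the implementation in the NAS subfolder:

```python
import torch
import torch.nn as nn

class MixedEdge(nn.Module):
    """Sketch: DARTS mixed edge with a shared SaBN whose
    per-condition affines correspond to candidate operations."""

    def __init__(self, candidate_ops, num_features):
        super().__init__()
        self.ops = nn.ModuleList(candidate_ops)
        self.sabn = SandwichBatchNorm2d(num_features, num_conditions=len(candidate_ops))
        # Architecture weights (alphas), softmax-relaxed as in DARTS.
        self.alpha = nn.Parameter(torch.zeros(len(candidate_ops)))

    def forward(self, x):
        weights = torch.softmax(self.alpha, dim=0)
        outs = []
        for k, op in enumerate(self.ops):
            idx = torch.full((x.size(0),), k, dtype=torch.long, device=x.device)
            outs.append(weights[k] * self.sabn(op(x), idx))
        return sum(outs)
```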
CIFAR-100 | ImageNet16-120
:-------------------------:|:-------------------------:
![CIFAR-100](imgs/DARTS_e35_cifar100.png) | ![ImageNet16-120](imgs/DARTS_e35_imagenet100.png)

### 3. Adversarial Training
Evaluation results (SA = standard accuracy, RA = robust accuracy). Clean branch:

| Evaluation | BN | AuxBN (clean branch) | SaAuxBN (clean branch) (ours) |
|:----------:|:-----:|:--------------------:|:----------------------:|
| Clean (SA) | 84.84 | 94.47 | **94.62** |

Adversarial branch:

| Evaluation | BN | AuxBN (adv branch) | SaAuxBN (adv branch) (ours) |
|:-----------:|:-----:|:------------------:|:--------------------:|
| Clean (SA) | **84.84** | 83.42 | 84.08 |
| PGD-10 (RA) | 41.57 | 43.05 | **44.93** |
| PGD-20 (RA) | 40.02 | 41.60 | **43.14** |
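SaAuxBN equips AuxBN's two branches (clean and adversarial) with the sandwich design. The sketch below is a hedged reading of the paper's description: separate normalization statistics per branch, one shared sandwich affine, and per-branch affines. `SaAuxBN2d` and its signature are illustrative, not this repo's code:

```python
import torch
import torch.nn as nn

class SaAuxBN2d(nn.Module):
    """Sketch of SaAuxBN: per-branch BN statistics, shared
    sandwich affine, then a per-branch affine."""

    def __init__(self, num_features, num_branches=2):
        super().__init__()
        # Separate running statistics per branch, no affine here.
        self.norms = nn.ModuleList(
            [nn.BatchNorm2d(num_features, affine=False) for _ in range(num_branches)]
        )
        # Shared sandwich affine.
        self.gamma_sa = nn.Parameter(torch.ones(1, num_features, 1, 1))
        self.beta_sa = nn.Parameter(torch.zeros(1, num_features, 1, 1))
        # Independent per-branch affines.
        self.gammas = nn.Parameter(torch.ones(num_branches, num_features))
        self.betas = nn.Parameter(torch.zeros(num_branches, num_features))

    def forward(self, x, branch):
        # branch: int, e.g. 0 = clean examples, 1 = adversarial examples.
        h = self.norms[branch](x)
        h = self.gamma_sa * h + self.beta_sa
        gamma = self.gammas[branch].view(1, -1, 1, 1)
        beta = self.betas[branch].view(1, -1, 1, 1)
        return gamma * h + beta
```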
### 4. Arbitrary Neural Style Transfer
The model equipped with the proposed SaAdaIN achieves lower style and content loss on both the training and validation sets.
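The sketch below shows one plausible reading of SaAdaIN: a learned shared (sandwich) affine inserted between the instance normalization and the style-derived affine of standard AdaIN. The function name and parameters are illustrative, not this repo's API:

```python
import torch

def sa_adain(content, style, gamma_sa, beta_sa, eps=1e-5):
    """Sketch of SaAdaIN. content, style: (N, C, H, W) feature maps;
    gamma_sa, beta_sa: learnable (1, C, 1, 1) sandwich-affine tensors."""
    # Instance-normalize the content feature.
    c_mean = content.mean(dim=(2, 3), keepdim=True)
    c_std = content.std(dim=(2, 3), keepdim=True) + eps
    normalized = (content - c_mean) / c_std
    # Shared sandwich affine.
    sandwiched = gamma_sa * normalized + beta_sa
    # Style-derived affine, as in standard AdaIN.
    s_mean = style.mean(dim=(2, 3), keepdim=True)
    s_std = style.std(dim=(2, 3), keepdim=True)
    return s_std * sandwiched + s_mean
```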
**Training loss**:
Training style loss | Training content loss
:-------------------------:|:-------------------------:
![st](imgs/st_losses.png) | ![ct](imgs/ct_losses.png)

**Validation loss**:
Validation style loss | Validation content loss
:-------------------------:|:-------------------------:
![val_st](imgs/val_st_losses.png) | ![val_ct](imgs/val_ct_losses.png)

## Citation
If you find this work useful in your research, please cite our paper:
```bibtex
@InProceedings{Gong_2022_WACV,
  title={Sandwich Batch Normalization: A Drop-In Replacement for Feature Distribution Heterogeneity},
  author={Gong, Xinyu and Chen, Wuyang and Chen, Tianlong and Wang, Zhangyang},
  booktitle={Winter Conference on Applications of Computer Vision (WACV)},
  year={2022}
}
```

## Acknowledgement
1. NAS codebase from [NAS-Bench-201](https://github.com/D-X-Y/AutoDL-Projects/blob/main/docs/NAS-Bench-201.md).
2. NST codebase from [AdaIN](https://github.com/naoto0804/pytorch-AdaIN).