https://github.com/vfdev-5/uda-pytorch

Unsupervised Data Augmentation experiments in PyTorch
https://github.com/vfdev-5/uda-pytorch

Last synced: about 1 year ago
JSON representation

Unsupervised Data Augmentation experiments in PyTorch

Host: GitHub
URL: https://github.com/vfdev-5/uda-pytorch
Owner: vfdev-5
Created: 2019-07-16T11:32:43.000Z (almost 7 years ago)
Default Branch: master
Last Pushed: 2019-07-22T09:43:42.000Z (almost 7 years ago)
Last Synced: 2025-04-30T10:11:49.352Z (about 1 year ago)
Language: Jupyter Notebook
Size: 478 KB
Stars: 59
Watchers: 6
Forks: 13
Open Issues: 1
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

          # Unsupervised Data Augmentation experiments in PyTorch

Experiments with "Unsupervised Data Augmentation" method on Cifar10 dataset.

Based on ["Unsupervised Data Augmentation"](https://arxiv.org/pdf/1904.12848.pdf)

## Unsupervised Data Augmentation in nutshell

![UDA](assets/uda.png)

## Requirements

All experiments are run using [`mlflow`](https://github.com/mlflow/mlflow), please install the latest version of this library

```

pip install --upgrade mlflow

```

## Experiments

### Start MLFlow UI server

Please create output folder (e.g. `$PWD/output`) and setup mlflow server:

```

export OUTPUT_PATH=/path/to/output

```

and 

```

mlflow server --backend-store-uri $OUTPUT_PATH/mlruns --default-artifact-root $OUTPUT_PATH/mlruns -p 5566 -h 0.0.0.0

```

MLflow dashboard is available in the browser at [0.0.0.0:5566](0.0.0.0:5566)

### CIFAR10 dataset

Create once "CIFAR10" experiment

```

export MLFLOW_TRACKING_URI=$OUTPUT_PATH/mlruns

mlflow experiments create -n CIFAR10

```

Implementation details:

- Models

  - FastResnet inspired from [cifar10-fast repository](https://github.com/davidcpage/cifar10-fast)

  - Wide-ResNet 28-2 from [Wide-ResNet repository](https://github.com/szagoruyko/wide-residual-networks/blob/master/pytorch/resnet.py)

- Consistency loss: KL

- Data augs: AutoAugment + Cutout

- Cosine LR decay

- Training Signal Annealing

- Updated UDA version: see [main_uda2.py](code/main_uda2.py)

  - training 4k batchs are also passed into unsupervised learning part

#### Fast ResNet

Start a single run

```

export MLFLOW_TRACKING_URI=$OUTPUT_PATH/mlruns

mlflow run experiments/ --experiment-name=CIFAR10 -P dataset=CIFAR10 -P network=fastresnet -P params="data_path=../input/cifar10;num_epochs=100;learning_rate=0.08;batch_size=512;TSA_proba_min=0.5;unlabelled_batch_size=1024"

```

#### Wide ResNet

Start a single run

```

export MLFLOW_TRACKING_URI=$OUTPUT_PATH/mlruns

mlflow run experiments/ --experiment-name=CIFAR10 -P dataset=CIFAR10 -P network=wideresnet -P params="data_path=../input/cifar10;num_epochs=100;learning_rate=0.1;batch_size=512;TSA_proba_min=0.1;unlabelled_batch_size=1024"

```

##### Paper's configuration

```

export MLFLOW_TRACKING_URI=$OUTPUT_PATH/mlruns

mlflow run experiments/ --experiment-name=CIFAR10 -P dataset=CIFAR10 -P network=wideresnet -P params="data_path=../input/cifar10;num_epochs=6250;learning_rate=0.03;batch_size=64;TSA_proba_min=0.1;unlabelled_batch_size=320;num_warmup_steps=20000"

```

Unfortunately, I can not reproduce paper's result with 5.3 test error.

#### Updated version of UDA

```

export MLFLOW_TRACKING_URI=$OUTPUT_PATH/mlruns

mlflow run experiments/ -e main_uda2 --experiment-name=CIFAR10 -P dataset=CIFAR10 -P network=fastresnet -P params="data_path=../input/cifar10;num_epochs=100;learning_rate=0.08;batch_size=512;unlabelled_batch_size=512"

```

#### Some results

![fastresnet_uda_vs_uda2](assets/fastresnet_uda_vs_uda2.png)

### Tensorboard 

All experiments are also logged to the Tensorboard. To visualize the experiments, please install `tensorboard` and run :

```

# tensorboard --logdir=$OUTPUT_PATH/mlruns/

tensorboard --logdir=$OUTPUT_PATH/mlruns/1

```

## Acknowledgements

In this repository we are using the code from 

- [DeepVoltaire/AutoAugment](https://github.com/DeepVoltaire/AutoAugment) 

- [cifar10-fast repository](https://github.com/davidcpage/cifar10-fast)

- [Wide-ResNet repository](https://github.com/szagoruyko/wide-residual-networks/blob/master/pytorch/resnet.py)

Thanks to the authors for sharing their code!

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/vfdev-5/uda-pytorch

Awesome Lists containing this project

README