https://github.com/sshh12/cida

An unofficial refactor of the code used by the ICML 2020 paper "Continuously Indexed Domain Adaptation"
https://github.com/sshh12/cida

cida domain-adaptation extrapolation mnist pytorch

Last synced: about 1 month ago
JSON representation

An unofficial refactor of the code used by the ICML 2020 paper "Continuously Indexed Domain Adaptation"

Host: GitHub
URL: https://github.com/sshh12/cida
Owner: sshh12
License: mit
Created: 2021-04-20T23:48:10.000Z (about 5 years ago)
Default Branch: main
Last Pushed: 2021-06-09T02:25:15.000Z (about 5 years ago)
Last Synced: 2025-06-21T00:08:10.520Z (about 1 year ago)
Topics: cida, domain-adaptation, extrapolation, mnist, pytorch
Language: Python
Homepage:
Size: 53.7 KB
Stars: 1
Watchers: 2
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

          # Continuously Indexed Domain Adaptation (CIDA)

An unofficial refactor of the code used by the ICML 2020 paper [Continuously Indexed Domain Adaptation](http://wanghao.in/paper/ICML20_CIDA.pdf). [[Official Code]](https://github.com/hehaodele/CIDA).

## Install

```

$ pip install git+https://github.com/sshh12/cida.git --upgrade

```

## Usage

> The code should work for any dataset that serves data in the correct format (see [RotatedMNIST](https://github.com/sshh12/cida/blob/main/cida/datasets/rotated_mnist.py)), although I created this repo for my specific use case so some trivial/generic features may be missing. Fill free to create an issue/PR.

**Important Note**: Domain adaptation models are a bit different from other models in terms of the train/test datasets. Rather than being distinct, the CIDA model is trained on both the train and test sets (e.g. the `.fit()` DataLoaders should feed both) however the model ignores the labels for any examples where `is_train` is set to `False`. This allows the model to learn the continuous relationship between the train and test domains while also optimizing the classification/regression task given in the `is_train` data. At no point (except of course in evaluation) does the model see the labels for the test data.

### Classification (Rotated MNIST)

```python

from torch.utils.data import DataLoader

from cida.datasets import RotatedMNIST

from cida.conv_model import ConvPCIDAClassifier

import os

if __name__ == "__main__":

    # RotatedMNIST.download()

    dataset = RotatedMNIST(

        os.path.join("data", "MNIST", "processed", "training.pt"), rotate_range=(0, 360), train_range=(0, 45)

    )

    dataloader = DataLoader(

        dataset=dataset,

        shuffle=True,

        batch_size=100,

        num_workers=1,

    )

    val_dataloader = DataLoader(

        dataset=dataset,

        shuffle=True,

        batch_size=100,

        num_workers=1,

    )

    model = ConvPCIDAClassifier(

        classes=10, input_dims=28 * 28, domain_dims=1, domains_to_labels=RotatedMNIST.domains_to_labels, verbose=True

    )

    model = model.to("cpu")

    model.fit(dataloader, val_dataloader, epochs=100)

    print(model.predict(next(iter(val_dataloader))))

```

### Regression

```python

from torch.utils.data import DataLoader

from cida.regr_model import PCIDARegressor

import os

def mse(y_pred, y):

    return ((y_pred - y) ** 2).mean()

if __name__ == "__main__":

    dataset = MyRegressionDataset()

    dataloader = DataLoader(

        dataset=dataset,

        shuffle=True,

        batch_size=128,

        num_workers=1,

    )

    val_dataloader = DataLoader(

        dataset=dataset,

        shuffle=True,

        batch_size=128,

        num_workers=1,

    )

    model = PCIDARegressor(

        input_dims=1000,

        domain_dims=1,

        domains_to_labels=dataset.domains_to_labels,

        lr=3e-4,

        beta1=0.9,

        gamma=0.99,

        weight_decay=5e-4,

        lambda_gan=lambda epoch: 3.0,

        latent_size=64,

        test_domain_known=True,

        metrics={"mse": mse},

        verbose=True,

    )

    model = model.to("cuda")

    model.fit(dataloader, val_dataloader, epochs=100, save_metric="test_mse", save_fn="cida-best.pth")

```

## Reference

For citing the original paper:

```

@inproceedings{DBLP:conf/icml/WangHK20,

  author    = {Hao Wang and

               Hao He and

               Dina Katabi},

  title     = {Continuously Indexed Domain Adaptation},

  booktitle = {ICML},

  year      = {2020}

}

```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/sshh12/cida

Awesome Lists containing this project

README