https://github.com/google-research/maskgit

Official Jax Implementation of MaskGIT
https://github.com/google-research/maskgit

Last synced: 9 months ago
JSON representation

Official Jax Implementation of MaskGIT

Host: GitHub
URL: https://github.com/google-research/maskgit
Owner: google-research
License: apache-2.0
Archived: true
Created: 2022-04-06T15:05:11.000Z (over 3 years ago)
Default Branch: main
Last Pushed: 2022-11-18T17:05:15.000Z (about 3 years ago)
Last Synced: 2024-11-04T13:38:12.803Z (about 1 year ago)
Language: Jupyter Notebook
Homepage:
Size: 8.9 MB
Stars: 439
Watchers: 17
Forks: 50
Open Issues: 13
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE

Awesome Lists containing this project

awesome-multi-modal - https://github.com/google-research/maskgit
Awesome-MIM - [Code

README

          # MaskGIT: Masked Generative Image Transformer

Official Jax Implementation of the CVPR 2022 Paper

[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/maskgit-masked-generative-image-transformer/image-generation-on-imagenet-512x512)](https://paperswithcode.com/sota/image-generation-on-imagenet-512x512?p=maskgit-masked-generative-image-transformer)

[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/maskgit-masked-generative-image-transformer/image-generation-on-imagenet-256x256)](https://paperswithcode.com/sota/image-generation-on-imagenet-256x256?p=maskgit-masked-generative-image-transformer)

[[Paper](https://arxiv.org/abs/2202.04200)] [[Project Page](https://masked-generative-image-transformer.github.io/)] [[Demo Colab](https://colab.research.google.com/github/google-research/maskgit/blob/main/MaskGIT_demo.ipynb)]

![teaser](imgs/teaser.png)

## Summary

MaskGIT is a novel image synthesis paradigm using a bidirectional transformer decoder. During training, MaskGIT learns to predict randomly masked tokens by attending to tokens in all directions. At inference time, the model begins with generating all tokens of an image simultaneously, and then refines the image iteratively conditioned on the previous generation. 

## Running pretrained models

Class conditional Image Genration models:

| Dataset  | Resolution | Model | Link | FID |

| ------------- | ------------- | ------------- | ------------- | ------------- |

| ImageNet  | 256 x 256 | Tokenizer | [checkpoint](https://storage.googleapis.com/maskgit-public/checkpoints/tokenizer_imagenet256_checkpoint)| 2.28 (reconstruction) |

| ImageNet  | 512 x 512 | Tokenizer | [checkpoint](https://storage.googleapis.com/maskgit-public/checkpoints/tokenizer_imagenet512_checkpoint)| 1.97 (reconstruction) |

| ImageNet  | 256 x 256 | MaskGIT Transformer |[checkpoint](https://storage.googleapis.com/maskgit-public/checkpoints/maskgit_imagenet256_checkpoint)| 6.06 (generation) |

| ImageNet  | 512 x 512 | MaskGIT Transformer | [checkpoint](https://storage.googleapis.com/maskgit-public/checkpoints/maskgit_imagenet512_checkpoint) | 7.32 (generation) |

You can run these models for class-conditional image **generation** and **editing** in the [demo Colab](https://colab.research.google.com/github/google-research/maskgit/blob/main/MaskGIT_demo.ipynb).

![teaser](imgs/class-conditional-teaser-small.png)

## Training

[Coming Soon]

## BibTeX

```

@InProceedings{chang2022maskgit,

  title = {MaskGIT: Masked Generative Image Transformer},

  author={Huiwen Chang and Han Zhang and Lu Jiang and Ce Liu and William T. Freeman},

  booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},

  month = {June},

  year = {2022}

}

```

## Disclaimer

This is not an officially supported Google product.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/google-research/maskgit

Awesome Lists containing this project

README