Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/google-research/maskgit
Official Jax Implementation of MaskGIT
https://github.com/google-research/maskgit
Last synced: about 1 month ago
JSON representation
Official Jax Implementation of MaskGIT
- Host: GitHub
- URL: https://github.com/google-research/maskgit
- Owner: google-research
- License: apache-2.0
- Archived: true
- Created: 2022-04-06T15:05:11.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2022-11-18T17:05:15.000Z (about 2 years ago)
- Last Synced: 2024-08-01T13:24:19.550Z (4 months ago)
- Language: Jupyter Notebook
- Homepage:
- Size: 8.9 MB
- Stars: 410
- Watchers: 17
- Forks: 48
- Open Issues: 13
-
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
Awesome Lists containing this project
- awesome-multi-modal - https://github.com/google-research/maskgit
- awesome-multi-modal - https://github.com/google-research/maskgit
README
# MaskGIT: Masked Generative Image Transformer
Official Jax Implementation of the CVPR 2022 Paper[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/maskgit-masked-generative-image-transformer/image-generation-on-imagenet-512x512)](https://paperswithcode.com/sota/image-generation-on-imagenet-512x512?p=maskgit-masked-generative-image-transformer)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/maskgit-masked-generative-image-transformer/image-generation-on-imagenet-256x256)](https://paperswithcode.com/sota/image-generation-on-imagenet-256x256?p=maskgit-masked-generative-image-transformer)[[Paper](https://arxiv.org/abs/2202.04200)] [[Project Page](https://masked-generative-image-transformer.github.io/)] [[Demo Colab](https://colab.research.google.com/github/google-research/maskgit/blob/main/MaskGIT_demo.ipynb)]
![teaser](imgs/teaser.png)
## Summary
MaskGIT is a novel image synthesis paradigm using a bidirectional transformer decoder. During training, MaskGIT learns to predict randomly masked tokens by attending to tokens in all directions. At inference time, the model begins with generating all tokens of an image simultaneously, and then refines the image iteratively conditioned on the previous generation.## Running pretrained models
Class conditional Image Genration models:
| Dataset | Resolution | Model | Link | FID |
| ------------- | ------------- | ------------- | ------------- | ------------- |
| ImageNet | 256 x 256 | Tokenizer | [checkpoint](https://storage.googleapis.com/maskgit-public/checkpoints/tokenizer_imagenet256_checkpoint)| 2.28 (reconstruction) |
| ImageNet | 512 x 512 | Tokenizer | [checkpoint](https://storage.googleapis.com/maskgit-public/checkpoints/tokenizer_imagenet512_checkpoint)| 1.97 (reconstruction) |
| ImageNet | 256 x 256 | MaskGIT Transformer |[checkpoint](https://storage.googleapis.com/maskgit-public/checkpoints/maskgit_imagenet256_checkpoint)| 6.06 (generation) |
| ImageNet | 512 x 512 | MaskGIT Transformer | [checkpoint](https://storage.googleapis.com/maskgit-public/checkpoints/maskgit_imagenet512_checkpoint) | 7.32 (generation) |You can run these models for class-conditional image **generation** and **editing** in the [demo Colab](https://colab.research.google.com/github/google-research/maskgit/blob/main/MaskGIT_demo.ipynb).
![teaser](imgs/class-conditional-teaser-small.png)
## Training
[Coming Soon]## BibTeX
```
@InProceedings{chang2022maskgit,
title = {MaskGIT: Masked Generative Image Transformer},
author={Huiwen Chang and Han Zhang and Lu Jiang and Ce Liu and William T. Freeman},
booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2022}
}
```## Disclaimer
This is not an officially supported Google product.