Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/mayankgrwl97/awesome-gans

Latest resources on Generative Adversarial Networks
https://github.com/mayankgrwl97/awesome-gans

List: awesome-gans

Last synced: 3 months ago
JSON representation

Latest resources on Generative Adversarial Networks

Awesome Lists containing this project

README

        

## [![Awesome](https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg)](https://github.com/mayankgrwl97/awesome-gans) GANs

### Courses
- [ ] [Deeplearning.ai Generative Adversarial Networks Specialization](https://www.coursera.org/specializations/generative-adversarial-networks-gans)
- [X] Build Basic Generative Adversarial Networks (GANs)
- [X] Build Better Generative Adversarial Networks (GANs)
- [ ] Apply Generative Adversarial Networks (GANs)
- [ ] [Deep Unsupervised Learning - Spring 2020 (UC Berkeley)](https://sites.google.com/view/berkeley-cs294-158-sp20/home)
- [ ] [YouTube Lectures](https://www.youtube.com/watch?v=V9Roouqfu-M&list=PLwRJQ4m4UJjPiJP3691u-qWwPGVKzSlNP)
- [ ] [Assignments](https://github.com/rll/deepul)
- [ ] [Deep Generative Models - Fall 2019 (Stanford)](https://deepgenerativemodels.github.io/)
- [ ] [Course Notes/Blog](https://deepgenerativemodels.github.io/notes/index.html)
- [ ] [Probability Review](https://ermongroup.github.io/cs228-notes/preliminaries/probabilityreview/)

### Conference Workshops
- [ ] NeurIPS 2020: [Machine Learning for Creativity and Design](https://neurips2020creativity.github.io/)
- [ ] ECCV 2020: [Perception Through Structured Generative Models](http://generativeperception.com/)
- [ ] CVPR 2020: [AI for Content Creation Workshop](http://visual.cs.brown.edu/workshops/aicc2020/)

### Tutorials and Blogs
- [ ] [Discovering Interpretable GAN controls ](https://www.youtube.com/watch?v=oIzwe_MOeQI) `00:35:02`
- [X] [StyleGAN v2: notes on training and latent space exploration (Fashion Dataset)](https://towardsdatascience.com/stylegan-v2-notes-on-training-and-latent-space-exploration-e51cf96584b3)
- [ ] [Improving Initialization for GAN inversion](https://github.com/rolux/stylegan2encoder/issues/2)
- [ ] [Making Anime Faces With StyleGAN[1/2] - Tips and tricks for effective StyleGAN use](https://www.gwern.net/Faces)
- [X] [From GAN basic to StyleGAN2](https://medium.com/analytics-vidhya/from-gan-basic-to-stylegan2-680add7abe82) [Explains - Perceptual Path Length]
- [X] [CVPR'20 iMLCV tutorial: Exploring and Exploiting Interpretable Semantics in GANs by Bolei Zhou](https://youtu.be/rfx3whKgFVo) `0:46:46`
- [ ] [Using Artificial Intelligence to Augment Human Intelligence](https://distill.pub/2017/aia/)
- [X] [Apple Machine Learning Blog](https://machinelearning.apple.com/research/gan)

### Training GANs with Limited Data
- [X] NeurIPS 2020: [StyleGAN2 Adaptive Discriminator Augmentation - Training Generative Adversarial Networks with Limited Data](https://arxiv.org/abs/2006.06676) [[Tensorflow Code](https://github.com/NVlabs/stylegan2-ada)] [[PDF](https://nvlabs-fi-cdn.nvidia.com/stylegan2-ada/ada-paper.pdf)] [[Unofficial Video](https://youtu.be/EnUDHQ98C9A)]
- [X] [Blog](https://medium.com/swlh/training-gans-with-limited-data-22a7c8ffce78)
- [ ] NeurIPS 2020: [Differentiable Augmentation for Data-Efficient GAN Training](https://arxiv.org/abs/2007.00653)

### GAN Architectures (StyleGAN, ProgressiveGAN types)
- [ ] NeurIPS 2020: [SariGAN - Semantic-aware-relative-importance GAN - Learning Semantic-aware Normalization for Generative Adversarial Networks](https://proceedings.neurips.cc/paper/2020/file/f885a14eaf260d7d9f93c750e1174228-Paper.pdf)
- [X] CVPR 2020: [A U-Net Based Discriminator for Generative Adversarial Networks](https://arxiv.org/abs/2002.12655) [[PyTorch Code](https://github.com/boschresearch/unetgan)] [[Video](https://www.youtube.com/watch?v=BR9C4p3W9vw&feature=emb_title)]
- [X] CVPR 2020: [StyleGAN2 - Analyzing and Improving the Image Quality of StyleGAN](https://arxiv.org/abs/1912.04958)
- [X] [From GAN basic to StyleGAN2](https://medium.com/analytics-vidhya/from-gan-basic-to-stylegan2-680add7abe82) [Explains - Perceptual Path Length]
- [ ] Pytorch Code Repositories - [rosinality/stylegan2-pytorch](https://github.com/rosinality/stylegan2-pytorch/), [lucidrains/stylegan2-pytorch](https://github.com/lucidrains/stylegan2-pytorch)
- [X] CVPR 2019: [StyleGAN - A Style-Based Generator Architecture for Generative Adversarial Networks](https://arxiv.org/abs/1812.04948)
- [X] ICLR 2018: [Progressive GAN - Progressive Growing of GANs for Improved Quality, Stability, and Variation](https://arxiv.org/abs/1710.10196)

### GAN Editing
- [ ] CVPR 2020: [Editing in Style: Uncovering the Local Semantics of GANs](https://openaccess.thecvf.com/content_CVPR_2020/papers/Collins_Editing_in_Style_Uncovering_the_Local_Semantics_of_GANs_CVPR_2020_paper.pdf) [[Video](https://www.youtube.com/watch?v=l2RATZjpzwI)]
- [ ] CVPR 2020: [Disentangled Image Generation Through Structured Noise Injection](https://openaccess.thecvf.com/content_CVPR_2020/papers/Alharbi_Disentangled_Image_Generation_Through_Structured_Noise_Injection_CVPR_2020_paper.pdf) [[Video](https://www.youtube.com/watch?v=vM8kpVr6UQY)] [[Tensorflow Code](https://github.com/yalharbi/StructuredNoiseInjection)]
- [ ] CVPR 2020: [The GAN that Warped: Semantic Attribute Editing with Unpaired Data](https://garoe.github.io/papers/warp_gan_2020.pdf)

### GAN Inversion: Inverting Real Faces to Latent Code (Image2StyleGAN types)
- [ ] ACCV 2020: [MagGAN: High-Resolution Face Attribute Editing with Mask-Guided Generative Adversarial Network](https://arxiv.org/abs/2010.01424)
- [ ] ECCV 2020: [Pix2latent - Transforming and Projecting Images into Class-conditional Generative Networks](https://minyoungg.github.io/pix2latent/)
- [ ] arxiv 2020: [Generative Hierarchical Features from Synthesizing Images](https://arxiv.org/abs/2007.10379)
- [X] SIGGRAPH 2020: [MichiGAN: Multi-Input-Conditioned Hair Image Generation for Portrait Editing](https://mlchai.com/publication/tan2020michigan/)
- [X] ECCV 2020: [StyleGAN2 Distillation for Feed-forward Image Manipulation](https://arxiv.org/abs/2003.03581)
- **Comments**: Create synthetic datasets using style mixing and linear interpolation. Train paired pix2pixhd on these synthetic datasets for tasks such as male->female, female->male, etc.
- [ ] ECCV 2020: [Rewriting a Deep Generative Model](https://rewriting.csail.mit.edu/)
- [ ] ECCV 2020: [Exploiting Deep Generative Prior for Versatile Image Restoration and Manipulation](https://arxiv.org/abs/2003.13659)
- [ ] ECCV 2020: [DeepLandscape: Adversarial Modeling of Landscape Videos](https://saic-mdal.github.io/deep-landscape/)
- [ ] CVPR 2020: [SEAN: Image Synthesis with Semantic Region-Adaptive Normalization](https://zpdesu.github.io/SEAN/)
- [ ] CVPR 2020: [Image Processing Using Multi-Code GAN Prior](https://arxiv.org/abs/1912.07116)
- [X] CVPR 2020: [Image2StyleGAN++: How to Edit the Embedded Images?](https://arxiv.org/abs/1911.11544)
- [ ] SIGGRAPH 2019: [Semantic Photo Manipulation with a Generative Image Prior](https://ganpaint.io/)
- [X] ICCV 2019: [Image2StyleGAN - How to Embed Images Into the StyleGAN Latent Space?](https://arxiv.org/abs/1904.03189)
- **Comments**: Introduce extended latent codes to embed real images that are different from the dataset on which GAN has been trained. Each layer of Generator receives different latent codes from this extend latent space. *Problem*: Overfitting to given image; doesn't very well support the manipulation. Resulting code might be outside the original latent domain (due to unconstrained optimization)

#### Flow based networks (Invertible by design)
- [ ] SIGGRAPH ASIA 2020: [StyleFlow: Attribute-conditioned Exploration of StyleGAN-Generated Images using Conditional Continuous Normalizing Flows](https://arxiv.org/abs/2008.02401)
- [ ] What are Normalizing Flows? [[Video]](https://youtu.be/i7LjDvsLWCg)
- [ ] NeurIPS 2018: [Glow: Generative Flow with Invertible 1x1 Convolutions](https://arxiv.org/abs/1807.03039)

#### Adding encoder to GAN generator (Reconstructions are not good)
- [X] ECCV 2020: [In-Domain GAN Inversion for Real Image Editing](https://arxiv.org/abs/2004.00049)
- **Comments** Novel Encoder training for GAN Inversion - Introduce adversarial loss for training encoder; Encoder trained using real images. Resolves out-of-domain image inversion (from Image2StyleGAN) by doing an Encoder-constrained Optimization (i.e. minimize distance between encoder predicted latent code and optimized latent code). They compute loss on both reconstructed image and the predicted latent code. Hence, we can use these latent codes for image editing. Check application: Semantic Diffusion
- [ ] CVPR 2020: [Adversarial Latent Autoencoders](https://arxiv.org/abs/2004.04467)
- [ ] NeurIPS 2019: [BigBiGAN - Large Scale Adversarial Representation Learning](https://arxiv.org/abs/1907.02544)

### Interpretability
#### Require supervision in form of off-the-shelf supervised classifiers
- [ ] ICLR 2020: [On the "steerability" of generative adversarial networks](https://ali-design.github.io/gan_steerability/)
- **Comments**: Explore correspondence of latent space trajectories in GANs to simple image transformations. Dataset biases limit the extent of transformations (e.g: can't convert red firetruck to blue firetruck by moving in the blueness direction in the latent space). Data augmentation and jointly training the walk trajectory and the generator weights imroves steerability, resulting in larger transformation effects.
- [ ] CVPR 2020: [InterFaceGAN - Interpreting the Latent Space of GANs for Semantic Face Editing](https://genforce.github.io/interfacegan/)
- **Comments**: Similar to HiGAN below, they use off-the-shelf image classifiers (like male/female, old/young, smile/no-smile, artifacts/no-artifacts) to find semantic boundaries in the latent space. Check for metrics to measure the disentanglement of faces.
- [ ] CVPRW 2020: [HiGAN - Semantic Hierarchy Emerges in Deep Generative Representations for Scene Synthesis](https://arxiv.org/abs/1911.09267)
- **Comments**: Investigates the causality between latent space vectors and generated image attributes/semantics. For normal GANs, they use off-the-shelf image classifiers (like cloud/no-cloud, lighting/no-lighting) to find semantic boundaries in the latent space. For StyleGAN-like architectures, where stochasticity/randomness is introduced at multiple layers, they find that by perturbing input latent vectors at different layer depths, different semantics are controlled: Layout -> Objects -> Attributes -> Color Schemes
- [ ] ICCV 2019: [GANalyze: Toward Visual Definitions of Cognitive Image Properties](http://ganalyze.csail.mit.edu/)
- **Comments**: Learn transformation in latent space (via a Transformer network) to improve memorability of generated images. Also check [MemNet](https://arxiv.org/abs/1708.02209)
- [ ] ICLR 2019: [GAN Dissection: Visualizing and Understanding Generative Adversarial Networks](https://gandissect.csail.mit.edu/), [Video](https://www.youtube.com/embed/yVCgUYe4JTM?rel=0&autoplay=1)
- **Comments**: It's a framework to interpret and label the internal units inside the Generator. Labels are associated by checking correlation of feature activations of individual units with the segmentation mask of the generated image

#### Unsupervised Attribute Discovery in GANs
- [ ] NeurIPS 2020: [GANSpace: Discovering Interpretable GAN Controls](https://arxiv.org/abs/2004.02546) [[Video](https://www.youtube.com/watch?v=oIzwe_MOeQI)]
- **Comments** Discover Interpretable latent space directions by performing PCA analysis on the latent activations
- [ ] ICML 2020: [Unsupervised Discovery of Interpretable Directions in the GAN Latent Space](https://arxiv.org/abs/2002.03754)
- **Comments**: They learn a set of directions in the latet space that induce "orthogonal" image transformations that are easy to distinguish from each other. E.g. -background blur+, -background removal+, -hair+, etc.

### Disentanglement of Variation Factors in Generative Models
- [ ] NeurIPS 2020: [Self-learning Transformations for Improving Gaze and Head Redirection](https://ait.ethz.ch/projects/2020/STED-gaze/)
- [ ] arxiv 2020: [Closed-Form Factorization of Latent Semantics in GANs](https://arxiv.org/abs/2007.06600)
- [ ] ECCV 2020: [The Hessian Penalty - A Weak Prior for Unsupervised Disentanglement](https://arxiv.org/abs/2008.10599)
- [ ] CVPR 2020: [DiscoFaceGAN - Disentangled and Controllable Face Image Generation via 3D Imitative-Contrastive Learning](https://openaccess.thecvf.com/content_CVPR_2020/papers/Deng_Disentangled_and_Controllable_Face_Image_Generation_via_3D_Imitative-Contrastive_Learning_CVPR_2020_paper.pdf)
- [ ] arxiv 2020: [Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation](https://arxiv.org/abs/2008.00951)
- [X] arxiv 2020: [Face Identity Disentanglement via Latent Space Mapping](https://arxiv.org/abs/2005.07728)

### Image to Image (Pix2Pix and CycleGAN types)
- [ ] NeurIPS 2020: [Swapping Autoencoder for Deep Image Manipulation](https://arxiv.org/abs/2007.00653)
- [ ] ECCV 2020: [Contrastive Learning for Unpaired Image-to-Image Translation](https://arxiv.org/abs/2007.15651)
- [ ] ECCV 2020: [Learning to Factorize and Relight a City](https://arxiv.org/abs/2008.02796)
- [ ] CVPR 2020: [StarGAN v2 - Diverse Image Synthesis for Multiple Domains](https://arxiv.org/abs/1912.01865)
- [X] ICLR 2020: [Mask Based Unsupervised Content Transfer](https://arxiv.org/abs/1906.06558)
- **Comments**: They tackle the problem of translating domain specific content (such as glasses/no-glasses, moustache/no-moustache) between two domains. Their proposed method disentangles the common and separate parts of these domains, and, through the generation of a mask, focuses the attention of the underlying network to the desired augmentation alone, without wastefully reconstructing the entire target.
- [ ] CVPR 2019: [GauGAN/SPADE - Semantic Image Synthesis with Spatially-Adaptive Normalization](https://nvlabs.github.io/SPADE/)
- **Comments**: Parallel work to pix2pixhd (superior results) - CHECK
- [ ] ICCV 2019: [Everybody dance now](https://arxiv.org/abs/1808.07371) [[Code](https://github.com/carolineec/EverybodyDanceNow)]
- [ ] ICCV 2019: [SinGAN - Learning a Generative Model from a Single Natural Image](https://arxiv.org/abs/1905.01164)
- [ ] CVPR 2018: [StarGAN - Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation](https://arxiv.org/abs/1711.09020)
- [X] CVPR 2018: [Pix2PixHD - High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs](https://arxiv.org/abs/1711.11585)
- [X] ICCV 2017: [CycleGAN - Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks](https://arxiv.org/abs/1703.10593)
- [X] CVPR 2017: [Pix2Pix - Image-to-Image Translation with Conditional Adversarial Networks](https://arxiv.org/abs/1611.07004)
- [ ] CVPR 2017: [SimGAN - Learning from Simulated and Unsupervised Images through Adversarial Training](https://arxiv.org/abs/1612.07828v2)
- [X] [Apple Blog](https://machinelearning.apple.com/research/gan)
- [ ] SURVEY, Aug'20: [Generative Adversarial Networks for Image and Video Synthesis: Algorithms and Applications](https://arxiv.org/abs/2008.02793)

### Improving GANs
- [ ] NeurIPS 2020: [Top-k Training of GANs: Improving GAN Performance by Throwing Away Bad Samples](https://arxiv.org/abs/2002.06224)

### Other Applications
- [ ] CVPR 2020: [VOGUE: Try-On by StyleGAN Interpolation Optimization](https://vogue-try-on.github.io/) [[Video](https://youtu.be/AWd7x_3GaZk)]
- [ ] arxiv 2020: [House-GAN: Relational Generative Adversarial Networks for Graph-constrained House Layout Generation](https://ennauata.github.io/housegan/page.html)
- [ ] IJCV 2020: [Compositional GAN: Learning Image-Conditional Binary Composition](https://arxiv.org/abs/1807.07560)
- [ ] CVPR 2020: [PSGAN: Pose and Expression Robust Spatial-Aware GAN for Customizable Makeup Transfer](https://arxiv.org/abs/1909.06956)
- [ ] CVPR 2020: [Learning to Simulate Dynamic Environments with GameGAN](https://nv-tlabs.github.io/gameGAN/)
- [ ] CVPR 2020: [Controllable Person Image Synthesis with Attribute-Decomposed GAN](https://arxiv.org/abs/2003.12267)
- [ ] WACV 2020: [TailorGAN: Making User-Defined Fashion Designs](https://arxiv.org/abs/2001.06427v2)
- [ ] SIGGRAPH 2020: [Unpaired Motion Style Transfer from Video to Animation](https://deepmotionediting.github.io/style_transfer)
- [ ] ICCV 2019: [InGAN: Capturing and Remapping the "DNA" of a Natural Image](https://arxiv.org/abs/1812.00231)
- [ ] SIGGRAPH 2018: [Non-Stationary Texture Synthesis by Adversarial Expansion](https://arxiv.org/abs/1805.04487)

### Quantitative Analysis
- [ ] CVPR 2019: [Perceptual Path Metric: StyleGAN](https://arxiv.org/abs/1812.04948)
- [ ] NeurIPS 2019: [Improved Precision and Recall Metric for Assessing Generative Models](https://arxiv.org/abs/1904.06991)
- [X] [Build Better Generative Adversarial Networks (GANs), Week 1](https://www.coursera.org/learn/build-better-generative-adversarial-networks-gans/home/week/1)
- [ ] NeurIPS 2018: [Assessing Generative Models via Precision and Recall](https://arxiv.org/abs/1806.00035)
- [ ] NeurIPS 2017: [Frechet Inception Distance](https://arxiv.org/abs/1706.08500)

### ====
- https://genforce.github.io/
- https://github.com/zhoubolei/awesome-generative-modeling/blob/master/README.md