Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/neverbiasu/awesome-portraits-style-transfer

An archive of studies related to portrait style transfer
https://github.com/neverbiasu/awesome-portraits-style-transfer

List: awesome-portraits-style-transfer

face-stylization style style-transfer styletransfer stylization

Last synced: about 1 month ago
JSON representation

An archive of studies related to portrait style transfer

Awesome Lists containing this project

README

        

# Awesome-Portraits-Style-Transfer [![Awesome](https://awesome.re/badge.svg)](https://awesome.re)

> An archive of studies related to portrait style transfer

## Table of Contents
+ [Papers](#Papers)
+ [Repositories](#Repositories)
+ [Datasets](#Datasets)

## Papers

### Based on GANs

| Date | Title | Publish | Paper | Code | Homepage | Recom |
|---------|---------------------------------------------------------------------------------------------------------------------|---------------------|--------------------------------------------------------------------------------------------|-----------------------------------------------------|-----------------------------------------------------|--------|
| 2024.03 | DoesFS: Deformable One-shot Face Stylization via DINO Semantic Guidance | CVPR 2024 | [[paper]](https://arxiv.org/pdf/2403.00459) | [[code]](https://github.com/zichongc/DoesFS) | [[homepage]](https://vcc.tech/research/2024/DoesFS) | ⭐️⭐️ |
| 2023.05 | MMFS: Multi-Modal Face Stylization with a Generative Prior | PG 2023 | [[paper]](https://arxiv.org/pdf/2305.18009) | [[code]](https://github.com/mmfs-paper/MMFS) | N/A | ⭐️⭐️ |
| 2023.03 | Fix the Noise: Disentangling Source Feature for Controllable Domain Translation | CVPR 2023 | [[paper]](https://arxiv.org/abs/2303.11545) | [[code]](https://github.com/LeeDongYeun/FixNoise) | N/A | ⭐️⭐️ |
| 2023.03 | StyleGANEX: StyleGAN-Based Manipulation Beyond Cropped Aligned Faces | ICCV 2023 | [[paper]](https://openaccess.thecvf.com/content/ICCV2023/papers/Yang_StyleGANEX_StyleGAN-Based_Manipulation_Beyond_Cropped_Aligned_Faces_ICCV_2023_paper.pdf) | [[code]](https://github.com/williamyang1991/StyleGANEX) | [[homepage]](https://www.mmlab-ntu.com/project/styleganex/) | ⭐️⭐️ |
| 2022.11 | DynaGAN: Dynamic Few-shot Adaptation of GANs to Multiple Domains | SIGGRAPH Asia 2022 | [[paper]](https://arxiv.org/pdf/2211.14554) | [[code]](https://github.com/blueGorae/DynaGAN) | [[homepage]](https://bluegorae.github.io/dynagan/) | ⭐️⭐️ |
| 2022.10 | TargetCLIP: Official PyTorch implementation of the paper Image-Based CLIP-Guided Essence Transfer | ECCV 2022 | [[paper]](https://arxiv.org/abs/2110.12427) | [[code]](https://github.com/hila-chefer/TargetCLIP) | [[homepage]](https://github.com/hila-chefer/TargetCLIP) | ⭐️⭐️ |
| 2022.09 | VToonify: Controllable High-Resolution Portrait Video Style Transfer | SIGGRAPH Asia 2022 | [[paper]](https://arxiv.org/pdf/2209.11224) | [[code]](https://github.com/williamyang1991/VToonify) | [[homepage]](https://www.mmlab-ntu.com/project/vtoonify/) | ⭐️⭐️ |
| 2022.07 | DCT-Net: Domain-Calibrated Translation for Portrait Stylization | SIGGRAPH 2022 (TOG) | [[paper]](https://arxiv.org/pdf/2207.02426) | [[code]](https://github.com/menyifang/DCT-Net) | [[homepage]](https://menyifang.github.io/projects/DCTNet/DCTNet.html) | ⭐️⭐️ |
| 2022.06 | GODA: Generalized One-shot Domain Adaptation of Generative Adversarial Networks | NeurIPS 2022 | [[paper]](https://arxiv.org/pdf/2209.03665) | [[code]](https://github.com/zhangzc21/Generalized-One-shot-GAN-adaptation) | N/A | ⭐️⭐️ |
| 2022.05 | MTG: Mind the Gap: Domain Gap Control for Single Shot Domain Adaptation for Generative Adversarial Networks | ICLR 2022 | [[paper]](https://arxiv.org/pdf/2110.08398) | [[code]](https://github.com/ZPdesu/MindTheGap) | [[homepage]](https://zpdesu.github.io/MindTheGap/) | ⭐️⭐️ |
| 2022.04 | Unsupervised Image-to-Image Translation with Generative Prior | CVPR 2022 | [[paper]](https://arxiv.org/pdf/2204.03641) | [[code]](https://github.com/williamyang1991/GP-UNIT) | [[homepage]](https://www.mmlab-ntu.com/project/gpunit/) | ⭐️⭐️ |
| 2022.03 | Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer | CVPR 2022 | [[paper]](https://arxiv.org/pdf/2203.13248.pdf) | N/A | [[homepage]](https://github.com/williamyang1991/DualStyleGAN) | ⭐️⭐️ |
| 2021.12 | SemanticStyleGAN: Learning Compositional Generative Priors for Controllable Image Synthesis and Editing | CVPR 2022 | [[paper]](https://arxiv.org/pdf/2112.02236) | [[code]](https://github.com/seasonSH/SemanticStyleGAN) | [[homepage]](https://semanticstylegan.github.io/) | ⭐️⭐️ |
| 2021.12 | BlendGAN: Implicitly GAN Blending for Arbitrary Stylized Face Generation | NeurIPS 2021 | [[paper]](https://arxiv.org/abs/2110.11728) | [[code]](https://github.com/onion-liu/BlendGAN) | [[homepage]](https://github.com/onion-liu/BlendGAN) | ⭐️ |
| 2021.09 | CariMe: Unpaired Caricature Generation with Multiple Exaggerations | TMM 2021 | [[paper]](https://ieeexplore.ieee.org/abstract/document/9454341) | [[code]](https://github.com/edward3862/CariMe-pytorch) | [[homepage]](https://github.com/edward3862/CariMe-pytorch) | ⭐️ |
| 2021.09 | StyleCariGAN: Caricature Generation via StyleGAN Feature Map Modulation | SIGGRAPH 2021 | [[paper]](https://wonjongg.me/StyleCariGAN/) | [[code]](https://github.com/wonjongg/StyleCariGAN) | [[homepage]](https://github.com/wonjongg/StyleCariGAN) | ⭐️ |
| 2021.09 | ReStyle: A Residual-Based StyleGAN Encoder via Iterative Refinement | ICCV 2021 | [[paper]](https://yuval-alaluf.github.io/restyle-encoder/) | [[code]](https://github.com/yuval-alaluf/restyle-encoder?tab=readme-ov-file) | [[homepage]](https://github.com/yuval-alaluf/restyle-encoder?tab=readme-ov-file) | ⭐️⭐️ |
| 2021.09 | SPatchGAN: A Statistical Feature Based Discriminator for Unsupervised Image-to-Image Translation | ICCV 2021 | [[paper]](https://arxiv.org/abs/2103.16219) | [[code]](https://github.com/NetEase-GameAI/SPatchGAN) | [[homepage]](https://github.com/NetEase-GameAI/SPatchGAN) | ⭐️ |
| 2021.07 | Making Robots Draw A Vivid Portrait In Two Minutes | IROS 2020 | [[paper]](https://ricelll.github.io/AiSketcher/) | [[code]](https://github.com/fei-aiart/AiSketcher) | [[homepage]](https://github.com/fei-aiart/AiSketcher) | ⭐️⭐️ |
| 2021.07 | Line Drawings for Face Portraits from Photos using Global and Local Structure based GANs | IEEE 2020 | [[paper]](https://github.com/yiranran/APDrawingGAN2) | [[code]](https://github.com/yiranran/APDrawingGAN2) | [[homepage]](https://github.com/yiranran/APDrawingGAN2) | ⭐️⭐️ |
| 2021.03 | OneShotCLIP: One-Shot Adaptation of GAN in Just One CLIP | TPAMI 2023 | [[paper]](https://arxiv.org/pdf/2203.09301) | [[code]](https://github.com/cyclomon/OneshotCLIP) | N/A | ⭐️⭐️ |
| 2020.10 | Fine-Tuning StyleGAN2 For Cartoon Face Generation | N/A | [[paper]](https://arxiv.org/abs/2106.12445) | [[code]](https://github.com/happy-jihye/Cartoon-StyleGAN) | [[homepage]](https://github.com/happy-jihye/Cartoon-StyleGAN) | ⭐️⭐️ |
| 2020.10 | WarpGAN: Automatic Caricature Generation | CVPR 2019 | [[paper]](https://arxiv.org/abs/1811.10100) | [[code]](https://github.com/seasonSH/WarpGAN) | [[homepage]](https://github.com/seasonSH/WarpGAN) | ⭐️⭐️ |
| 2019.06 | APDrawingGAN: Generating Artistic Portrait Drawings from Face Photos with Hierarchical GANs | CVPR 2019 | [[paper]](https://openaccess.thecvf.com/content_CVPR_2019/html/Yi_APDrawingGAN_Generating_Artistic_Portrait_Drawings_From_Face_Photos_With_Hierarchical_CVPR_2019_paper.html) | [[code]](https://github.com/yiranran/APDrawingGAN) | [[homepage]](https://apdrawing.github.io/) | ⭐️⭐️ |
| 2019.07 | A Style-aware Discriminator for Controllable Image Translation | IEEE 2022 | [[paper]](https://ieeexplore.ieee.org/document/9880454) | [[code]](https://github.com/kunheek/style-aware-discriminator) | [[homepage]](https://github.com/kunheek/style-aware-discriminator) | ⭐️ |
| 2017.03 | Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization | ICCV 2017 | [[paper]](https://arxiv.org/abs/1703.06868) | [[code]](https://github.com/xunhuang1995/AdaIN-style) | [[homepage]](https://github.com/xunhuang1995/AdaIN-style) | ⭐️⭐️ |

### Based on Diffusion Models

| Date | Title | Publish | Paper | Code | Homepage | Recom |
|---------|---------------------------------------------------------------------------------------------------------------------|---------------------|--------------------------------------------------------------------------------------------|-----------------------------------------------------|-----------------------------------------------------|--------|
| 2024.05 | Pair Customization: Customizing Text-to-Image Models with a Single Image Pair | ArXiv 2024 | [[paper]](https://arxiv.org/pdf/2405.01536) | [[code]](https://github.com/PairCustomization/PairCustomization) | [[homepage]](https://paircustomization.github.io/) | ⭐️⭐️ |
| 2024.04 | InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation | ArXiv 2024 | [[paper]](https://arxiv.org/pdf/2404.02733) | [[code]](https://github.com/InstantStyle/InstantStyle) | [[homepage]](https://instantstyle.github.io/) | ⭐️⭐️ |
| 2024.04 | Style-Booth: Image Style Editing with Multimodal Instruction | ArXiv 2024 | [[paper]](https://arxiv.org/pdf/2404.12154) | [[code]](https://github.com/modelscope/scepter) | [[homepage]](https://ali-vilab.github.io/stylebooth-page/) | ⭐️⭐️ |
| 2024.04 | B-LoRA: Implicit Style-Content Separation using B-LoRA | ArXiv 2024 | [[paper]](https://arxiv.org/pdf/2403.14572) | [[code]](https://github.com/yardenfren1996/B-LoRA) | [[homepage]](https://b-lora.github.io/B-LoRA/) | ⭐️⭐️ |
| 2024.03 | ZePo: Zero-Shot Portrait Stylization with Faster Sampling | ACM Multimedia 2024 | [[paper]](https://openreview.net/pdf?id=mYG7uEVlQd) | N/A | N/A | ⭐️⭐️ |
| 2024.01 | CreativeSynth: Creative Blending and Synthesis of Visual Arts based on Multimodal Diffusion | ArXiv 2024 | [[paper]](https://arxiv.org/pdf/2401.14066) | [[code]](https://github.com/haha-lisa/CreativeSynth) | N/A | ⭐️⭐️ |
| 2024.01 | FreeStyle: Free Lunch for Text-guided Style Transfer using Diffusion Models | ArXiv 2024 | [[paper]](https://arxiv.org/pdf/2401.15636) | [[code]](https://github.com/FreeStyleFreeLunch/FreeStyle) | [[homepage]](https://freestylefreelunch.github.io/) | ⭐️⭐️ |
| 2024.01 | InstantID: Zero-shot Identity-Preserving Generation in Seconds | ArXiv 2024 | [[paper]](https://arxiv.org/pdf/2401.07519) | [[code]](https://github.com/InstantID/InstantID) | [[homepage]](https://instantid.github.io/) | ⭐️⭐️ |
| 2023.12 | VisualStylePrompt: Visual Style Prompting with Swapping Self-Attention | ArXiv 2023 | [[paper]](https://arxiv.org/pdf/2402.12974) | N/A | [[homepage]](https://curryjung.github.io/VisualStylePrompt/) | ⭐️⭐️ |
| 2023.12 | StyleAligned: Style Aligned Image Generation via Shared Attention | CVPR 2024 | [[paper]](https://arxiv.org/pdf/2312.02133) | [[code]](https://github.com/google/style-aligned) | [[homepage]](https://style-aligned-gen.github.io/) | ⭐️⭐️ |
| 2023.12 | StyleID: Style Injection in Diffusion: A Training-free Approach for Adapting Large-scale Diffusion Models for Style Transfer | CVPR 2024 | [[paper]](https://arxiv.org/pdf/2312.09008) | [[code]](https://github.com/jiwoogit/StyleID) | [[homepage]](https://jiwoogit.github.io/StyleID_site/) | ⭐️⭐️ |
| 2023.12 | Portrait Diffusion: Training-free Face Stylization with Chain-of-Painting | ArXiv 2023 | [[paper]](https://arxiv.org/pdf/2312.02212) | [[code]](https://github.com/liujin112/PortraitDiffusion) | N/A | ⭐️⭐️ |
| 2023.11 | ProSpect: Prompt Spectrum for Attribute-Aware Personalization of Diffusion Models | SIGGRAPH Asia 2023 | [[paper]](https://arxiv.org/pdf/2305.16225) | [[code]](https://github.com/zyxElsa/ProSpect) | N/A | ⭐️⭐️ |
| 2023.11 | VCT: General Image-to-Image Translation with One-Shot Image Guidance | ICCV 2023 | [[paper]](https://arxiv.org/pdf/2307.14352) | [[code]](https://github.com/CrystalNeuro/visual-concept-translator) | N/A | ⭐️⭐️ |
| 2023.06 | Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation | SIGGRAPH Asia 2023 | [[paper]](https://arxiv.org/pdf/2306.07954.pdf) | [[code]](https://www.mmlab-ntu.com/project/rerender/) | [[homepage]](https://www.mmlab-ntu.com/project/rerender/) | ⭐️⭐️ |
| 2023.05 | Style-A-Video: Agile Diffusion for Arbitrary Text-based Video Style Transfer | ArXiv 2023 | [[paper]](https://arxiv.org/abs/2305.05464) | [[code]](https://github.com/haha-lisa/Style-A-Video) | [[homepage]](https://github.com/haha-lisa/Style-A-Video) | ⭐️⭐️ |
| 2023.03 | ZeCon: Zero-Shot Contrastive Loss for Text-Guided Diffusion Image Style Transfer | ICCV 2023 | [[paper]](https://arxiv.org/pdf/2303.08622.pdf) | [[code]](https://github.com/YSerin/ZeCon) | N/A | ⭐️⭐️ |
| 2023.02 | IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models | ArXiv 2023 | [[paper]](https://arxiv.org/pdf/2308.06721) | [[code]](https://github.com/tencent-ailab/IP-Adapter) | [[homepage]](https://ip-adapter.github.io/) | ⭐️⭐️ |
| 2023.01 | SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation | CVPR 2024 | [[paper]](https://arxiv.org/pdf/2312.16272) | [[code]](https://github.com/Xiaojiu-z/SSR_Encoder) | [[homepage]](https://ssr-encoder.github.io/) | ⭐️⭐️ |
| 2022.11 | InST: Inversion-Based Style Transfer with Diffusion Models | CVPR 2023 | [[paper]](https://arxiv.org/pdf/2211.13203) | [[code]](https://github.com/zyxElsa/InST) | N/A | ⭐️⭐️ |

### Others

| Date | Title | Publish | Paper | Code | Homepage | Recom |
|---------|---------------------------------------------------------------------------------------------------------------------|---------------------|--------------------------------------------------------------------------------------------|-----------------------------------------------------|-----------------------------------------------------|--------|
| 2021.12 | CLIPstyler: Image Style Transfer with a Single Text Condition | CVPR 2022 | [[paper]](https://arxiv.org/abs/2112.00374) | [[code]](https://github.com/cyclomon/CLIPstyler) | N/A | ⭐️⭐️ |
| 2021.09 | ArtFlow: Unbiased Image Style Transfer via Reversible Neural Flows | CVPR 2021 | [[paper]](https://arxiv.org/pdf/2103.16877) | [[code]](https://github.com/pkuanjie/ArtFlow) | N/A | ⭐️⭐️ |
| 2021.08 | AdaAttN: Revisit Attention Mechanism in Arbitrary Neural Style Transfer | ICCV 2021 | [[paper]](https://arxiv.org/pdf/2108.03647) | [[code]](https://github.com/Huage001/AdaAttN) | N/A | ⭐️⭐️ |
| 2018.06 | Avatar-Net: Multi-scale Zero-shot Style Transfer by Feature Decoration | CVPR 2018 | [[paper]](https://openaccess.thecvf.com/content_cvpr_2018/papers/Sheng_Avatar-Net_Multi-Scale_Zero-Shot_CVPR_2018_paper.pdf) | [[code]](https://lucassheng.github.io/avatar-net/) | [[homepage]](https://lucassheng.github.io/avatar-net/) | ⭐️⭐️ |

## Repositories

| Title | Code |
|----------|--------------------------------------------------------------------------------------|
| FreeG | [[code]](https://github.com/bryandlee/FreezeG) |
| AnimeGANv2 | [[code]](https://github.com/TachibanaYoshino/AnimeGANv2) |

## Datasets

| Name | Description | License | Download |
|-----------------------|-----------------------------------------------------------------------------------------------------------------------|--------------------------------------------------|-----------------------------------------------------------------------------------------------|
| Danbooru2017 | Danbooru2017 is a large-scale anime image database with 2.9m+ images annotated with 77.5m+ tags; it can be useful for machine learning purposes such as image recognition and generation | [CC0 1.0](https://creativecommons.org/publicdomain/zero/1.0/) | [Download](https://gwern.net/Danbooru2017#download) |
| Danbooru2018 | Danbooru2018 is a large-scale anime image database with 3.3m+ images annotated with 92.7m+ tags; it can be useful for machine learning purposes such as image recognition and generation | [CC0 1.0](https://creativecommons.org/publicdomain/zero/1.0/) | [Download](https://gwern.net/Danbooru2018#download) |
| Danbooru2019 | Danbooru2019 is a large-scale anime image database with 3.69m+ anime images and illustrations annotated with 108m+ tags; it can be useful for machine learning purposes such as image recognition and generation | [CC0 1.0](https://creativecommons.org/publicdomain/zero/1.0/) | [Download](https://gwern.net/Danbooru2019#download) |
| Danbooru2020 | Danbooru2020 is a large-scale anime image database with 4.2m+ anime images and illustrations annotated with 130m+ tags; it can be useful for machine learning purposes such as image recognition and generation | [CC0 1.0](https://creativecommons.org/publicdomain/zero/1.0/) | [Download](https://gwern.net/Danbooru2020#download) |
| Metaface | Metaface is a dataset containing high-quality face images for machine learning applications | N/A | [Download](https://github.com/NVlabs/metfaces-dataset) |
| Aligned Ukiyo-e Faces | Over five thousand aligned ukiyo-e faces at 1024x1024 pixel resolution | [CC BY-SA 4.0](https://creativecommons.org/licenses/by-sa/4.0/) | [Download](https://drive.google.com/file/d/1zEgVLrKVp8oCZuX0NENcAeh-kdaKJzNG/view?usp=sharing) |
| Cartoon Faces | A dataset of cartoon faces for machine learning applications | N/A | [Download](https://mega.nz/file/HslSXS4a#7UBanJTjJqUl_2Z-JmAsreQYiJUKC-8UlZDR0rUsarw) |