https://github.com/azad-academy/personalized-diffusion

A Tutorial on Customized Image Generation by Fine-Tuning the Stable Diffusion Models


# Beyond Diffusion: What is Personalized Image Generation and How Can You Customize Image Synthesis?
## Personalized Image Generation by Fine-Tuning the Stable Diffusion Models


![title cover image](cover.jpg)



## Requirements

To run the notebook on your own PC instead of Google Colab, make sure you have sufficient hardware resources, in particular a GPU with enough memory. Set up a Conda environment with Python 3.10.6 and PyTorch > 1.16.
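Before launching the notebook locally, it can help to confirm what the environment actually provides. The following is a minimal sketch, not part of the original notebook: the helper name `check_environment` and the `min_python` default of `(3, 10)` are assumptions chosen for illustration. It only reports the installed PyTorch version and does not enforce one.

```python
import sys


def check_environment(min_python=(3, 10)):
    """Return a dict describing the local Python, PyTorch and GPU setup.

    `min_python` is an illustrative default, not a requirement taken
    from the notebook itself.
    """
    report = {
        "python_version": sys.version.split()[0],
        "python_ok": sys.version_info[:2] >= tuple(min_python),
        "torch_version": None,
        "cuda_available": False,
    }
    try:
        import torch  # optional: the check still works without PyTorch
    except ImportError:
        return report
    report["torch_version"] = torch.__version__
    report["cuda_available"] = torch.cuda.is_available()
    return report


if __name__ == "__main__":
    for key, value in check_environment().items():
        print(f"{key}: {value}")
```

If `cuda_available` is reported as `False`, the examples will most likely be impractically slow or fail, so it is worth fixing the GPU setup before going further.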

## Running The Notebook

* Open the notebook in Google Colab or on a local Jupyter server
* In Colab, make sure a GPU is selected for the runtime (Runtime -> Change runtime type -> GPU)
* Install the requirements
* Run the code in the example sections
* If you hit a GPU out-of-memory error, make sure the model from one example is cleared before running another. Alternatively, restart the runtime and run only that particular example instead of the whole notebook.
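Clearing a model between examples could look roughly like the sketch below. It is an illustration under assumptions rather than code from the notebook: `free_gpu_memory` is a hypothetical helper, and `pipe` in the usage comment stands for whatever variable holds the loaded model in a given example.

```python
import gc


def free_gpu_memory():
    """Run garbage collection and release PyTorch's cached GPU memory.

    Returns True if the CUDA cache was emptied, and False if PyTorch
    is not installed or no CUDA device is available.
    """
    # Collect unreachable Python objects first so their tensors are freed.
    gc.collect()
    try:
        import torch
    except ImportError:
        return False
    if not torch.cuda.is_available():
        return False
    # Hand cached, currently unused blocks back to the GPU driver.
    torch.cuda.empty_cache()
    return True


# Usage in a notebook cell (`pipe` is a hypothetical variable name):
#   del pipe
#   free_gpu_memory()
```

This only helps once every reference to the model has been dropped. If other variables or earlier cell outputs still point to it, the memory stays allocated, and restarting the runtime is the more reliable option.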

## The Tutorial 📃

On Medium:

https://azad-wolf.medium.com/beyond-diffusion-what-is-personalized-image-generation-and-how-can-you-customize-image-synthesis-26a89d5b335

On Substack:

https://azadwolf.substack.com/p/beyond-diffusion-what-is-personalized




## Object Customization using Textual Inversion

![TextualInversion Object image](ti-object.jpg)




## Object Customization using DreamBooth

![DreamBooth Object image](dreambooth.jpg)




## Style Capture & Generation using Textual Inversion

![TextualInversion Styles image](style-images.jpg)




## Style/Pose Transfer using Textual Inversion

![TextualInversion Styles image](ti-styles.jpg)



## References

[1] Jonathan Ho, Ajay Jain, Pieter Abbeel, "Denoising Diffusion Probabilistic Models", arXiv:2006.11239, 2020

[2] Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, Björn Ommer, "High-Resolution Image Synthesis with Latent Diffusion Models", arXiv:2112.10752, 2021

[3] Rinon Gal, Yuval Alaluf, Yuval Atzmon, Or Patashnik, Amit H. Bermano, Gal Chechik, Daniel Cohen-Or, "An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion", arXiv:2208.01618, 2022

[4] Nataniel Ruiz, Yuanzhen Li, Varun Jampani, Yael Pritch, Michael Rubinstein, Kfir Aberman, "DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation", arXiv:2208.12242, 2022