An open API service indexing awesome lists of open source software.

https://github.com/olaviinha/neuraltexttoimage

Colabs for text prompt steered image generators
https://github.com/olaviinha/neuraltexttoimage

ai-image big-sleep biggan colab dall-e dalle2 google-colab image-generation laion neural-network neural-networks openai rudalle stable-diffusion text2image txt2img vqgan

Last synced: 3 months ago
JSON representation

Colabs for text prompt steered image generators

Awesome Lists containing this project

README

        

# Colab notebooks for text-to-image generators

**❗️ This repository is not maintained since 2022, as closed-source state-of-the-art text-to-image solutions are now widely available for everyone.**

User-friendly Colab notebooks for various text prompt steered synthetic image generators. These notebooks are not actively maintained, though issues and pull requests are welcome.

**Available notebooks (latest to oldest):**

- [Stable Diffusion](#stable-diffusion-by-compvis-stability-ai-and-laion) (v1.4, v1.5, waifu diffusion)
- [DALL·E Mini (craiyon.ai)](#dalle-mini-craiyonai-by-boris-dayma)
- [Latent Diffusion LAION 400M](#latent-diffusion-laion-400m-by-compvis)
- [ruDALLE Malevich](#rudalle-malevich-by-sberbank-ai)
- [JAX CLIP Guided Diffusion](#jax-clip-guided-diffusion-by-nshepperd)
- [CLIP+VQGAN](#clipvqgan-by-katherine-crowson)
- [Latent Vision](#latent-vision-by-ryan-murdock)
- [Big Sleep](#big-sleep-by-phil-wang-more-precisely-this-fork)

---

These techniques are all more or less descendants of Big Sleep (2021), the original concept of turning text into imagery by using [CLIP](https://github.com/openai/CLIP) (OpenAI) to steer [BigGAN](https://arxiv.org/abs/1809.11096) (Andrew Brock et al.), as introduced by [Ryan Murdock](https://github.com/rynmurdock) in his [original notebook](https://colab.research.google.com/drive/1NCceX2mbiKOSlAd_o7IU7nA9UskKN5WR?usp=sharing). The technique has since branched and evolved into what we now know as Midjourney, Dall-E, Stable Diffusion, etc.

The point of this repository is to record the development history of image-generation AIs, and as long as they work, user-friendly and quick-to-run UIs for the different implementations. Some additional basic features have been added to some of the notebooks, such as saving all/selected progression images, saving progression video, batch processing, etc.

I do not use these notebooks on a regular basis, and hence I'm mostly unaware when some library gets updated and a notebook may no longer function; Please do raise an issue should any problems occur.

---

## Stable Diffusion by [CompVis](https://github.com/CompVis), [Stability AI](https://stability.ai) and [LAION](https://laion.ai)

This colab requires registering with [Hugging Face](https://huggingface.co) for login token and accepting model card terms to download required models [here](https://huggingface.co/CompVis/stable-diffusion-v1-4) and [here](https://huggingface.co/runwayml/stable-diffusion-v1-5).

[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/olaviinha/NeuralTextToImage/blob/main/Stable_Diffusion.ipynb?force_theme=dark)

![image](https://user-images.githubusercontent.com/50331907/186525426-33e89d0a-e5f1-4f7f-b347-64e3aec567ab.png)

_"Satanism in the 1900s"_

---

## DALL·E Mini (craiyon.ai) by [Boris Dayma](https://github.com/borisdayma)

This colab requires registering with [wandb](https://wandb.ai/) for an authentication key to download required models.

[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/olaviinha/NeuralTextToImage/blob/main/dalle_mini.ipynb?force_theme=dark)

![image](https://user-images.githubusercontent.com/50331907/179073528-935fe0d2-363e-438b-9706-3e4548dd7569.png)

_"Satanism in the 1900s"_

---

## Latent Diffusion LAION 400M by [CompVis](https://github.com/CompVis)

[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/olaviinha/NeuralTextToImage/blob/main/LAION_400M.ipynb?force_theme=dark)

![image](https://user-images.githubusercontent.com/50331907/163890889-a31370e6-e08d-4389-86af-947fe70a7140.png)

_"Satanism in the 1900s"_ (upscaled)

---

## ruDALLE Malevich by [Sberbank-AI](https://github.com/sberbank-ai)

[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/olaviinha/NeuralTextToImage/blob/main/ruDalle.ipynb?force_theme=dark)

![image](https://user-images.githubusercontent.com/50331907/159492943-572f7a52-83a9-4310-befb-ceb8473af77d.png)


_"Satanism in the 1900s"_

---

## JAX CLIP Guided Diffusion by [nshepperd](https://github.com/nshepperd)

Method originally by [Katherine Crowson](https://github.com/crowsonkb)

[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/olaviinha/NeuralTextToImage/blob/main/JAX_CLIP_Guided_diffusion_nshepperd.ipynb?force_theme=dark)

![image](https://user-images.githubusercontent.com/50331907/144711796-3933dbb1-d8a0-4656-812f-050ccb444118.png)


300 iterations of _"Satanism in the 1900s"_.

---

## CLIP+VQGAN by [Katherine Crowson](https://github.com/crowsonkb)

**Papers:** 1) [VQGAN by Patrick Esser et al](https://arxiv.org/abs/2012.09841), 2) [CLIP by OpenAI](https://arxiv.org/abs/2103.00020)

[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/olaviinha/NeuralTextToImage/blob/main/CLIP%2BVQGAN_crowsonkb.ipynb?force_theme=dark)

![image](https://user-images.githubusercontent.com/50331907/126226554-f53ae66d-21d9-428f-9c16-0a17bf8c0ef9.png)

400 iterations of _"Satanism in the 1900s"_.

---

## Latent Vision by [Ryan Murdock](https://github.com/rynmurdock)

**Papers:** 1) [VQGAN by Patrick Esser et al](https://arxiv.org/abs/2012.09841), 2) [CLIP by OpenAI](https://arxiv.org/abs/2103.00020)

[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/olaviinha/NeuralTextToImage/blob/main/LatentVision_rynmurdock.ipynb?force_theme=dark)

![image](https://user-images.githubusercontent.com/50331907/126967558-b68857b7-0792-4135-b535-687539a3d240.png)

400 iterations of _"Satanism in the 1900s"_.

---

## Big Sleep by [Phil Wang](https://github.com/lucidrains) (more precisely [this fork](https://github.com/olaviinha/big-sleep))

**Papers:** 1) [BigGAN by Andrew Brock et al](https://arxiv.org/abs/1809.11096), 2) [CLIP by OpenAI](https://arxiv.org/abs/2103.00020)

Original Big Sleep method.

[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/olaviinha/NeuralTextToImage/blob/main/BigSleep_lucidrains.ipynb?force_theme=dark)

![image](https://user-images.githubusercontent.com/50331907/126226982-815e6fa6-6d5b-4d99-b1b1-902261a88212.png)

400 iterations of _"Satanism in the 1900s"_.