Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

awesome-stable-diffusion

Curated list of awesome resources for the Stable Diffusion AI Model.
https://github.com/awesome-stable-diffusion/awesome-stable-diffusion

  • Official Resources

  • Actively Maintained Forks and Containers

      • KerasCV StableDiffusion - High performance implementation of stable diffusion on KerasCV.
      • AUTOMATIC1111/stable-diffusion-webui - Likely the fastest-moving, most feature-rich branch at the moment. Gradio-based UI with extensive features such as inpainting and outpainting, previews, XY plots, upscaling, CLIP interrogation, textual inversion, negative prompting, training, checkpoint merging and switching, and more. Comes with a handy install script that takes care of most dependencies and add-ons.
      • bes-dev/stable_diffusion.openvino - A fork for running the model using a CPU compatible with OpenVINO.
      • fast-stable-diffusion - Speed focused fork with Dreambooth integration.
      • Gauss - Native MacOS Stable Diffusion App.
      • imaginAIry - Pythonic generation of stable diffusion images. Unique in that it supports complex text-based masking. Has an interactive CLI, upscaling, face enhancement, tiling, and other standard features. No GUI.
      • invoke-ai/InvokeAI - (formerly known as lstein/stable-diffusion) - Very active fork adding a conversational CLI, basic web interface and support for GFPGAN, ESRGAN, Codeformer, weighted prompts, prompt blending, negative prompting, img2img, tiling, [textual-inversion](https://textual-inversion.github.io/) as well as inference on Apple M1.
      • NickLucche/stable-diffusion-nvidia-docker - Multi (Nvidia) GPU capable docker setup of SD
      • replicate/cog-stable-diffusion - [Cog machine learning container](https://github.com/replicate/cog) of SD v1.4.
      • stable-diffusion-jupyterlab-docker - A Docker setup ready to go with Jupyter notebooks for Stable Diffusion.
      • runwayml/stable-diffusion - Stable Diffusion Branch by [RunwayML](https://runwayml.com) with specifically trained inpainting model for high quality inpainting.
      • DreamArtist - With just one training image DreamArtist learns the content and style in it, generating diverse high-quality images with high controllability. Embeddings of DreamArtist can be easily combined with additional descriptions, as well as two learned embeddings. (standalone version)
      • basujindal/stable-diffusion - "Optimized Stable Diffusion", a fork with dramatically reduced VRAM requirements through model splitting, enabling Stable Diffusion on lower-end graphics cards. Includes a Gradio web interface and support for weighted prompts.
    • Potentially Stale/Less active branches

      • neonsecret/stable-diffusion - Fork focusing on bigger resolutions with less VRAM at the expense of speed, automatically adjusting to the GPU's abilities. Also includes upscaling, facial restoration via CodeFormer and a [custom UI](https://github.com/neonsecret/stable-diffusion/blob/main/GUI_TUTORIAL.md).
  • Checkpoints and Weights

    • Potentially Stale/Less active branches

      • sd-2.1 Model Cards (768x768px) - Model Cards/Weights for Stable Diffusion 2.1 (diffusion, upscaling and inpainting checkpoints)
      • sd-2.1 Base Model Cards (512x512px) - Model Cards/Weights for Stable Diffusion 2.1 (diffusion, upscaling and inpainting checkpoints)
      • sd-2.0 Model Cards - Model Cards/Weights for Stable Diffusion 2.0 (diffusion, upscaling and inpainting checkpoints)
      • sd-v1-5 from RunwayML - Stable Diffusion 1.5 Checkpoint released by runwayML.
      • Official Model Card 1.5 - Official Stability AI Model Card on Hugging Face with all versions of the model. Download requires sign-in and acceptance of terms of service.
      • stable-diffusion-v-1-4-original.chkpt - The original 1.4 model's card
      • RealESRGAN Models - Download location for the latest RealESRGAN models required to use the upscaling features implemented by many forks. Different models exist for realistic and anime content. Please refer to the fork documentation to identify the ones you need.
      • sd-v1-5-inpainting from RunwayML - Checkpoint optimized for inpainting on SD 1.5, released by runwayML.
  • Online Demos and Notebooks

  • Complementary Models and Tools

    • Potentially Stale/Less active branches

      • Civitai Model Repository - Repository for custom models and textual inversion embeddings.
      • sd-concepts-library - A library of user created [textual-inversion](https://textual-inversion.github.io/) embeddings to add new concepts to stable diffusion
      • Prompt to Prompt - Unofficial Implementation of Cross-attention-control for prompt to prompt image editing.
      • sd-prompt-graph - This is a React-based curve editor GUI for prompt interpolation animations made with Stable Diffusion.
      • DAAM - Diffusion attention attribution maps, generating heatmaps modelling the impact of specific terms and tokens in the prompt on the final diffusion result.
  • Training

    • GUIS

      • Auto SD Workflow - A UI for [lstein/stable-diffusion](https://github.com/lstein/stable-diffusion)'s dream.py with optimized UX for large-scale/production workflow around image synthesis. [Video Walkthrough](https://vimeo.com/748114237).
      • KIRI.ART - Mobile-first PWA with multiple models (incl. waifu diffusion). Run free locally or use free & paid credits on the live site. Built with React + MaterialUI. ([Source Code](https://github.com/gadicc/stable-diffusion-react-nextjs-mui-pwa)) `MIT License` `TypeScript`
      • NMKD GUI - Windows UI, fully featured. Closed source. Pick your own price.
      • sd-webui/stable-diffusion-webui - Very active fork with optional, highly featureful Gradio UI and support for txt2img, img2img inpainting, GFPGAN, ESRGAN, weighted prompts, optimized low memory version, optional [textual-inversion](https://textual-inversion.github.io/) and more.
      • Synesthetic - Prompt search engine and generator focused on remixing
      • Carefree Creator (local version) - User friendly GUI with a creator/artist centric workflow.
      • DiffusionBee - Self contained binary app for MacOS.
      • DiffusionUI - Web UI made with Vue.js, inspired by DALL-E and built on [diffusers](https://github.com/huggingface/diffusers); well suited to inpainting. [Video demo](https://www.youtube.com/watch?v=AFZvW5qURes)
      • Stable Diffusion Infinity - A proof of concept for outpainting with an infinite canvas interface. Requires a powerful GPU.
      • Unstable Fusion - A Stable Diffusion desktop frontend with inpainting, img2img and more
      • stable-diffusion-webui-docker - A docker based frontend integrating the most popular forks.
      • cmdr2/stable-diffusion-ui - Another simple-to-use UI for Windows and Linux.
    • Upscaling

      • Upscale Wiki Model Database - Wiki and model database for checkpoints compatible with ESRGAN et al.
      • BasicSR - Open-source upscaling and restoration toolbox supporting several models.
      • BSRGAN - Another upscaling solution, specialized in degraded images.
      • Cupscale - GUI for running various upscaling models, including ESRGAN and RealESRGAN.
      • RealESRGAN - ESRGAN Upscaling (2x, 4x) and content restoration. Python: ```pip install realesrgan```
      • jquesnelle/txt2imghd - A port of the GOBIG mode from progrockdiffusion, providing high quality upscaling on top of txt2img.
    • Task Chaining

      • chaiNNer - Graphical node-based editor for chaining image processing tasks.
      • ai-art-generator - AI art generation suite combining Stable Diffusion and other models for high volume art generation.
      • dfserver - Hosted distributed GPU cluster for running Stable Diffusion and various other AI image or prompt-building models.
    • Prompt Building

      • Stable diffusion prompt book - OpenArt's Stable Diffusion prompt book, a very comprehensive resource on prompt engineering.
      • ai-art.com/modifiers - A visual reference guide for keywords.
      • aipromptguide.com - Visual database of styles, modifiers, artists and persons.
      • aiTransformer Prompt Builder - Help build text prompts intuitively using an Image To Text tool to generate detailed text descriptions from image, and 800+ special Textual Inversion terms and often-used generic terms with sample images.
      • krea.ai - Prompt search engine that also recommends similar prompts to the one that you click on.
      • Lexica.art - A searchable, visual database of images and the prompt settings used to create them.
      • Prompthero
      • PromptoMania - A visual prompt construction tool.
      • Stable Diffusion Prompt Generator - Gives suggestions for improving a given text prompt.
      • PromptSearch - Yet another Stable Diffusion search engine, but with a public API.
      • Same Energy - A visual search engine that returns images that have the same 'energy'.
      • PublicPrompts - *Collection* of PublicPrompts
      • pharmapsychotic/clip-interrogator - Jupyter notebook that uses CLIP models to suggest a prompt for images similar to a given image ([Demo](https://replicate.com/methexis-inc/img2prompt)).
      • rom1504/clip-retrieval - Searches for prompt keywords in the datasets used in training Stable Diffusion and other models ([Online GUI](https://rom1504.github.io/clip-retrieval/)). Some GUIs like Automatic1111 include this functionality.
      • Phraser - A visual prompt builder drawing on a database of examples. (Requires account creation)
    • Content Restoration

      • lama-cleaner - Content aware AI inpainting tool useful for removing unwanted objects or defects from images. Python: ```pip install lama-cleaner```
      • GFPGAN - Face Restoration GAN included in several forks for automatically fixing the face deformation commonly found in SD output.
      • CodeFormer - Another Face Restoration model ([Paper](https://arxiv.org/abs/2206.11253)).
    • Potentially Stale/Less active branches

      • Dreambooth-Stable-Diffusion - Implementation of [Google's DreamBooth](https://arxiv.org/abs/2208.12242) for stable diffusion, allowing fine-tuning of the model for specific concepts.
      • LORA - Low-rank Adaptation for faster text-to-image diffusion fine-tuning (UNET+CLIP), a faster, more efficient fine-tuning method requiring less RAM than Dreambooth. [Huggingface demo here](https://huggingface.co/spaces/ysharma/Low-rank-Adaptation).
      • Stable Tuner - An easy-to-install Dreambooth trainer with a very comfortable user interface.
      • Stable Diffusion Trainer - Stable Diffusion trainer with scalable dataset size and hardware usage. Requires 10 GB of VRAM.
      • textual-inversion - Addition of personalized content to Stable Diffusion without retraining the model ([Paper](https://textual-inversion.github.io/), [Paper2](https://dreambooth.github.io/)).
    • Specialized Usecases

      • dream-textures - A Blender add-on leveraging Stable Diffusion for texture creation.
      • stable-dreamfusion - An open source implementation of Google's text-to-3D DreamFusion paper, with Imagen replaced by Stable Diffusion.
  • Tutorials and Comparisons

  • Community Resources

    • Extending Functionality

      • 1 week of Stable Diffusion - A curated list of Stable Diffusion services, adaptations, user interfaces and integrations.
      • pharmapsychotic.com/tools - A curated list of Tools and Resources for AI Art, including but not limited to Stable Diffusion.
      • Stable Diffusion Resources - A thorough resource for answering pressing questions about Stable Diffusion, including guides, tutorials, and best software.
  • Social Media

  • Plugins for third party apps

  • Commercial SaaS and apps implementing Stable Diffusion

    • Extending Functionality

      • aiimagegenerator.org - Free online art generator for generating images using Stable Diffusion. Supports txt2img and img2img generation and inpainting.
      • aiTransformer - Free online image/video transformation tools, including Super Stylizer based on Stable Diffusion, supports txt2img, img2img and img2txt (clip-interrogator).
      • Barium.ai - Generate PBR (physics based rendering) textures from text. Free and paid plans.
      • Canva text-to-image - Text-to-image (beta) service from Canva
      • Dream Studio - Online art generation service by StabilityAI, the creators of Stable Diffusion. Similar to services like DALL-E or Midjourney, this operates on a credit model with a free allowance of credits given to signed up users on a monthly basis.
      • dream.ai - Online art generation service by Wombo.ai (mobile apps available).
      • GimmeAPIs - Third party REST API Stable Diffusion service.
      • Neural.love - Another online art generator with generous free credits as of Oct 2022.
      • replicate.com stable diffusion - Another SaaS offering for Stable Diffusion.
      • Stable Horde - Distributed stable diffusion cluster (think folding@home) with web, discord and telegram interfaces where joining with your GPU gives you priority.
      • Stable Diffusion as API - Third party REST API into Stable Diffusion service.
      • AI Art Generator (IOS) - iOS App to generate art using Stable Diffusion.
      • Starry AI (IOS) - Another iOS app offering Stable Diffusion with preset art styles.
  • Uncategorized