Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

awesome-stable-diffusion

Curated list of awesome resources for the Stable Diffusion AI Model.
https://github.com/awesome-stable-diffusion/awesome-stable-diffusion

Last synced: 3 days ago

  • Official Resources

    • Stable Diffusion 2.1 Release Notes - Stable Diffusion 2.1 release notes.
    • Public Release Announcement - StabilityAI's announcement about the public release of Stable Diffusion.
    • Official Discord - The official Stable Diffusion Discord by StabilityAI.
    • laion-aesthetic - The dataset used to train Stable Diffusion, useful for querying to see whether a concept is represented.
    • CompVis/Stable Diffusion - The official release of Stable Diffusion including a CLI, an AI-based Safety Classifier, which detects and suppresses sexualized content, and all the necessary files to get running.
    • stability-AI/stability-sdk - The official SDK used to build python applications integrated with StabilityAI's cloud platform instead of hosting the model locally. Operation requires an API Key (🖊️💵).
  • Actively Maintained Forks and Containers

      • KerasCV StableDiffusion - High performance implementation of stable diffusion on KerasCV.
      • AUTOMATIC1111/stable-diffusion-webui - Likely the fastest-moving, most feature-rich branch at the moment. Gradio-based UI with extensive features such as inpainting and outpainting, previews, XY plots, a variety of upscaling features, CLIP interrogation, textual inversion, negative prompting, training, checkpoint merging and switching, and more. Comes with a handy install script that takes care of most dependencies and add-ons.
      • basujindal/stable-diffusion - "Optimized Stable Diffusion"—a fork with dramatically reduced VRAM requirements through model splitting, enabling Stable Diffusion on lower-end graphics cards; includes a GradIO web interface and support for weighted prompts.
      • bes-dev/stable_diffusion.openvino - A fork for running the model using a CPU compatible with OpenVINO.
      • fast-stable-diffusion - Speed focused fork with Dreambooth integration.
      • Gauss - Native MacOS Stable Diffusion App.
      • imaginAIry - Pythonic generation of stable diffusion images. Unique in that it supports complex text-based masking. Has an interactive CLI, upscaling, face enhancement, tiling, and other standard features. No GUI.
      • invoke-ai/InvokeAI - (formerly known as lstein/stable-diffusion) - Very active fork adding a conversational CLI, basic web interface and support for GFPGAN, ESRGAN, Codeformer, weighted prompts, prompt blending, negative prompting, img2img, tiling, [textual-inversion](https://textual-inversion.github.io/) as well as inference on Apple M1.
      • NickLucche/stable-diffusion-nvidia-docker - Multi-GPU (Nvidia) capable Docker setup of SD.
      • replicate/cog-stable-diffusion - [Cog machine learning container](https://github.com/replicate/cog) of SD v1.4.
      • stable-diffusion-jupyterlab-docker - A Docker setup ready to go with Jupyter notebooks for Stable Diffusion.
      • runwayml/stable-diffusion - Stable Diffusion Branch by [RunwayML](https://runwayml.com) with specifically trained inpainting model for high quality inpainting.
    • Potentially Stale/Less active branches

      • neonsecret/stable-diffusion - Fork focusing on bigger resolutions with less VRAM at the expense of speed, automatically adjusting to the GPU's capabilities. Also includes upscaling, facial restoration via CodeFormer and a [custom UI](https://github.com/neonsecret/stable-diffusion/blob/main/GUI_TUTORIAL.md).
  • Checkpoints and Weights

    • Potentially Stale/Less active branches

  • Online Demos and Notebooks

  • Complementary Models and Tools

    • Potentially Stale/Less active branches

      • Civitai Model Repository - Repository for custom models and textual inversion embeddings.
      • sd-concepts-library - A library of user created [textual-inversion](https://textual-inversion.github.io/) embeddings to add new concepts to stable diffusion
      • Prompt to Prompt - Unofficial Implementation of Cross-attention-control for prompt to prompt image editing.
      • sd-prompt-graph - This is a React-based curve editor GUI for prompt interpolation animations made with Stable Diffusion.
      • DAAM - Diffusion attention attribution maps, generating heatmaps modelling the impact of specific terms and tokens in the prompt on the final diffusion result.
  • Training

    • GUIs

      • Auto SD Workflow - A UI for [lstein/stable-diffusion](https://github.com/lstein/stable-diffusion)'s dream.py with optimized UX for large-scale/production workflow around image synthesis. [Video Walkthrough](https://vimeo.com/748114237).
      • KIRI.ART - Mobile-first PWA with multiple models (incl. waifu diffusion). Run free locally or use free & paid credits on the live site. Built with React + MaterialUI. ([Source Code](https://github.com/gadicc/stable-diffusion-react-nextjs-mui-pwa)) `MIT License` `TypeScript`
      • NMKD GUI - Windows UI, fully featured. Closed source. Pick your own price.
      • sd-webui/stable-diffusion-webui - Very active fork with optional, highly featureful Gradio UI and support for txt2img, img2img inpainting, GFPGAN, ESRGAN, weighted prompts, optimized low memory version, optional [textual-inversion](https://textual-inversion.github.io/) and more.
      • Synesthetic - Prompt search engine and generator focused on remixing
      • Carefree Creator (local version) - User friendly GUI with a creator/artist centric workflow.
      • DiffusionBee - Self contained binary app for MacOS.
      • DiffusionUI - web UI made with Vue.js inspired by Dall-e using [diffusers](https://github.com/huggingface/diffusers), perfect for inpainting. [Video demo](https://www.youtube.com/watch?v=AFZvW5qURes)
      • Stable Diffusion Infinity - A proof of concept for outpainting with an infinite canvas interface. (requires powerful GPU).
      • Unstable Fusion - A Stable Diffusion desktop frontend with inpainting, img2img and more
      • stable-diffusion-webui-docker - A docker based frontend integrating the most popular forks.
      • cmdr2/stable-diffusion-ui - Another, simple to use UI for windows and Linux.
    • Upscaling

      • Upscale Wiki Model Database - Wiki and model database for checkpoints compatible with ESRGAN et al.
      • BasicSR - Open-source upscaling and restoration toolbox supporting several models.
      • BSRGAN - Another upscaling solution, specialized in upscaling degraded images.
      • Cupscale - GUI for running various upscaling models, including ESRGAN and RealESRGAN.
      • RealESRGAN - ESRGAN Upscaling (2x, 4x) and content restoration. Python: ```pip install realesrgan```
      • jquesnelle/txt2imghd - A port of the GOBIG mode from progrockdiffusion, providing high quality upscaling on top of txt2img.
    • Task Chaining

      • chaiNNer - Graphical node-based editor for chaining image processing tasks.
      • ai-art-generator - AI art generation suite combining Stable Diffusion and other models for high volume art generation.
      • dfserver - Hosted, distributed GPU cluster for running Stable Diffusion and various other AI image or prompt-building models.
    • Prompt Building

      • Stable diffusion prompt book - OpenArt's Stable Diffusion prompt book, a very comprehensive resource on prompt engineering.
      • ai-art.com/modifiers - A visual reference guide for keywords.
      • aipromptguide.com - Visual database of styles, modifiers, artists and persons.
      • aiTransformer Prompt Builder - Help build text prompts intuitively using an Image To Text tool to generate detailed text descriptions from image, and 800+ special Textual Inversion terms and often-used generic terms with sample images.
      • krea.ai - Prompt search engine that also recommends similar prompts to the one that you click on.
      • Lexica.art - A searchable, visual database of images and the prompts settings used to create them.
      • Prompthero
      • PromptoMania - A visual prompt construction tool.
      • Stable Diffusion Prompt Generator - Gives suggestions for improving a given text prompt.
      • PromptSearch - Yet another Stable Diffusion search engine but with public API
      • Same Energy - A visual search engine that returns images that have the same 'energy'.
      • PublicPrompts - *Collection* of PublicPrompts
      • pharmapsychotic/clip-interrogator - Jupyter notebook that uses CLIP models to suggest a prompt for images similar to a given image ([Demo](https://replicate.com/methexis-inc/img2prompt)).
      • rom1504/clip-retrieval - Searches for prompt keywords in the datasets used in training Stable Diffusion and other models ([Online GUI](https://rom1504.github.io/clip-retrieval/)). Some GUIs like Automatic1111 include this functionality.
      • Phraser - A visual prompt builder drawing on a database of examples. (Requires account creation)
    • Content Restoration

      • lama-cleaner - Content aware AI inpainting tool useful for removing unwanted objects or defects from images. Python: ```pip install lama-cleaner```
      • GFPGAN - Face Restoration GAN included in several forks for automatically fixing the face deformation commonly found in SD output.
      • CodeFormer - Another Face Restoration model ([Paper](https://arxiv.org/abs/2206.11253)).
    • Potentially Stale/Less active branches

      • Dreambooth-Stable-Diffusion - Implementation of [Google's DreamBooth](https://arxiv.org/abs/2208.12242) for stable diffusion, allowing fine-tuning of the model for specific concepts.
      • LORA - LORA (Low-rank Adaptation) for Faster Text-to-Image Diffusion Fine-tuning (UNET+CLIP), a faster, more efficient Fine tuning method requiring less RAM than Dreambooth. [Huggingface demo here](https://huggingface.co/spaces/ysharma/Low-rank-Adaptation).
      • Stable Tuner - An easy-to-install Dreambooth trainer with a very comfortable user interface.
      • Stable Diffusion Trainer - Stable Diffusion trainer with scalable dataset size and hardware usage. Requires 10 GB of VRAM.
      • textual-inversion - Addition of personalized content to Stable Diffusion without retraining the model ([Paper](https://textual-inversion.github.io/), [Paper2](https://dreambooth.github.io/)).
    • Specialized Usecases

      • dream-textures - A blender addon leveraging stable diffusion for texture creation.
      • stable-dreamfusion - An open source implementation of Google's text-to-3D DreamFusion paper, with Imagen replaced by Stable Diffusion.
  • Tutorials and Comparisons

  • Community Resources

    • Extending Functionality

      • 1 week of Stable Diffusion - A curated list of Stable Diffusion services, adaptations, user interfaces and integrations.
      • pharmapsychotic.com/tools - A curated list of Tools and Resources for AI Art, including but not limited to Stable Diffusion.
      • Stable Diffusion Resources - A thorough resource for answering pressing questions about Stable Diffusion, including guides, tutorials, and best software.
  • Social Media

  • Plugins for third party apps

  • Commercial SaaS and apps implementing Stable Diffusion

    • Extending Functionality

      • aiimagegenerator.org - Free online art generator for generating images using Stable Diffusion. Supports txt2img and img2img generation and inpainting.
      • aiTransformer - Free online image/video transformation tools, including Super Stylizer based on Stable Diffusion, supports txt2img, img2img and img2txt (clip-interrogator).
      • Barium.ai - Generate PBR (physically based rendering) textures from text. Free and paid plans.
      • Canva text-to-image - Text-to-image (beta) service from Canva
      • Dream Studio - Online art generation service by StabilityAI, the creators of Stable Diffusion. Similar to services like DALL-E or Midjourney, this operates on a credit model with a free allowance of credits given to signed up users on a monthly basis.
      • dream.ai - Online art generation service by Wombo.ai (mobile apps available).
      • GimmeAPIs - Third party REST API Stable Diffusion service.
      • Neural.love - Another online art generator with generous free credits as of Oct 2022.
      • replicate.com stable diffusion - Another SaaS offering for Stable Diffusion.
      • Stable Horde - Distributed stable diffusion cluster (think folding@home) with web, discord and telegram interfaces where joining with your GPU gives you priority.
      • Stable Diffusion as API - Third-party REST API for a Stable Diffusion service.
      • AI Art Generator (IOS) - iOS App to generate art using Stable Diffusion.
      • Starry AI (iOS) - Another iOS app offering Stable Diffusion with preset art styles.
  • Uncategorized