Projects in Awesome Lists tagged with diffusion

https://github.com/automatic1111/stable-diffusion-webui

Stable Diffusion web UI

ai ai-art deep-learning diffusion gradio image-generation image2image img2img pytorch stable-diffusion text2image torch txt2img unstable upscaling web

Last synced: 09 Sep 2025

https://github.com/AUTOMATIC1111/stable-diffusion-webui

Stable Diffusion web UI

ai ai-art deep-learning diffusion gradio image-generation image2image img2img pytorch stable-diffusion text2image torch txt2img unstable upscaling web

Last synced: 14 Mar 2025

https://github.com/huggingface/diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

deep-learning diffusion flax flux hacktoberfest image-generation image2image image2video jax latent-diffusion-models pytorch score-based-generative-modeling stable-diffusion stable-diffusion-diffusers text2image text2video video2video

Last synced: 01 May 2026

https://github.com/sgl-project/sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

attention blackwell cuda deepseek diffusion glm gpt-oss inference llama llm minimax moe qwen qwen-image reinforcement-learning transformer vlm wan

Last synced: 16 May 2026

https://github.com/huggingface/peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

adapter diffusion llm lora parameter-efficient-learning python pytorch transformers

Last synced: 12 May 2025

https://github.com/datawhalechina/leedl-tutorial

《李宏毅深度学习教程》（李宏毅老师推荐👍，苹果书🍎），PDF下载地址：https://github.com/datawhalechina/leedl-tutorial/releases

bert chatgpt cnn deep-learning diffusion gan leedl-tutorial machine-learning network-compression pruning reinforcement-learning rnn self-attention transfer-learning transformer tutorial

Last synced: 14 May 2025

https://github.com/easydiffusion/easydiffusion

An easy 1-click way to create beautiful artwork on your PC using AI, with no tech knowledge. Provides a browser UI for generating images from text prompts and images. Just enter your text prompt, and see the generated image.

art diffusion generative-art gui stable

Last synced: 11 May 2025

https://github.com/cloneofsimo/lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

diffusion dreambooth fine-tuning lora stable-diffusion

Last synced: 14 May 2025

https://github.com/open-mmlab/mmagic

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.

aigc computer-vision deep-learning diffusion diffusion-models generative-adversarial-network generative-ai image-editing image-generation image-processing image-synthesis inpainting matting pytorch super-resolution text2image video-frame-interpolation video-interpolation video-super-resolution

Last synced: 13 Dec 2025

https://github.com/leejet/stable-diffusion.cpp

Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference in pure C/C++

ai cplusplus diffusion flux flux-dev flux-schnell ggml image-generation image2image img2img latent-diffusion qwen-image stable-diffusion text2image txt2img videogeneration wan z-image z-image-turbo

Last synced: 23 May 2026

https://github.com/NVlabs/Sana

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

diffusion dit pytorch sana text-to-image-generation transformers

Last synced: 07 Aug 2025

https://github.com/riffusion/riffusion-hobby

Stable diffusion for real-time music generation

ai audio diffusers diffusion music stable-diffusion

Last synced: 12 Jan 2026

https://github.com/jina-ai/discoart

🪩 Create Disco Diffusion artworks in one line

clip-guided-diffusion creative-ai creative-art cross-modal dalle diffusion disco-diffusion discodiffusion generative-art imgen latent-diffusion midjourney multimodal prompts stable-diffusion

Last synced: 14 May 2025

https://github.com/williamyang1991/rerender_a_video

[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation

controlnet diffusion video-processing

Last synced: 15 May 2025

https://github.com/williamyang1991/Rerender_A_Video

[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation

controlnet diffusion video-processing

Last synced: 11 Apr 2025

https://github.com/openvpi/DiffSinger

An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism

acoustic-model diffusion diffussion-model melody-frontend midi pitch-prediction rectified-flow singing-voice singing-voice-synthesis svs

Last synced: 02 Apr 2025

https://github.com/ai-forever/kandinsky-2

Kandinsky 2 — multilingual text2image latent diffusion model

diffusion image-generation image2image inpainting ipython-notebook kandinsky outpainting text-to-image text2image

Last synced: 15 May 2025

https://github.com/datawhalechina/tiny-universe

《大模型白盒子构建指南》：一个全手搓的Tiny-Universe

agent diffusion evaluation-metrics llama qwen rag transformers

Last synced: 14 May 2025

https://github.com/ai-forever/Kandinsky-2

Kandinsky 2 — multilingual text2image latent diffusion model

diffusion image-generation image2image inpainting ipython-notebook kandinsky outpainting text-to-image text2image

Last synced: 08 Apr 2025

https://github.com/openvpi/diffsinger

An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism

acoustic-model diffusion diffussion-model melody-frontend midi pitch-prediction rectified-flow singing-voice singing-voice-synthesis svs

Last synced: 08 Jan 2026

https://github.com/playvoice/whisper-vits-svc

Core Engine of Singing Voice Conversion & Singing Voice Clone

change diff-svc diffusion diffusion-svc singing-voice-conversion sovits svc vits vits2 voice

Last synced: 14 May 2025

https://github.com/tmelyralab/musev

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

diffusion human-video-generation image2video infinite-length musev video-generation

Last synced: 26 Sep 2025

https://github.com/TMElyralab/MuseV

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

diffusion human-video-generation image2video infinite-length musev video-generation

Last synced: 11 Apr 2025

https://github.com/riffusion/riffusion-app-hobby

Stable diffusion for real-time music generation (web app)

ai audio diffusion music nextjs stable-diffusion threejs

Last synced: 15 May 2025

https://github.com/PlayVoice/whisper-vits-svc

Core Engine of Singing Voice Conversion & Singing Voice Clone

change diff-svc diffusion diffusion-svc singing-voice-conversion sovits svc vits vits2 voice

Last synced: 30 Aug 2025

https://github.com/prs-eth/marigold

[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

diffusion in-the-wild monocular-depth-estimation zero-shot

Last synced: 14 May 2025

https://github.com/prs-eth/Marigold

[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

diffusion in-the-wild monocular-depth-estimation zero-shot

Last synced: 28 Mar 2025

https://github.com/Alpha-VLLM/Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

aigc diffusion diffusion-model diffusion-models diffusion-transformer generation-models transformer transformers

Last synced: 24 Mar 2025

https://github.com/alpha-vllm/lumina-t2x

Lumina-T2X is a unified framework for Text to Any Modality Generation

aigc diffusion diffusion-model diffusion-models diffusion-transformer generation-models transformer transformers

Last synced: 11 Apr 2025

https://github.com/rupeshs/fastsdcpu

Fast stable diffusion on CPU and AI PC

aipc api cli cpu desktopgui diffusers diffusion fastsdcpu flux gradio latentconsistencymodels lcmdiffusion openvino qt sdupcale sdxlturbo sdxs stablediffusion torch webui

Last synced: 12 Jan 2026

https://github.com/nunchaku-tech/ComfyUI-nunchaku

ComfyUI Plugin of Nunchaku

comfyui diffusion flux genai mlsys quantization

Last synced: 02 Sep 2025

https://github.com/pollinations/pollinations

Free Open-Source Image and Text Generation

colaboratory colaboratory-notebook diffusion gan generative ipfs javascript machinelearning nodejs python

Last synced: 13 May 2025

https://github.com/foundationvision/llamagen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

auto-regressive-model diffusion diffusion-models image-generation llama llm text2image

Last synced: 15 May 2025

https://github.com/FoundationVision/LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

auto-regressive-model diffusion diffusion-models image-generation llama llm text2image

Last synced: 07 May 2025

https://github.com/varunshenoy/opendream

An extensible, easy-to-use, and portable diffusion web UI 👨‍🎨

ai automatic-1111 diffusion image-generation stable-diffusion

Last synced: 08 Apr 2025

https://github.com/maks-s/sd-akashic

A compendium of informations regarding Stable Diffusion (SD)

diffusion guide stable-diffusion

Last synced: 16 May 2025

https://github.com/Maks-s/sd-akashic

A compendium of informations regarding Stable Diffusion (SD)

diffusion guide stable-diffusion

Last synced: 24 Mar 2025

https://github.com/nvidia/cosmos-tokenizer

A suite of image and video neural tokenizers

diffusion tokenization transformers

Last synced: 30 Oct 2025

https://github.com/tencentarc/brushnet

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

diffusion diffusion-models eccv eccv2024 image-inpainting text-to-image

Last synced: 14 May 2025

https://github.com/intellabs/fastrag

Efficient Retrieval Augmentation and Generation Framework

benchmark colbert diffusion generative-ai information-retrieval knowledge-graph llm multi-modal nlp question-answering semantic-search sentence-transformers summarization transformers

Last synced: 14 May 2025

https://github.com/TencentARC/BrushNet

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

diffusion diffusion-models eccv eccv2024 image-inpainting text-to-image

Last synced: 28 Mar 2025

https://github.com/huggingface/finetrainers

Scalable and memory-optimized training of diffusion models

ai art artificial-intelligence diffusers diffusion diffusion-models pytorch transformers

Last synced: 14 Dec 2025

https://github.com/mini-sora/minisora

MiniSora: A community aims to explore the implementation path and future development direction of Sora.

diffusion sora video-generation

Last synced: 14 May 2025

https://github.com/0xCrunchyy/10x

Optimized inference and fine-tuning framework for diffusion (image & video) models. Up to 3x faster & 80% less VRAM.

artificial-inteligence diffusion diffusion-models fine-tuning flux gpt inference lora pytorch sdxl

Last synced: 09 Jan 2026

https://github.com/uminosachi/sd-webui-inpaint-anything

Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.

ai-art anything diffusers diffusion extension generative-art gradio huggingface huggingface-diffusers image-generation image2image img2img inpaint inpaint-anything inpainting latent-diffusion segment segment-anything segmentation stable-diffusion

Last synced: 16 May 2025

https://github.com/River-Zhang/ICEdit

Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Training released! Surpasses GPT-4o in ID persistence! Official ComfyUI workflow release! Only 4GB VRAM is enough to run!

diffusion diffusion-models diffusion-transformer dit editing-image gpt4o gpt4oimage image-editing in-context

Last synced: 12 Jun 2025

https://github.com/a-r-r-o-w/finetrainers

Scalable and memory-optimized training of diffusion models

ai art artificial-intelligence diffusers diffusion diffusion-models pytorch transformers

Last synced: 14 May 2025

https://github.com/thu-lyj-lab/t3bench

T3Bench: Benchmarking Current Progress in Text-to-3D Generation

3d diffusion nerf text-to-3d

Last synced: 16 May 2025

https://github.com/declare-lab/tango

A family of diffusion models for text-to-audio generation.

audio-generation diffusion diffusion-models language-models large-language-models text-to-audio

Last synced: 16 May 2025

https://github.com/EdVince/Stable-Diffusion-NCNN

Stable Diffusion in NCNN with c++, supported txt2img and img2img

android clip cpp diffusion executable img2img mnn ncnn onnx stable-diffusion tensorrt tnn txt2img

Last synced: 13 Apr 2025

https://github.com/cloneofsimo/mindiffusion

Self-contained, minimalistic implementation of diffusion models with Pytorch.

diffusion pytorch

Last synced: 13 Apr 2025

https://github.com/IntelLabs/fastRAG

Efficient Retrieval Augmentation and Generation Framework

benchmark colbert diffusion generative-ai information-retrieval knowledge-graph llm multi-modal nlp question-answering semantic-search sentence-transformers summarization transformers

Last synced: 24 Mar 2025

https://github.com/cloneofsimo/minDiffusion

Self-contained, minimalistic implementation of diffusion models with Pytorch.

diffusion pytorch

Last synced: 27 Mar 2025

https://github.com/Uminosachi/sd-webui-inpaint-anything

Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.

ai-art anything diffusers diffusion extension generative-art gradio huggingface huggingface-diffusers image-generation image2image img2img inpaint inpaint-anything inpainting latent-diffusion segment segment-anything segmentation stable-diffusion

Last synced: 16 Apr 2025

https://github.com/sail-sg/adan

Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models

adan artificial-intelligence bert-model convnext cuda-programming deep-learning diffusion dreamfusion fairseq gpt2 llm-training llms mae moe optimizer pytorch resnet timm transformer-xl vit

Last synced: 07 Jul 2025

https://github.com/sail-sg/Adan

Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models

adan artificial-intelligence bert-model convnext cuda-programming deep-learning diffusion dreamfusion fairseq gpt2 llm-training llms mae moe optimizer pytorch resnet timm transformer-xl vit

Last synced: 05 Apr 2025

https://github.com/castorini/daam

Diffusion attentive attribution maps for interpreting Stable Diffusion.

diffusion explainable-ai generative-ai huggingface pytorch stable-diffusion

Last synced: 16 May 2025

https://github.com/ChaofanTao/Autoregressive-Models-in-Vision-Survey

[TMLR 2025🔥] A survey for the autoregressive models in vision.

acceleration autoregressive computer-vision deep-learning diffusion embodied-ai image-generation medical-ai motion-prediction multimodal point-cloud survey text-to-image video-generation

Last synced: 11 Jun 2026

https://github.com/fboulnois/stable-diffusion-docker

Run the official Stable Diffusion releases in a Docker container with txt2img, img2img, depth2img, pix2pix, upscale4x, and inpaint.

dall-e dalle diffusion docker generative-art huggingface image-generation inpainting midjourney pytorch stable-diffusion tensorflow text-to-image

Last synced: 13 Apr 2025

https://github.com/pku-yuangroup/consisid

[CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition

diffusion diffusion-models identity-preserving text-to-video video-generation video-generation-dataset video-generator videogeneration

Last synced: 06 Jul 2025

https://github.com/thu-ml/riflex

Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025)

cogvideox diffusion diffusion-models diffusion-transformer dit extrapolation generative-model hunyuan-video long-video-generation position-embedding rope video-generation

Last synced: 01 Jul 2025

https://github.com/cloneofsimo/paint-with-words-sd

Implementation of Paint-with-words with Stable Diffusion : method from eDiff-I that let you generate image from text-labeled segmentation map.

diffusion generative-model stable-diffusion

Last synced: 05 Apr 2025

https://github.com/some9000/StylePile

A prompt generation helper script for AUTOMATIC1111/stable-diffusion-webui and compatible forks

diffusion generation generator promt stable

Last synced: 08 May 2025

https://github.com/omriav/blended-latent-diffusion

Official implementation for "Blended Latent Diffusion" [SIGGRAPH 2023]

computer-vision deep-learning diffusion diffusion-models generative-model image-generation multimodal multimodal-deep-learning pytorch text-driven-editing text-guided-manipulation text-to-image text-to-image-synthesis

Last synced: 28 Mar 2025

https://github.com/omriav/blended-diffusion

Official implementation for "Blended Diffusion for Text-driven Editing of Natural Images" [CVPR 2022]

blended-diffusion deep-learning diffusion multimodal openai openai-clip text-guided-manipulation text-to-image

Last synced: 28 Mar 2025

https://github.com/williamyang1991/fresco

[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation

controlnet diffusion video-processing

Last synced: 05 Apr 2025

https://github.com/williamyang1991/FRESCO

[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation

controlnet diffusion video-processing

Last synced: 28 Mar 2025

https://github.com/microsoft/foldingdiff

Diffusion models of protein structure; trigonometry and attention are all you need!

diffusion diffusion-models protein protein-structure-generation proteins transformer

Last synced: 30 Mar 2025

https://github.com/AspirinCode/papers-for-molecular-design-using-DL

List of molecular design using Generative AI and Deep Learning

deep-generative-models diffusion drug-design energy-based-model gan generative-ai gnns lstm molecular-design prompt-learning reinforcement-learning rnn score-based-generative-models transformer vae

Last synced: 14 Mar 2025

https://github.com/aspirincode/papers-for-molecular-design-using-dl

List of molecular design using Generative AI and Deep Learning

deep-generative-models diffusion drug-design energy-based-model gan generative-ai gnns lstm molecular-design prompt-learning reinforcement-learning rnn score-based-generative-models transformer vae

Last synced: 24 Mar 2025

https://github.com/dromara/omega-ai

Omega-AI：基于java打造的深度学习框架，帮助你快速搭建神经网络，实现模型推理与训练，引擎支持自动求导，多线程与GPU运算，GPU支持CUDA，CUDNN。

ai deeplearning diffusion llm neural-network yolo

Last synced: 08 Jul 2025

https://github.com/LeCAR-Lab/dial-mpc

Official implementation for the paper "Full-Order Sampling-Based MPC for Torque-Level Locomotion Control via Diffusion-Style Annealing". DIAL-MPC is a novel sampling-based MPC framework for legged robot full-order torque-level control with both precision and agility in a training-free manner.

diffusion humanoid legged-robots mpc online-control optimal-control quadruped sampling-based-control

Last synced: 18 Oct 2025

https://github.com/afiaka87/clip-guided-diffusion

A CLI tool/python module for generating images from text using guided diffusion and CLIP from OpenAI.

artificial-intelligence deep-learning diffusion image-generation multimodal multimodality openai openai-clip text-to-image text-to-image-synthesis

Last synced: 05 Aug 2025

https://github.com/thu-ml/Causal-Forcing

Official codebase for "Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation"

auto-regressive-diffusion-model autoregressive-models consistency-models diffusion diffusion-models distillation few-step-generation generative-ai text-to-video text-to-video-generation video-diffusion-model video-generation wan-video wan2 wan21 world-model world-models

Last synced: 05 Mar 2026

https://github.com/Auto1111SDK/Auto1111SDK

An SDK/Python library for Automatic 1111 to run state-of-the-art diffusion models

ai ai-art api automatic1111 deep-learning diffusers diffusion image-generation image-to-image img2img python pytorch stable-diffusion stable-diffusion-webui text-to-image torch txt2img unstable upscaling web

Last synced: 29 Oct 2025

https://github.com/huggingface/open-muse

Open reproduction of MUSE for fast text2image generation.

cv deep-learning diffusion generative-art nlp text2image transformer

Last synced: 14 Oct 2025

https://github.com/scenediffuser/Scene-Diffuser

Official implementation of CVPR23 paper "Diffusion-based Generation, Optimization, and Planning in 3D Scenes"

3d-scene-understanding diffusion generative-model

Last synced: 27 Apr 2025

https://github.com/HaozheLiu-ST/T-GATE

T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!

cross-attention cross-attention-diffusers diffusers diffusion efficiency inference pytorch text2image training-free transformer

Last synced: 13 Mar 2025

https://github.com/ailab-cvc/freenoise

[ICLR 2024] Code for FreeNoise based on VideoCrafter

aigc diffusion generative-model video-diffusion-model

Last synced: 06 Apr 2025

https://github.com/AILab-CVC/FreeNoise

[ICLR 2024] Code for FreeNoise based on VideoCrafter

aigc diffusion generative-model video-diffusion-model

Last synced: 28 Mar 2025

https://github.com/p1atdev/leco

Low-rank adaptation for Erasing COncepts from diffusion models.

diffusion lora stable-diffusion

Last synced: 06 Apr 2025

https://github.com/woctezuma/stable-diffusion-colab

Colab notebook for Stable Diffusion Hyper-SDXL.

colab colab-notebook colaboratory deep-learning diffusers diffusion diffusion-models google-colab google-colab-notebook google-colaboratory huggingface-diffusers hyper-sd hyper-sdxl image-generation stable-diffusion stable-diffusion-xl text-to-image text-to-image-generation text-to-image-synthesis text2image

Last synced: 05 Apr 2025

https://github.com/rehglab/rave

RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models [CVPR 2024]

diffusion stable-diffusion video-editing

Last synced: 27 Jan 2026

https://github.com/nianticlabs/diffusionerf

[CVPR 2023] DiffusioNeRF: Regularizing Neural Radiance Fields with Denoising Diffusion Models

deep-learning diffusion diffusion-models nerf neuralradiance-fields radiance-field reconstruction regularization

Last synced: 07 Apr 2025

https://github.com/qitianwu/DIFFormer

The official implementation for ICLR23 spotlight paper "DIFFormer: Scalable (Graph) Transformers Induced by Energy Constrained Diffusion"

attention diffusion diffusion-equation geometric-deep-learning graph-neural-networks graph-transformer iclr2023 image-classification large-graph node-classification pytorch pytorch-geometric pytorch-geometric-temporal spatial-temporal-forecasting text-classification transformer

Last synced: 27 Mar 2025

https://github.com/zibojia/COCOCO

Video-Inpaint-Anything: This is the inference code for our paper CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility.

cococo diffusion inpainting pytorch sam2 segment segment-anything text-guided text-guided-video-inpainting video-inpainting video-inpainting-with-prompt video-sam2-inpaint