An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with diffusion

A curated list of projects in awesome lists tagged with diffusion .

https://github.com/huggingface/peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

adapter diffusion llm lora parameter-efficient-learning python pytorch transformers

Last synced: 12 May 2025

https://github.com/datawhalechina/leedl-tutorial

《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases

bert chatgpt cnn deep-learning diffusion gan leedl-tutorial machine-learning network-compression pruning reinforcement-learning rnn self-attention transfer-learning transformer tutorial

Last synced: 14 May 2025

https://github.com/easydiffusion/easydiffusion

An easy 1-click way to create beautiful artwork on your PC using AI, with no tech knowledge. Provides a browser UI for generating images from text prompts and images. Just enter your text prompt, and see the generated image.

art diffusion generative-art gui stable

Last synced: 11 May 2025

https://github.com/cloneofsimo/lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

diffusion dreambooth fine-tuning lora stable-diffusion

Last synced: 14 May 2025

https://github.com/open-mmlab/mmagic

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.

aigc computer-vision deep-learning diffusion diffusion-models generative-adversarial-network generative-ai image-editing image-generation image-processing image-synthesis inpainting matting pytorch super-resolution text2image video-frame-interpolation video-interpolation video-super-resolution

Last synced: 13 Dec 2025

https://github.com/NVlabs/Sana

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

diffusion dit pytorch sana text-to-image-generation transformers

Last synced: 07 Aug 2025

https://github.com/riffusion/riffusion-hobby

Stable diffusion for real-time music generation

ai audio diffusers diffusion music stable-diffusion

Last synced: 12 Jan 2026

https://github.com/williamyang1991/rerender_a_video

[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation

controlnet diffusion video-processing

Last synced: 15 May 2025

https://github.com/williamyang1991/Rerender_A_Video

[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation

controlnet diffusion video-processing

Last synced: 11 Apr 2025

https://github.com/openvpi/DiffSinger

An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism

acoustic-model diffusion diffussion-model melody-frontend midi pitch-prediction rectified-flow singing-voice singing-voice-synthesis svs

Last synced: 02 Apr 2025

https://github.com/ai-forever/kandinsky-2

Kandinsky 2 — multilingual text2image latent diffusion model

diffusion image-generation image2image inpainting ipython-notebook kandinsky outpainting text-to-image text2image

Last synced: 15 May 2025

https://github.com/datawhalechina/tiny-universe

《大模型白盒子构建指南》:一个全手搓的Tiny-Universe

agent diffusion evaluation-metrics llama qwen rag transformers

Last synced: 14 May 2025

https://github.com/ai-forever/Kandinsky-2

Kandinsky 2 — multilingual text2image latent diffusion model

diffusion image-generation image2image inpainting ipython-notebook kandinsky outpainting text-to-image text2image

Last synced: 08 Apr 2025

https://github.com/openvpi/diffsinger

An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism

acoustic-model diffusion diffussion-model melody-frontend midi pitch-prediction rectified-flow singing-voice singing-voice-synthesis svs

Last synced: 08 Jan 2026

https://github.com/playvoice/whisper-vits-svc

Core Engine of Singing Voice Conversion & Singing Voice Clone

change diff-svc diffusion diffusion-svc singing-voice-conversion sovits svc vits vits2 voice

Last synced: 14 May 2025

https://github.com/tmelyralab/musev

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

diffusion human-video-generation image2video infinite-length musev video-generation

Last synced: 26 Sep 2025

https://github.com/TMElyralab/MuseV

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

diffusion human-video-generation image2video infinite-length musev video-generation

Last synced: 11 Apr 2025

https://github.com/riffusion/riffusion-app-hobby

Stable diffusion for real-time music generation (web app)

ai audio diffusion music nextjs stable-diffusion threejs

Last synced: 15 May 2025

https://github.com/PlayVoice/whisper-vits-svc

Core Engine of Singing Voice Conversion & Singing Voice Clone

change diff-svc diffusion diffusion-svc singing-voice-conversion sovits svc vits vits2 voice

Last synced: 30 Aug 2025

https://github.com/prs-eth/marigold

[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

diffusion in-the-wild monocular-depth-estimation zero-shot

Last synced: 14 May 2025

https://github.com/prs-eth/Marigold

[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

diffusion in-the-wild monocular-depth-estimation zero-shot

Last synced: 28 Mar 2025

https://github.com/Alpha-VLLM/Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

aigc diffusion diffusion-model diffusion-models diffusion-transformer generation-models transformer transformers

Last synced: 24 Mar 2025

https://github.com/alpha-vllm/lumina-t2x

Lumina-T2X is a unified framework for Text to Any Modality Generation

aigc diffusion diffusion-model diffusion-models diffusion-transformer generation-models transformer transformers

Last synced: 11 Apr 2025

https://github.com/foundationvision/llamagen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

auto-regressive-model diffusion diffusion-models image-generation llama llm text2image

Last synced: 15 May 2025

https://github.com/FoundationVision/LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

auto-regressive-model diffusion diffusion-models image-generation llama llm text2image

Last synced: 07 May 2025

https://github.com/varunshenoy/opendream

An extensible, easy-to-use, and portable diffusion web UI 👨‍🎨

ai automatic-1111 diffusion image-generation stable-diffusion

Last synced: 08 Apr 2025

https://github.com/maks-s/sd-akashic

A compendium of informations regarding Stable Diffusion (SD)

diffusion guide stable-diffusion

Last synced: 16 May 2025

https://github.com/Maks-s/sd-akashic

A compendium of informations regarding Stable Diffusion (SD)

diffusion guide stable-diffusion

Last synced: 24 Mar 2025

https://github.com/nvidia/cosmos-tokenizer

A suite of image and video neural tokenizers

diffusion tokenization transformers

Last synced: 30 Oct 2025

https://github.com/tencentarc/brushnet

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

diffusion diffusion-models eccv eccv2024 image-inpainting text-to-image

Last synced: 14 May 2025

https://github.com/TencentARC/BrushNet

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

diffusion diffusion-models eccv eccv2024 image-inpainting text-to-image

Last synced: 28 Mar 2025

https://github.com/huggingface/finetrainers

Scalable and memory-optimized training of diffusion models

ai art artificial-intelligence diffusers diffusion diffusion-models pytorch transformers

Last synced: 14 Dec 2025

https://github.com/mini-sora/minisora

MiniSora: A community aims to explore the implementation path and future development direction of Sora.

diffusion sora video-generation

Last synced: 14 May 2025

https://github.com/0xCrunchyy/10x

Optimized inference and fine-tuning framework for diffusion (image & video) models. Up to 3x faster & 80% less VRAM.

artificial-inteligence diffusion diffusion-models fine-tuning flux gpt inference lora pytorch sdxl

Last synced: 09 Jan 2026

https://github.com/River-Zhang/ICEdit

Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Training released! Surpasses GPT-4o in ID persistence! Official ComfyUI workflow release! Only 4GB VRAM is enough to run!

diffusion diffusion-models diffusion-transformer dit editing-image gpt4o gpt4oimage image-editing in-context

Last synced: 12 Jun 2025

https://github.com/a-r-r-o-w/finetrainers

Scalable and memory-optimized training of diffusion models

ai art artificial-intelligence diffusers diffusion diffusion-models pytorch transformers

Last synced: 14 May 2025

https://github.com/thu-lyj-lab/t3bench

T3Bench: Benchmarking Current Progress in Text-to-3D Generation

3d diffusion nerf text-to-3d

Last synced: 16 May 2025

https://github.com/declare-lab/tango

A family of diffusion models for text-to-audio generation.

audio-generation diffusion diffusion-models language-models large-language-models text-to-audio

Last synced: 16 May 2025

https://github.com/EdVince/Stable-Diffusion-NCNN

Stable Diffusion in NCNN with c++, supported txt2img and img2img

android clip cpp diffusion executable img2img mnn ncnn onnx stable-diffusion tensorrt tnn txt2img

Last synced: 13 Apr 2025

https://github.com/cloneofsimo/mindiffusion

Self-contained, minimalistic implementation of diffusion models with Pytorch.

diffusion pytorch

Last synced: 13 Apr 2025

https://github.com/cloneofsimo/minDiffusion

Self-contained, minimalistic implementation of diffusion models with Pytorch.

diffusion pytorch

Last synced: 27 Mar 2025

https://github.com/castorini/daam

Diffusion attentive attribution maps for interpreting Stable Diffusion.

diffusion explainable-ai generative-ai huggingface pytorch stable-diffusion

Last synced: 16 May 2025

https://github.com/fboulnois/stable-diffusion-docker

Run the official Stable Diffusion releases in a Docker container with txt2img, img2img, depth2img, pix2pix, upscale4x, and inpaint.

dall-e dalle diffusion docker generative-art huggingface image-generation inpainting midjourney pytorch stable-diffusion tensorflow text-to-image

Last synced: 13 Apr 2025

https://github.com/pku-yuangroup/consisid

[CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition

diffusion diffusion-models identity-preserving text-to-video video-generation video-generation-dataset video-generator videogeneration

Last synced: 06 Jul 2025

https://github.com/thu-ml/riflex

Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025)

cogvideox diffusion diffusion-models diffusion-transformer dit extrapolation generative-model hunyuan-video long-video-generation position-embedding rope video-generation

Last synced: 01 Jul 2025

https://github.com/cloneofsimo/paint-with-words-sd

Implementation of Paint-with-words with Stable Diffusion : method from eDiff-I that let you generate image from text-labeled segmentation map.

diffusion generative-model stable-diffusion

Last synced: 05 Apr 2025

https://github.com/some9000/StylePile

A prompt generation helper script for AUTOMATIC1111/stable-diffusion-webui and compatible forks

diffusion generation generator promt stable

Last synced: 08 May 2025

https://github.com/omriav/blended-diffusion

Official implementation for "Blended Diffusion for Text-driven Editing of Natural Images" [CVPR 2022]

blended-diffusion deep-learning diffusion multimodal openai openai-clip text-guided-manipulation text-to-image

Last synced: 28 Mar 2025

https://github.com/williamyang1991/fresco

[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation

controlnet diffusion video-processing

Last synced: 05 Apr 2025

https://github.com/williamyang1991/FRESCO

[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation

controlnet diffusion video-processing

Last synced: 28 Mar 2025

https://github.com/microsoft/foldingdiff

Diffusion models of protein structure; trigonometry and attention are all you need!

diffusion diffusion-models protein protein-structure-generation proteins transformer

Last synced: 30 Mar 2025

https://github.com/dromara/omega-ai

Omega-AI:基于java打造的深度学习框架,帮助你快速搭建神经网络,实现模型推理与训练,引擎支持自动求导,多线程与GPU运算,GPU支持CUDA,CUDNN。

ai deeplearning diffusion llm neural-network yolo

Last synced: 08 Jul 2025

https://github.com/LeCAR-Lab/dial-mpc

Official implementation for the paper "Full-Order Sampling-Based MPC for Torque-Level Locomotion Control via Diffusion-Style Annealing". DIAL-MPC is a novel sampling-based MPC framework for legged robot full-order torque-level control with both precision and agility in a training-free manner.

diffusion humanoid legged-robots mpc online-control optimal-control quadruped sampling-based-control

Last synced: 18 Oct 2025

https://github.com/afiaka87/clip-guided-diffusion

A CLI tool/python module for generating images from text using guided diffusion and CLIP from OpenAI.

artificial-intelligence deep-learning diffusion image-generation multimodal multimodality openai openai-clip text-to-image text-to-image-synthesis

Last synced: 05 Aug 2025

https://github.com/huggingface/open-muse

Open reproduction of MUSE for fast text2image generation.

cv deep-learning diffusion generative-art nlp text2image transformer

Last synced: 14 Oct 2025

https://github.com/scenediffuser/Scene-Diffuser

Official implementation of CVPR23 paper "Diffusion-based Generation, Optimization, and Planning in 3D Scenes"

3d-scene-understanding diffusion generative-model

Last synced: 27 Apr 2025

https://github.com/HaozheLiu-ST/T-GATE

T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!

cross-attention cross-attention-diffusers diffusers diffusion efficiency inference pytorch text2image training-free transformer

Last synced: 13 Mar 2025

https://github.com/ailab-cvc/freenoise

[ICLR 2024] Code for FreeNoise based on VideoCrafter

aigc diffusion generative-model video-diffusion-model

Last synced: 06 Apr 2025

https://github.com/AILab-CVC/FreeNoise

[ICLR 2024] Code for FreeNoise based on VideoCrafter

aigc diffusion generative-model video-diffusion-model

Last synced: 28 Mar 2025

https://github.com/p1atdev/leco

Low-rank adaptation for Erasing COncepts from diffusion models.

diffusion lora stable-diffusion

Last synced: 06 Apr 2025

https://github.com/rehglab/rave

RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models [CVPR 2024]

diffusion stable-diffusion video-editing

Last synced: 15 Sep 2025

https://github.com/nianticlabs/diffusionerf

[CVPR 2023] DiffusioNeRF: Regularizing Neural Radiance Fields with Denoising Diffusion Models

deep-learning diffusion diffusion-models nerf neuralradiance-fields radiance-field reconstruction regularization

Last synced: 07 Apr 2025

https://github.com/zibojia/COCOCO

Video-Inpaint-Anything: This is the inference code for our paper CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility.

cococo diffusion inpainting pytorch sam2 segment segment-anything text-guided text-guided-video-inpainting video-inpainting video-inpainting-with-prompt video-sam2-inpaint

Last synced: 24 Jul 2025

https://github.com/yeungchenwa/FontDiffuser

[AAAI2024] FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning

deep-learning diffusers diffusion font-generation image-generation

Last synced: 28 Mar 2025

https://github.com/RehgLab/RAVE

RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models [CVPR 2024]

diffusion stable-diffusion video-editing

Last synced: 28 Mar 2025

https://github.com/joaolages/diffusers-interpret

Diffusers-Interpret 🤗🧨🕵️‍♀️: Model explainability for 🤗 Diffusers. Get explanations for your generated images.

computer-vision deep-learning diffusers diffusion explainable-ai image-generation interpretability model-explainability primary-attributions pytorch text2image transformers

Last synced: 05 Apr 2025

https://github.com/baaivision/diva

[ICLR 2025] Diffusion Feedback Helps CLIP See Better

clip diffusion visual-perception

Last synced: 08 Oct 2025

https://github.com/byliutao/1Prompt1Story

🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt

diffusion diffusion-model diffusion-models storytelling text-to-image

Last synced: 13 Oct 2025

https://github.com/RQ-Wu/LAMP

[CVPR 2024] | LAMP: Learn a Motion Pattern for Few-Shot Based Video Generation

aigc diffusion diffusion-model diffusion-models few-shot-learning stable-diffusion text-to-video video-editing

Last synced: 28 Mar 2025

https://github.com/ZichengDuan/TheChosenOne

Unofficial implementation of the paper "The Chosen One: Consistent Characters in Text-to-Image Diffusion Models"

deep-learning diffusion dinov2 generative-art generative-model

Last synced: 27 Mar 2025

https://sirui-xu.github.io/InterDiff/

[ICCV 2023] Official PyTorch implementation of the paper "InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion"

3d-human-pose 6d deep-learning diffusion diffusion-models generative-ai generative-model human-motion-prediction human-object-interaction human-scene-interaction motion-prediction object-pose

Last synced: 23 Apr 2025

https://github.com/Sirui-Xu/InterDiff

[ICCV 2023] Official PyTorch implementation of the paper "InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion"

3d-human-pose 6d deep-learning diffusion diffusion-models generative-ai generative-model human-motion-prediction human-object-interaction human-scene-interaction motion-prediction object-pose

Last synced: 23 Apr 2025

https://github.com/keonlee9420/diffsinger

PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)

ddpm diffsinger diffusion diffusion-models english fastspeech neural-tts non-autoregressive pytorch singing-voice speech-synthesis text-to-speech tts

Last synced: 12 Oct 2025

https://github.com/jiauzhang/dragdiffusion

Implementation of DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing

diffusion draggan image-editing

Last synced: 07 Oct 2025

https://github.com/mihirp1998/VADER

Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various reward models such as HPS, PickScore, VideoMAE, VJEPA, YOLO, Aesthetics etc.

alignment diffusion reinforcement-learning reinforcement-learning-human-feedback rl rlhf vader video-diffusion video-diffusion-alignment

Last synced: 28 Mar 2025

https://github.com/ssube/onnx-web

web UI for GPU-accelerated ONNX pipelines like Stable Diffusion, even on Windows and AMD

ai-art amd diffusion esrgan flask generative-art gfpgan image-generation linux nvidia python pytorch reactjs stable-diffusion super-resolution text2image upscaling web-app windows

Last synced: 16 May 2025