An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with diffusion

A curated list of projects in awesome lists tagged with diffusion .

https://github.com/sgl-project/sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

attention blackwell cuda deepseek diffusion glm gpt-oss inference llama llm minimax moe qwen qwen-image reinforcement-learning transformer vlm wan

Last synced: 16 May 2026

https://github.com/huggingface/peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

adapter diffusion llm lora parameter-efficient-learning python pytorch transformers

Last synced: 12 May 2025

https://github.com/datawhalechina/leedl-tutorial

《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases

bert chatgpt cnn deep-learning diffusion gan leedl-tutorial machine-learning network-compression pruning reinforcement-learning rnn self-attention transfer-learning transformer tutorial

Last synced: 14 May 2025

https://github.com/easydiffusion/easydiffusion

An easy 1-click way to create beautiful artwork on your PC using AI, with no tech knowledge. Provides a browser UI for generating images from text prompts and images. Just enter your text prompt, and see the generated image.

art diffusion generative-art gui stable

Last synced: 11 May 2025

https://github.com/cloneofsimo/lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

diffusion dreambooth fine-tuning lora stable-diffusion

Last synced: 14 May 2025

https://github.com/open-mmlab/mmagic

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.

aigc computer-vision deep-learning diffusion diffusion-models generative-adversarial-network generative-ai image-editing image-generation image-processing image-synthesis inpainting matting pytorch super-resolution text2image video-frame-interpolation video-interpolation video-super-resolution

Last synced: 13 Dec 2025

https://github.com/NVlabs/Sana

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

diffusion dit pytorch sana text-to-image-generation transformers

Last synced: 07 Aug 2025

https://github.com/riffusion/riffusion-hobby

Stable diffusion for real-time music generation

ai audio diffusers diffusion music stable-diffusion

Last synced: 12 Jan 2026

https://github.com/williamyang1991/rerender_a_video

[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation

controlnet diffusion video-processing

Last synced: 15 May 2025

https://github.com/williamyang1991/Rerender_A_Video

[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation

controlnet diffusion video-processing

Last synced: 11 Apr 2025

https://github.com/openvpi/DiffSinger

An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism

acoustic-model diffusion diffussion-model melody-frontend midi pitch-prediction rectified-flow singing-voice singing-voice-synthesis svs

Last synced: 02 Apr 2025

https://github.com/ai-forever/kandinsky-2

Kandinsky 2 — multilingual text2image latent diffusion model

diffusion image-generation image2image inpainting ipython-notebook kandinsky outpainting text-to-image text2image

Last synced: 15 May 2025

https://github.com/datawhalechina/tiny-universe

《大模型白盒子构建指南》:一个全手搓的Tiny-Universe

agent diffusion evaluation-metrics llama qwen rag transformers

Last synced: 14 May 2025

https://github.com/ai-forever/Kandinsky-2

Kandinsky 2 — multilingual text2image latent diffusion model

diffusion image-generation image2image inpainting ipython-notebook kandinsky outpainting text-to-image text2image

Last synced: 08 Apr 2025

https://github.com/openvpi/diffsinger

An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism

acoustic-model diffusion diffussion-model melody-frontend midi pitch-prediction rectified-flow singing-voice singing-voice-synthesis svs

Last synced: 08 Jan 2026

https://github.com/playvoice/whisper-vits-svc

Core Engine of Singing Voice Conversion & Singing Voice Clone

change diff-svc diffusion diffusion-svc singing-voice-conversion sovits svc vits vits2 voice

Last synced: 14 May 2025

https://github.com/tmelyralab/musev

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

diffusion human-video-generation image2video infinite-length musev video-generation

Last synced: 26 Sep 2025

https://github.com/TMElyralab/MuseV

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

diffusion human-video-generation image2video infinite-length musev video-generation

Last synced: 11 Apr 2025

https://github.com/riffusion/riffusion-app-hobby

Stable diffusion for real-time music generation (web app)

ai audio diffusion music nextjs stable-diffusion threejs

Last synced: 15 May 2025

https://github.com/PlayVoice/whisper-vits-svc

Core Engine of Singing Voice Conversion & Singing Voice Clone

change diff-svc diffusion diffusion-svc singing-voice-conversion sovits svc vits vits2 voice

Last synced: 30 Aug 2025

https://github.com/prs-eth/marigold

[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

diffusion in-the-wild monocular-depth-estimation zero-shot

Last synced: 14 May 2025

https://github.com/prs-eth/Marigold

[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

diffusion in-the-wild monocular-depth-estimation zero-shot

Last synced: 28 Mar 2025

https://github.com/Alpha-VLLM/Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

aigc diffusion diffusion-model diffusion-models diffusion-transformer generation-models transformer transformers

Last synced: 24 Mar 2025

https://github.com/alpha-vllm/lumina-t2x

Lumina-T2X is a unified framework for Text to Any Modality Generation

aigc diffusion diffusion-model diffusion-models diffusion-transformer generation-models transformer transformers

Last synced: 11 Apr 2025

https://github.com/foundationvision/llamagen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

auto-regressive-model diffusion diffusion-models image-generation llama llm text2image

Last synced: 15 May 2025

https://github.com/FoundationVision/LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

auto-regressive-model diffusion diffusion-models image-generation llama llm text2image

Last synced: 07 May 2025

https://github.com/varunshenoy/opendream

An extensible, easy-to-use, and portable diffusion web UI 👨‍🎨

ai automatic-1111 diffusion image-generation stable-diffusion

Last synced: 08 Apr 2025

https://github.com/maks-s/sd-akashic

A compendium of informations regarding Stable Diffusion (SD)

diffusion guide stable-diffusion

Last synced: 16 May 2025

https://github.com/Maks-s/sd-akashic

A compendium of informations regarding Stable Diffusion (SD)

diffusion guide stable-diffusion

Last synced: 24 Mar 2025

https://github.com/nvidia/cosmos-tokenizer

A suite of image and video neural tokenizers

diffusion tokenization transformers

Last synced: 30 Oct 2025

https://github.com/tencentarc/brushnet

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

diffusion diffusion-models eccv eccv2024 image-inpainting text-to-image

Last synced: 14 May 2025

https://github.com/TencentARC/BrushNet

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

diffusion diffusion-models eccv eccv2024 image-inpainting text-to-image

Last synced: 28 Mar 2025

https://github.com/huggingface/finetrainers

Scalable and memory-optimized training of diffusion models

ai art artificial-intelligence diffusers diffusion diffusion-models pytorch transformers

Last synced: 14 Dec 2025

https://github.com/mini-sora/minisora

MiniSora: A community aims to explore the implementation path and future development direction of Sora.

diffusion sora video-generation

Last synced: 14 May 2025

https://github.com/0xCrunchyy/10x

Optimized inference and fine-tuning framework for diffusion (image & video) models. Up to 3x faster & 80% less VRAM.

artificial-inteligence diffusion diffusion-models fine-tuning flux gpt inference lora pytorch sdxl

Last synced: 09 Jan 2026

https://github.com/River-Zhang/ICEdit

Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Training released! Surpasses GPT-4o in ID persistence! Official ComfyUI workflow release! Only 4GB VRAM is enough to run!

diffusion diffusion-models diffusion-transformer dit editing-image gpt4o gpt4oimage image-editing in-context

Last synced: 12 Jun 2025

https://github.com/a-r-r-o-w/finetrainers

Scalable and memory-optimized training of diffusion models

ai art artificial-intelligence diffusers diffusion diffusion-models pytorch transformers

Last synced: 14 May 2025

https://github.com/thu-lyj-lab/t3bench

T3Bench: Benchmarking Current Progress in Text-to-3D Generation

3d diffusion nerf text-to-3d

Last synced: 16 May 2025

https://github.com/declare-lab/tango

A family of diffusion models for text-to-audio generation.

audio-generation diffusion diffusion-models language-models large-language-models text-to-audio

Last synced: 16 May 2025

https://github.com/EdVince/Stable-Diffusion-NCNN

Stable Diffusion in NCNN with c++, supported txt2img and img2img

android clip cpp diffusion executable img2img mnn ncnn onnx stable-diffusion tensorrt tnn txt2img

Last synced: 13 Apr 2025

https://github.com/cloneofsimo/mindiffusion

Self-contained, minimalistic implementation of diffusion models with Pytorch.

diffusion pytorch

Last synced: 13 Apr 2025

https://github.com/cloneofsimo/minDiffusion

Self-contained, minimalistic implementation of diffusion models with Pytorch.

diffusion pytorch

Last synced: 27 Mar 2025

https://github.com/castorini/daam

Diffusion attentive attribution maps for interpreting Stable Diffusion.

diffusion explainable-ai generative-ai huggingface pytorch stable-diffusion

Last synced: 16 May 2025

https://github.com/fboulnois/stable-diffusion-docker

Run the official Stable Diffusion releases in a Docker container with txt2img, img2img, depth2img, pix2pix, upscale4x, and inpaint.

dall-e dalle diffusion docker generative-art huggingface image-generation inpainting midjourney pytorch stable-diffusion tensorflow text-to-image

Last synced: 13 Apr 2025

https://github.com/pku-yuangroup/consisid

[CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition

diffusion diffusion-models identity-preserving text-to-video video-generation video-generation-dataset video-generator videogeneration

Last synced: 06 Jul 2025

https://github.com/thu-ml/riflex

Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025)

cogvideox diffusion diffusion-models diffusion-transformer dit extrapolation generative-model hunyuan-video long-video-generation position-embedding rope video-generation

Last synced: 01 Jul 2025

https://github.com/cloneofsimo/paint-with-words-sd

Implementation of Paint-with-words with Stable Diffusion : method from eDiff-I that let you generate image from text-labeled segmentation map.

diffusion generative-model stable-diffusion

Last synced: 05 Apr 2025

https://github.com/some9000/StylePile

A prompt generation helper script for AUTOMATIC1111/stable-diffusion-webui and compatible forks

diffusion generation generator promt stable

Last synced: 08 May 2025

https://github.com/omriav/blended-diffusion

Official implementation for "Blended Diffusion for Text-driven Editing of Natural Images" [CVPR 2022]

blended-diffusion deep-learning diffusion multimodal openai openai-clip text-guided-manipulation text-to-image

Last synced: 28 Mar 2025

https://github.com/williamyang1991/fresco

[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation

controlnet diffusion video-processing

Last synced: 05 Apr 2025

https://github.com/williamyang1991/FRESCO

[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation

controlnet diffusion video-processing

Last synced: 28 Mar 2025

https://github.com/microsoft/foldingdiff

Diffusion models of protein structure; trigonometry and attention are all you need!

diffusion diffusion-models protein protein-structure-generation proteins transformer

Last synced: 30 Mar 2025

https://github.com/dromara/omega-ai

Omega-AI:基于java打造的深度学习框架,帮助你快速搭建神经网络,实现模型推理与训练,引擎支持自动求导,多线程与GPU运算,GPU支持CUDA,CUDNN。

ai deeplearning diffusion llm neural-network yolo

Last synced: 08 Jul 2025

https://github.com/LeCAR-Lab/dial-mpc

Official implementation for the paper "Full-Order Sampling-Based MPC for Torque-Level Locomotion Control via Diffusion-Style Annealing". DIAL-MPC is a novel sampling-based MPC framework for legged robot full-order torque-level control with both precision and agility in a training-free manner.

diffusion humanoid legged-robots mpc online-control optimal-control quadruped sampling-based-control

Last synced: 18 Oct 2025

https://github.com/afiaka87/clip-guided-diffusion

A CLI tool/python module for generating images from text using guided diffusion and CLIP from OpenAI.

artificial-intelligence deep-learning diffusion image-generation multimodal multimodality openai openai-clip text-to-image text-to-image-synthesis

Last synced: 05 Aug 2025

https://github.com/huggingface/open-muse

Open reproduction of MUSE for fast text2image generation.

cv deep-learning diffusion generative-art nlp text2image transformer

Last synced: 14 Oct 2025

https://github.com/scenediffuser/Scene-Diffuser

Official implementation of CVPR23 paper "Diffusion-based Generation, Optimization, and Planning in 3D Scenes"

3d-scene-understanding diffusion generative-model

Last synced: 27 Apr 2025

https://github.com/HaozheLiu-ST/T-GATE

T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!

cross-attention cross-attention-diffusers diffusers diffusion efficiency inference pytorch text2image training-free transformer

Last synced: 13 Mar 2025

https://github.com/AILab-CVC/FreeNoise

[ICLR 2024] Code for FreeNoise based on VideoCrafter

aigc diffusion generative-model video-diffusion-model

Last synced: 28 Mar 2025

https://github.com/ailab-cvc/freenoise

[ICLR 2024] Code for FreeNoise based on VideoCrafter

aigc diffusion generative-model video-diffusion-model

Last synced: 06 Apr 2025

https://github.com/p1atdev/leco

Low-rank adaptation for Erasing COncepts from diffusion models.

diffusion lora stable-diffusion

Last synced: 06 Apr 2025

https://github.com/rehglab/rave

RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models [CVPR 2024]

diffusion stable-diffusion video-editing

Last synced: 27 Jan 2026

https://github.com/nianticlabs/diffusionerf

[CVPR 2023] DiffusioNeRF: Regularizing Neural Radiance Fields with Denoising Diffusion Models

deep-learning diffusion diffusion-models nerf neuralradiance-fields radiance-field reconstruction regularization

Last synced: 07 Apr 2025

https://github.com/zibojia/COCOCO

Video-Inpaint-Anything: This is the inference code for our paper CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility.

cococo diffusion inpainting pytorch sam2 segment segment-anything text-guided text-guided-video-inpainting video-inpainting video-inpainting-with-prompt video-sam2-inpaint

Last synced: 24 Jul 2025

https://github.com/yeungchenwa/FontDiffuser

[AAAI2024] FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning

deep-learning diffusers diffusion font-generation image-generation

Last synced: 28 Mar 2025

https://github.com/RehgLab/RAVE

RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models [CVPR 2024]

diffusion stable-diffusion video-editing

Last synced: 28 Mar 2025

https://github.com/joaolages/diffusers-interpret

Diffusers-Interpret 🤗🧨🕵️‍♀️: Model explainability for 🤗 Diffusers. Get explanations for your generated images.

computer-vision deep-learning diffusers diffusion explainable-ai image-generation interpretability model-explainability primary-attributions pytorch text2image transformers

Last synced: 05 Apr 2025

https://github.com/baaivision/diva

[ICLR 2025] Diffusion Feedback Helps CLIP See Better

clip diffusion visual-perception

Last synced: 08 Oct 2025

https://github.com/byliutao/1Prompt1Story

🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt

diffusion diffusion-model diffusion-models storytelling text-to-image

Last synced: 13 Oct 2025

https://github.com/RQ-Wu/LAMP

[CVPR 2024] | LAMP: Learn a Motion Pattern for Few-Shot Based Video Generation

aigc diffusion diffusion-model diffusion-models few-shot-learning stable-diffusion text-to-video video-editing

Last synced: 28 Mar 2025

https://github.com/ZichengDuan/TheChosenOne

Unofficial implementation of the paper "The Chosen One: Consistent Characters in Text-to-Image Diffusion Models"

deep-learning diffusion dinov2 generative-art generative-model

Last synced: 27 Mar 2025

https://github.com/Sirui-Xu/InterDiff

[ICCV 2023] Official PyTorch implementation of the paper "InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion"

3d-human-pose 6d deep-learning diffusion diffusion-models generative-ai generative-model human-motion-prediction human-object-interaction human-scene-interaction motion-prediction object-pose

Last synced: 23 Apr 2025

https://sirui-xu.github.io/InterDiff/

[ICCV 2023] Official PyTorch implementation of the paper "InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion"

3d-human-pose 6d deep-learning diffusion diffusion-models generative-ai generative-model human-motion-prediction human-object-interaction human-scene-interaction motion-prediction object-pose

Last synced: 23 Apr 2025

https://github.com/keonlee9420/diffsinger

PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)

ddpm diffsinger diffusion diffusion-models english fastspeech neural-tts non-autoregressive pytorch singing-voice speech-synthesis text-to-speech tts

Last synced: 12 Oct 2025

https://github.com/jiauzhang/dragdiffusion

Implementation of DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing

diffusion draggan image-editing

Last synced: 07 Oct 2025