An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with diffusion-models

A curated list of projects in awesome lists tagged with diffusion-models .

https://github.com/foundationvision/var

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

auto-regressive-model autoregressive-models diffusion-models generative-ai generative-model gpt gpt-2 image-generation large-language-models neurips transformers vision-transformer

Last synced: 10 Apr 2025

https://github.com/open-mmlab/mmagic

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.

aigc computer-vision deep-learning diffusion diffusion-models generative-adversarial-network generative-ai image-editing image-generation image-processing image-synthesis inpainting matting pytorch super-resolution text2image video-frame-interpolation video-interpolation video-super-resolution

Last synced: 13 Dec 2025

https://github.com/yl4579/styletts2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

adversarial-training deep-learning diffusion-models gan latent-diffusion latent-diffusion-models pytorch speaker-adaptation speech-synthesis text-to-speech tts wavlm

Last synced: 14 May 2025

https://github.com/yl4579/StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

adversarial-training deep-learning diffusion-models gan latent-diffusion latent-diffusion-models pytorch speaker-adaptation speech-synthesis text-to-speech tts wavlm

Last synced: 09 Apr 2025

https://github.com/fanghua-yu/supir

SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.

deep-learning diffusion-models llava pytorch pytorch-lightning restoration sdxl stable-diffusion super-resolution

Last synced: 14 May 2025

https://github.com/Fanghua-Yu/SUPIR

SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.

deep-learning diffusion-models llava pytorch pytorch-lightning restoration sdxl stable-diffusion super-resolution

Last synced: 27 Mar 2025

https://github.com/FoundationVision/VAR

[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

auto-regressive-model autoregressive-models diffusion-models generative-ai generative-model gpt gpt-2 image-generation large-language-models neurips transformers vision-transformer

Last synced: 03 Apr 2025

https://github.com/nunchaku-ai/nunchaku

[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

comfyui diffusion-models flux genai iclr iclr2025 lora mlsys quantization

Last synced: 25 Jan 2026

https://github.com/SandAI-org/MAGI-1

MAGI-1: Autoregressive Video Generation at Scale

autoregressive diffusion-models video-generation

Last synced: 13 Jun 2025

https://github.com/ali-vilab/vgen

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

diffusion-models video-synthesis

Last synced: 14 May 2025

https://github.com/ali-vilab/VGen

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

diffusion-models video-synthesis

Last synced: 28 Mar 2025

https://github.com/ali-vilab/i2vgen-xl

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

diffusion-models video-synthesis

Last synced: 28 Mar 2025

https://github.com/deepseek-ai/dreamcraft3d

[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior

3d-creation 3d-generation aigc diffusion-models generative-model image-to-3d

Last synced: 15 May 2025

https://github.com/deepseek-ai/DreamCraft3D

[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior

3d-creation 3d-generation aigc diffusion-models generative-model image-to-3d

Last synced: 15 Oct 2025

https://github.com/bghira/SimpleTuner

A general fine-tuning kit geared toward diffusion models.

diffusers diffusion-models fine-tuning flux-dev machine-learning stable-diffusion

Last synced: 29 Jul 2025

https://github.com/tencent/MimicMotion

High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance

diffusion-models video-generation

Last synced: 11 May 2025

https://github.com/tencent/mimicmotion

High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance

diffusion-models video-generation

Last synced: 15 May 2025

https://github.com/andreas128/repaint

Official PyTorch Code and Models of "RePaint: Inpainting using Denoising Diffusion Probabilistic Models", CVPR 2022

cvpr2022 diffusion-models inpainting

Last synced: 08 Apr 2025

https://github.com/alpha-vllm/lumina-t2x

Lumina-T2X is a unified framework for Text to Any Modality Generation

aigc diffusion diffusion-model diffusion-models diffusion-transformer generation-models transformer transformers

Last synced: 11 Apr 2025

https://github.com/Alpha-VLLM/Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

aigc diffusion diffusion-model diffusion-models diffusion-transformer generation-models transformer transformers

Last synced: 24 Mar 2025

https://github.com/andreas128/RePaint

Official PyTorch Code and Models of "RePaint: Inpainting using Denoising Diffusion Probabilistic Models", CVPR 2022

cvpr2022 diffusion-models inpainting

Last synced: 20 Jul 2025

https://github.com/open-mmlab/mmgeneration

MMGeneration is a powerful toolkit for generative models, based on PyTorch and MMCV.

diffusion-models gan generative generative-adversarial-network mmcv openmmlab pytorch

Last synced: 15 May 2025

https://github.com/adobe-research/custom-diffusion

Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)

computer-vision customization diffusion-models few-shot fine-tuning pytorch text-to-image-generation

Last synced: 15 May 2025

https://github.com/sudo-ai-3d/zero123plus

Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.

3d 3d-graphics aigc diffusers diffusion-models image-to-3d research-project text-to-3d

Last synced: 15 May 2025

https://github.com/yang-song/score_sde_pytorch

PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

controllable-generation diffusion-models generative-models iclr-2021 inverse-problems pytorch score-based-generative-modeling score-matching stochastic-differential-equations

Last synced: 15 May 2025

https://github.com/junshutang/make-it-3d

[ICCV 2023] Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior

3d-generation 3d-vision computer-vision deep-learning diffusion-models generative-art nerf

Last synced: 15 May 2025

https://github.com/CryptoAILab/Awesome-LM-SSP

A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).

adversarial-attacks awesome-list diffusion-models jailbreak language-model llm nlp privacy safety security vlm

Last synced: 18 Jan 2026

https://github.com/junshutang/Make-It-3D

[ICCV 2023] Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior

3d-generation 3d-vision computer-vision deep-learning diffusion-models generative-art nerf

Last synced: 07 May 2025

https://github.com/SUDO-AI-3D/zero123plus

Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.

3d 3d-graphics aigc diffusers diffusion-models image-to-3d research-project text-to-3d

Last synced: 24 Jul 2025

https://github.com/foundationvision/llamagen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

auto-regressive-model diffusion diffusion-models image-generation llama llm text2image

Last synced: 15 May 2025

https://github.com/bghira/simpletuner

A general fine-tuning kit geared toward diffusion models.

diffusers diffusion-models fine-tuning flux-dev machine-learning stable-diffusion

Last synced: 29 Jan 2026

https://github.com/FoundationVision/LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

auto-regressive-model diffusion diffusion-models image-generation llama llm text2image

Last synced: 07 May 2025

https://github.com/mit-han-lab/nunchaku

[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

comfyui diffusion-models flux genai iclr iclr2025 lora mlsys quantization

Last synced: 13 May 2025

https://github.com/luchengthu/dpm-solver

Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (Neurips 2022 Oral)

diffusion-models machine-learning score-based-generative-models stable-diffusion

Last synced: 15 May 2025

https://github.com/LuChengTHU/dpm-solver

Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (Neurips 2022 Oral)

diffusion-models machine-learning score-based-generative-models stable-diffusion

Last synced: 04 Apr 2025

https://github.com/eloialonso/diamond

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

artificial-intelligence atari deep-learning diffusion-models machine-learning reinforcement-learning research world-models

Last synced: 26 Feb 2025

https://github.com/guochengqian/Magic123

[ICLR24] Official PyTorch Implementation of Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors

3d-generation diffusion-models image-to-3d

Last synced: 22 Jul 2025

https://github.com/guochengqian/magic123

[ICLR24] Official PyTorch Implementation of Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors

3d-generation diffusion-models image-to-3d

Last synced: 08 Apr 2025

https://github.com/tencentarc/brushnet

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

diffusion diffusion-models eccv eccv2024 image-inpainting text-to-image

Last synced: 14 May 2025

https://github.com/menyifang/MIMO

Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"

character-animation diffusion-models video-synthesis

Last synced: 01 Oct 2025

https://github.com/yang-song/score_sde

Official code for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

controllable-generation diffusion-models flax generative-models iclr-2021 inverse-problems jax score-based-generative-modeling score-matching stochastic-differential-equations

Last synced: 16 May 2025

https://github.com/TencentARC/BrushNet

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

diffusion diffusion-models eccv eccv2024 image-inpainting text-to-image

Last synced: 28 Mar 2025

https://github.com/zai-org/ImageReward

[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation

diffusion-models generative-model human-preferences rlhf

Last synced: 29 Dec 2025

https://github.com/THUDM/ImageReward

[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation

diffusion-models generative-model human-preferences rlhf

Last synced: 28 Mar 2025

https://github.com/bloc97/crossattentioncontrol

Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion

cross-attention deep-learning diffusion-models stable-diffusion

Last synced: 16 May 2025

https://github.com/bloc97/CrossAttentionControl

Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion

cross-attention deep-learning diffusion-models stable-diffusion

Last synced: 03 Apr 2025

https://github.com/huggingface/finetrainers

Scalable and memory-optimized training of diffusion models

ai art artificial-intelligence diffusers diffusion diffusion-models pytorch transformers

Last synced: 14 Dec 2025

https://github.com/davidadsp/generative_deep_learning_2nd_edition

The official code repository for the second edition of the O'Reilly book Generative Deep Learning: Teaching Machines to Paint, Write, Compose and Play.

chatgpt dalle2 data-science deep-learning diffusion-models generative-adversarial-network gpt-3 machine-learning python stable-diffusion tensorflow

Last synced: 06 Oct 2025

https://github.com/wyhuai/ddnm

[ICLR 2023 Oral] Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model

diffusion-models iclr iclr2023 image-restoration zero-shot

Last synced: 16 May 2025

https://github.com/davidADSP/Generative_Deep_Learning_2nd_Edition

The official code repository for the second edition of the O'Reilly book Generative Deep Learning: Teaching Machines to Paint, Write, Compose and Play.

chatgpt dalle2 data-science deep-learning diffusion-models generative-adversarial-network gpt-3 machine-learning python stable-diffusion tensorflow

Last synced: 01 May 2025

https://github.com/0xCrunchyy/10x

Optimized inference and fine-tuning framework for diffusion (image & video) models. Up to 3x faster & 80% less VRAM.

artificial-inteligence diffusion diffusion-models fine-tuning flux gpt inference lora pytorch sdxl

Last synced: 09 Jan 2026

https://github.com/wyhuai/DDNM

[ICLR 2023 Oral] Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model

diffusion-models iclr iclr2023 image-restoration zero-shot

Last synced: 27 Mar 2025

https://github.com/gcorso/diffdock

Implementation of DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking

binding computational-biology diffusion-models docking equivariance machine-learning non-euclidean-geometry score-based-models

Last synced: 12 Apr 2025

https://github.com/yujun-shi/dragdiffusion

[CVPR2024, Highlight] Official code for DragDiffusion

artificial-intelligence cvpr2024 diffusion-models dragdiffusion draggan image-editing

Last synced: 16 May 2025

https://github.com/gcorso/DiffDock

Implementation of DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking

binding computational-biology diffusion-models docking equivariance machine-learning non-euclidean-geometry score-based-models

Last synced: 30 Mar 2025

https://github.com/Yujun-Shi/DragDiffusion

[CVPR2024, Highlight] Official code for DragDiffusion

artificial-intelligence cvpr2024 diffusion-models dragdiffusion draggan image-editing

Last synced: 27 Mar 2025

https://github.com/muzishen/imagdressing

[AAAI 2025]👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high fidelity and garment consistency for virtual dressing.

datasets diffusion-models text-to-image-generation try-on

Last synced: 16 May 2025

https://github.com/muzishen/IMAGDressing

[AAAI 2025]👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high fidelity and garment consistency for virtual dressing.

datasets diffusion-models text-to-image-generation try-on

Last synced: 27 Mar 2025

https://github.com/Zheng-Chong/CatVTON

[ICLR 2025] CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) and 3) Simplified Inference (< 8G VRAM for 1024X768 resolution).

diffusion-models fashion try-on

Last synced: 27 Mar 2025

https://github.com/River-Zhang/ICEdit

Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Training released! Surpasses GPT-4o in ID persistence! Official ComfyUI workflow release! Only 4GB VRAM is enough to run!

diffusion diffusion-models diffusion-transformer dit editing-image gpt4o gpt4oimage image-editing in-context

Last synced: 12 Jun 2025

https://github.com/a-r-r-o-w/finetrainers

Scalable and memory-optimized training of diffusion models

ai art artificial-intelligence diffusers diffusion diffusion-models pytorch transformers

Last synced: 14 May 2025

https://github.com/declare-lab/tango

A family of diffusion models for text-to-audio generation.

audio-generation diffusion diffusion-models language-models large-language-models text-to-audio

Last synced: 16 May 2025

https://github.com/openbmb/viscpm

[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列

diffusion-models large-language-models multimodal transformers

Last synced: 16 May 2025

https://github.com/OpenBMB/VisCPM

[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列

diffusion-models large-language-models multimodal transformers

Last synced: 16 Apr 2025

https://github.com/omerbt/multidiffusion

Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" (ICML 2023)

diffusion-models generative-model icml image-generation multidiffusion stable-diffusion text-to-image

Last synced: 16 May 2025

https://github.com/wladradchenko/wunjo.wladradchenko.ru

Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, Video Generation. Open Source, Local & Free.

controlnet deepfake diffusion-models face-animation face-swap free img2video lip-sync photo-editing public-api remove-background remover restyle segment-anything txt2video video-editing video-generation voice-clone wunjo

Last synced: 10 May 2025

https://github.com/lukasHoel/text2room

Text2Room generates textured 3D meshes from a given text prompt using 2D text-to-image models (ICCV2023).

3d-generation diffusion-models mesh-generation text-to-image

Last synced: 24 Mar 2025

https://github.com/hymie122/RAG-Survey

Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".

aigc diffusion-models llm multimodality rag survey

Last synced: 20 Apr 2025

https://github.com/3DTopia/3DTopia-XL

[CVPR 2025 Highlight] 3DTopia-XL: High-Quality 3D PBR Asset Generation via Primitive Diffusion

3d-generation cvpr cvpr2025 diffusion-models image-to-3d text-to-3d

Last synced: 25 Oct 2025

https://github.com/open-mmlab/pia

[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videos. PIA,你的个性化图像动画生成器,利用文本提示将图像变为奇妙的动画

aigc animation diffusion-models image-to-video image-to-video-generation personalized-generation stable-diffusion

Last synced: 16 May 2025

https://github.com/omerbt/MultiDiffusion

Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" (ICML 2023)

diffusion-models generative-model icml image-generation multidiffusion stable-diffusion text-to-image

Last synced: 28 Mar 2025

https://github.com/3dtopia/3dtopia-xl

[CVPR 2025 Highlight] 3DTopia-XL: High-Quality 3D PBR Asset Generation via Primitive Diffusion

3d-generation cvpr cvpr2025 diffusion-models image-to-3d text-to-3d

Last synced: 15 May 2025

https://github.com/open-mmlab/PIA

[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videos. PIA,你的个性化图像动画生成器,利用文本提示将图像变为奇妙的动画

aigc animation diffusion-models image-to-video image-to-video-generation personalized-generation stable-diffusion

Last synced: 28 Mar 2025

https://github.com/UCSC-VLAA/story-iter

[ICLR 2026] A Training-free Iterative Framework for Long Story Visualization

diffusion-models generative-art generative-model image-generation storytelling visual-storytelling

Last synced: 08 Feb 2026

https://github.com/cure-lab/magicdrive

[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”

autonomous-vehicles deep-learning diffusion-models image-generation pytorch video-generation

Last synced: 15 May 2025

https://github.com/showlab/motiondirector

[ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.

diffusion-models motion-customization text-to-motion text-to-video text-to-video-generation video-generation

Last synced: 12 Apr 2025

https://github.com/showlab/MotionDirector

[ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.

diffusion-models motion-customization text-to-motion text-to-video text-to-video-generation video-generation

Last synced: 28 Mar 2025

https://showlab.github.io/MotionDirector/

[ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.

diffusion-models motion-customization text-to-motion text-to-video text-to-video-generation video-generation

Last synced: 28 Mar 2025

https://github.com/phizaz/diffae

Official implementation of Diffusion Autoencoders

autoencoder cvpr2022 deep-learning diffusion-models ffhq latent-variable-models lsun

Last synced: 16 May 2025

https://github.com/liuyuan-pal/SyncDreamer

[ICLR 2024 Spotlight] SyncDreamer: Generating Multiview-consistent Images from a Single-view Image

3d-reconstruction diffusion-models generative-model single-view-reconstruction

Last synced: 07 Apr 2025

https://github.com/horseee/deepcache

[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free

diffusion-models efficient-inference model-compression stable-diffusion training-free

Last synced: 15 May 2025

https://github.com/eric-ai-lab/MiniGPT-5

Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"

diffusion-models multimodal-generation multimodal-llm transformers

Last synced: 07 May 2025

https://liewfeng.github.io/TeaCache/

Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model

cogvideox diffusion-models hunyuan-video inference-acceleration latte open-sora open-sora-plan video-generation

Last synced: 01 Aug 2025

https://github.com/ali-vilab/videocomposer

Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability

diffusion-models video-generation video-synthesiswith

Last synced: 29 Dec 2025

https://github.com/showlab/show-o

Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

diffusion-models large-language-models multimodal

Last synced: 07 Apr 2025