Projects in Awesome Lists tagged with diffusion-models
A curated list of projects in awesome lists tagged with diffusion-models.
https://github.com/openvinotoolkit/openvino
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
ai computer-vision deep-learning deploy-ai diffusion-models generative-ai good-first-issue inference llm-inference natural-language-processing nlp openvino optimize-ai performance-boost recommendation-system speech-recognition stable-diffusion transformers yolo
Last synced: 12 May 2025
https://github.com/foundationvision/var
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
auto-regressive-model autoregressive-models diffusion-models generative-ai generative-model gpt gpt-2 image-generation large-language-models neurips transformers vision-transformer
Last synced: 10 Apr 2025
https://github.com/Lightricks/LTX-Video
Official repository for LTX-Video
diffusion-models dit image-to-video image-to-video-generation text-to-video text-to-video-generation
Last synced: 18 Jul 2025
https://github.com/open-mmlab/mmagic
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awesome model zoo, and diffusion models for text-to-image generation, image/video restoration/enhancement, etc.
aigc computer-vision deep-learning diffusion diffusion-models generative-adversarial-network generative-ai image-editing image-generation image-processing image-synthesis inpainting matting pytorch super-resolution text2image video-frame-interpolation video-interpolation video-super-resolution
Last synced: 13 Dec 2025
https://github.com/yl4579/styletts2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
adversarial-training deep-learning diffusion-models gan latent-diffusion latent-diffusion-models pytorch speaker-adaptation speech-synthesis text-to-speech tts wavlm
Last synced: 14 May 2025
https://github.com/yl4579/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
adversarial-training deep-learning diffusion-models gan latent-diffusion latent-diffusion-models pytorch speaker-adaptation speech-synthesis text-to-speech tts wavlm
Last synced: 09 Apr 2025
https://github.com/fanghua-yu/supir
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
deep-learning diffusion-models llava pytorch pytorch-lightning restoration sdxl stable-diffusion super-resolution
Last synced: 14 May 2025
https://github.com/Fanghua-Yu/SUPIR
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
deep-learning diffusion-models llava pytorch pytorch-lightning restoration sdxl stable-diffusion super-resolution
Last synced: 27 Mar 2025
https://github.com/FoundationVision/VAR
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
auto-regressive-model autoregressive-models diffusion-models generative-ai generative-model gpt gpt-2 image-generation large-language-models neurips transformers vision-transformer
Last synced: 03 Apr 2025
https://github.com/lightricks/ltx-video
Official repository for LTX-Video
diffusion-models dit image-to-video image-to-video-generation text-to-video text-to-video-generation
Last synced: 14 May 2025
https://github.com/nunchaku-ai/nunchaku
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
comfyui diffusion-models flux genai iclr iclr2025 lora mlsys quantization
Last synced: 25 Jan 2026
https://github.com/SandAI-org/MAGI-1
MAGI-1: Autoregressive Video Generation at Scale
autoregressive diffusion-models video-generation
Last synced: 13 Jun 2025
https://github.com/yangling0818/diffusion-models-papers-survey-taxonomy
Diffusion model papers, survey, and taxonomy
diffusion-models stable-diffusion survey text-to-3d text-to-image text-to-video
Last synced: 14 May 2025
https://github.com/YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy
Diffusion model papers, survey, and taxonomy
diffusion-models stable-diffusion survey text-to-3d text-to-image text-to-video
Last synced: 26 Mar 2025
https://github.com/ali-vilab/vgen
Official repo for VGen: a holistic video generation ecosystem built on diffusion models
diffusion-models video-synthesis
Last synced: 14 May 2025
https://github.com/ali-vilab/VGen
Official repo for VGen: a holistic video generation ecosystem built on diffusion models
diffusion-models video-synthesis
Last synced: 28 Mar 2025
https://github.com/ali-vilab/i2vgen-xl
Official repo for VGen: a holistic video generation ecosystem built on diffusion models
diffusion-models video-synthesis
Last synced: 28 Mar 2025
https://github.com/deepseek-ai/dreamcraft3d
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
3d-creation 3d-generation aigc diffusion-models generative-model image-to-3d
Last synced: 15 May 2025
https://github.com/deepseek-ai/DreamCraft3D
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
3d-creation 3d-generation aigc diffusion-models generative-model image-to-3d
Last synced: 15 Oct 2025
https://github.com/bghira/SimpleTuner
A general fine-tuning kit geared toward diffusion models.
diffusers diffusion-models fine-tuning flux-dev machine-learning stable-diffusion
Last synced: 29 Jul 2025
https://github.com/tencent/MimicMotion
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
diffusion-models video-generation
Last synced: 11 May 2025
https://github.com/tencent/mimicmotion
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
diffusion-models video-generation
Last synced: 15 May 2025
https://github.com/andreas128/repaint
Official PyTorch Code and Models of "RePaint: Inpainting using Denoising Diffusion Probabilistic Models", CVPR 2022
cvpr2022 diffusion-models inpainting
Last synced: 08 Apr 2025
https://github.com/alpha-vllm/lumina-t2x
Lumina-T2X is a unified framework for Text to Any Modality Generation
aigc diffusion diffusion-model diffusion-models diffusion-transformer generation-models transformer transformers
Last synced: 11 Apr 2025
https://github.com/Alpha-VLLM/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
aigc diffusion diffusion-model diffusion-models diffusion-transformer generation-models transformer transformers
Last synced: 24 Mar 2025
https://github.com/andreas128/RePaint
Official PyTorch Code and Models of "RePaint: Inpainting using Denoising Diffusion Probabilistic Models", CVPR 2022
cvpr2022 diffusion-models inpainting
Last synced: 20 Jul 2025
https://github.com/open-mmlab/mmgeneration
MMGeneration is a powerful toolkit for generative models, based on PyTorch and MMCV.
diffusion-models gan generative generative-adversarial-network mmcv openmmlab pytorch
Last synced: 15 May 2025
https://github.com/adobe-research/custom-diffusion
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)
computer-vision customization diffusion-models few-shot fine-tuning pytorch text-to-image-generation
Last synced: 15 May 2025
https://github.com/sudo-ai-3d/zero123plus
Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.
3d 3d-graphics aigc diffusers diffusion-models image-to-3d research-project text-to-3d
Last synced: 15 May 2025
https://github.com/yang-song/score_sde_pytorch
PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)
controllable-generation diffusion-models generative-models iclr-2021 inverse-problems pytorch score-based-generative-modeling score-matching stochastic-differential-equations
Last synced: 15 May 2025
https://github.com/siliconflow/onediff
OneDiff: An out-of-the-box acceleration library for diffusion models.
aigc-serving comfyui comfyui-workflow cuda diffusers diffusion-models inference-engine lcm lcm-lora lora performance-optimization pytorch sd-webui sdxl sdxl-turbo stable-diffusion stable-video-diffusion
Last synced: 13 May 2025
https://github.com/junshutang/make-it-3d
[ICCV 2023] Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior
3d-generation 3d-vision computer-vision deep-learning diffusion-models generative-art nerf
Last synced: 15 May 2025
https://github.com/CryptoAILab/Awesome-LM-SSP
A reading list for large model safety, security, and privacy (including Awesome LLM Security, Safety, etc.).
adversarial-attacks awesome-list diffusion-models jailbreak language-model llm nlp privacy safety security vlm
Last synced: 18 Jan 2026
https://github.com/junshutang/Make-It-3D
[ICCV 2023] Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior
3d-generation 3d-vision computer-vision deep-learning diffusion-models generative-art nerf
Last synced: 07 May 2025
https://github.com/SUDO-AI-3D/zero123plus
Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.
3d 3d-graphics aigc diffusers diffusion-models image-to-3d research-project text-to-3d
Last synced: 24 Jul 2025
https://github.com/foundationvision/llamagen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
auto-regressive-model diffusion diffusion-models image-generation llama llm text2image
Last synced: 15 May 2025
https://github.com/bghira/simpletuner
A general fine-tuning kit geared toward diffusion models.
diffusers diffusion-models fine-tuning flux-dev machine-learning stable-diffusion
Last synced: 29 Jan 2026
https://github.com/FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
auto-regressive-model diffusion diffusion-models image-generation llama llm text2image
Last synced: 07 May 2025
https://github.com/mit-han-lab/nunchaku
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
comfyui diffusion-models flux genai iclr iclr2025 lora mlsys quantization
Last synced: 13 May 2025
https://github.com/luchengthu/dpm-solver
Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (NeurIPS 2022 Oral)
diffusion-models machine-learning score-based-generative-models stable-diffusion
Last synced: 15 May 2025
https://github.com/LuChengTHU/dpm-solver
Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (NeurIPS 2022 Oral)
diffusion-models machine-learning score-based-generative-models stable-diffusion
Last synced: 04 Apr 2025
https://github.com/maximevandegar/papers-in-100-lines-of-code
Implementation of papers in 100 lines of code.
3d aes artificial-intelligence deep-learning diffusion-models educational gans generative-model implementation-of-research-paper inverse-rendering machine-learning meta-learning nerf neural-radiance-fields papers python pytorch reinforcement-learning research rl
Last synced: 07 Oct 2025
https://github.com/eloialonso/diamond
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
artificial-intelligence atari deep-learning diffusion-models machine-learning reinforcement-learning research world-models
Last synced: 26 Feb 2025
https://github.com/guochengqian/Magic123
[ICLR24] Official PyTorch Implementation of Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors
3d-generation diffusion-models image-to-3d
Last synced: 22 Jul 2025
https://github.com/guochengqian/magic123
[ICLR24] Official PyTorch Implementation of Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors
3d-generation diffusion-models image-to-3d
Last synced: 08 Apr 2025
https://github.com/tencentarc/brushnet
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
diffusion diffusion-models eccv eccv2024 image-inpainting text-to-image
Last synced: 14 May 2025
https://github.com/menyifang/MIMO
Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"
character-animation diffusion-models video-synthesis
Last synced: 01 Oct 2025
https://github.com/yang-song/score_sde
Official code for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)
controllable-generation diffusion-models flax generative-models iclr-2021 inverse-problems jax score-based-generative-modeling score-matching stochastic-differential-equations
Last synced: 16 May 2025
https://github.com/TencentARC/BrushNet
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
diffusion diffusion-models eccv eccv2024 image-inpainting text-to-image
Last synced: 28 Mar 2025
https://github.com/zai-org/ImageReward
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
diffusion-models generative-model human-preferences rlhf
Last synced: 29 Dec 2025
https://github.com/THUDM/ImageReward
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
diffusion-models generative-model human-preferences rlhf
Last synced: 28 Mar 2025
https://github.com/bloc97/crossattentioncontrol
Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion
cross-attention deep-learning diffusion-models stable-diffusion
Last synced: 16 May 2025
https://github.com/bloc97/CrossAttentionControl
Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion
cross-attention deep-learning diffusion-models stable-diffusion
Last synced: 03 Apr 2025
https://github.com/huggingface/finetrainers
Scalable and memory-optimized training of diffusion models
ai art artificial-intelligence diffusers diffusion diffusion-models pytorch transformers
Last synced: 14 Dec 2025
https://github.com/davidadsp/generative_deep_learning_2nd_edition
The official code repository for the second edition of the O'Reilly book Generative Deep Learning: Teaching Machines to Paint, Write, Compose and Play.
chatgpt dalle2 data-science deep-learning diffusion-models generative-adversarial-network gpt-3 machine-learning python stable-diffusion tensorflow
Last synced: 06 Oct 2025
https://github.com/wyhuai/ddnm
[ICLR 2023 Oral] Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model
diffusion-models iclr iclr2023 image-restoration zero-shot
Last synced: 16 May 2025
https://github.com/davidADSP/Generative_Deep_Learning_2nd_Edition
The official code repository for the second edition of the O'Reilly book Generative Deep Learning: Teaching Machines to Paint, Write, Compose and Play.
chatgpt dalle2 data-science deep-learning diffusion-models generative-adversarial-network gpt-3 machine-learning python stable-diffusion tensorflow
Last synced: 01 May 2025
https://github.com/0xCrunchyy/10x
Optimized inference and fine-tuning framework for diffusion (image & video) models. Up to 3x faster & 80% less VRAM.
artificial-inteligence diffusion diffusion-models fine-tuning flux gpt inference lora pytorch sdxl
Last synced: 09 Jan 2026
https://github.com/wyhuai/DDNM
[ICLR 2023 Oral] Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model
diffusion-models iclr iclr2023 image-restoration zero-shot
Last synced: 27 Mar 2025
https://github.com/gcorso/diffdock
Implementation of DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking
binding computational-biology diffusion-models docking equivariance machine-learning non-euclidean-geometry score-based-models
Last synced: 12 Apr 2025
https://github.com/yujun-shi/dragdiffusion
[CVPR2024, Highlight] Official code for DragDiffusion
artificial-intelligence cvpr2024 diffusion-models dragdiffusion draggan image-editing
Last synced: 16 May 2025
https://github.com/gcorso/DiffDock
Implementation of DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking
binding computational-biology diffusion-models docking equivariance machine-learning non-euclidean-geometry score-based-models
Last synced: 30 Mar 2025
https://github.com/Yujun-Shi/DragDiffusion
[CVPR2024, Highlight] Official code for DragDiffusion
artificial-intelligence cvpr2024 diffusion-models dragdiffusion draggan image-editing
Last synced: 27 Mar 2025
https://github.com/muzishen/imagdressing
[AAAI 2025]👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high fidelity and garment consistency for virtual dressing.
datasets diffusion-models text-to-image-generation try-on
Last synced: 16 May 2025
https://github.com/muzishen/IMAGDressing
[AAAI 2025]👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high fidelity and garment consistency for virtual dressing.
datasets diffusion-models text-to-image-generation try-on
Last synced: 27 Mar 2025
https://github.com/Zheng-Chong/CatVTON
[ICLR 2025] CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters in total), 2) Parameter-Efficient Training (49.57M trainable parameters) and 3) Simplified Inference (< 8G VRAM for 1024x768 resolution).
diffusion-models fashion try-on
Last synced: 27 Mar 2025
https://github.com/fantasy-studio/paint-by-example
Paint by Example: Exemplar-based Image Editing with Diffusion Models
computer-vision deep-learning diffusion-models image-editing image-generation image-manipulation paint-by-example pytorch stable-diffusion
Last synced: 16 May 2025
https://github.com/lightricks/comfyui-ltxvideo
LTX-Video Support for ComfyUI
comfyui diffusion-models dit image-to-video image-to-video-generation text-to-image text-to-image-generation
Last synced: 15 May 2025
https://github.com/Fantasy-Studio/Paint-by-Example?tab=readme-ov-file
Paint by Example: Exemplar-based Image Editing with Diffusion Models
computer-vision deep-learning diffusion-models image-editing image-generation image-manipulation paint-by-example pytorch stable-diffusion
Last synced: 11 May 2025
https://github.com/River-Zhang/ICEdit
Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Training released! Surpasses GPT-4o in ID persistence! Official ComfyUI workflow release! Only 4GB VRAM is enough to run!
diffusion diffusion-models diffusion-transformer dit editing-image gpt4o gpt4oimage image-editing in-context
Last synced: 12 Jun 2025
https://github.com/a-r-r-o-w/finetrainers
Scalable and memory-optimized training of diffusion models
ai art artificial-intelligence diffusers diffusion diffusion-models pytorch transformers
Last synced: 14 May 2025
https://github.com/Fantasy-Studio/Paint-by-Example
Paint by Example: Exemplar-based Image Editing with Diffusion Models
computer-vision deep-learning diffusion-models image-editing image-generation image-manipulation paint-by-example pytorch stable-diffusion
Last synced: 28 Mar 2025
https://github.com/declare-lab/tango
A family of diffusion models for text-to-audio generation.
audio-generation diffusion diffusion-models language-models large-language-models text-to-audio
Last synced: 16 May 2025
https://github.com/openbmb/viscpm
[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | A series of Chinese-English bilingual multimodal large models based on the CPM foundation models
diffusion-models large-language-models multimodal transformers
Last synced: 16 May 2025
https://github.com/OpenBMB/VisCPM
[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | A series of Chinese-English bilingual multimodal large models based on the CPM foundation models
diffusion-models large-language-models multimodal transformers
Last synced: 16 Apr 2025
https://github.com/omerbt/multidiffusion
Official PyTorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" (ICML 2023)
diffusion-models generative-model icml image-generation multidiffusion stable-diffusion text-to-image
Last synced: 16 May 2025
https://github.com/wladradchenko/wunjo.wladradchenko.ru
Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, Video Generation. Open Source, Local & Free.
controlnet deepfake diffusion-models face-animation face-swap free img2video lip-sync photo-editing public-api remove-background remover restyle segment-anything txt2video video-editing video-generation voice-clone wunjo
Last synced: 10 May 2025
https://github.com/lukasHoel/text2room
Text2Room generates textured 3D meshes from a given text prompt using 2D text-to-image models (ICCV2023).
3d-generation diffusion-models mesh-generation text-to-image
Last synced: 24 Mar 2025
https://github.com/hymie122/RAG-Survey
Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
aigc diffusion-models llm multimodality rag survey
Last synced: 20 Apr 2025
https://github.com/3DTopia/3DTopia-XL
[CVPR 2025 Highlight] 3DTopia-XL: High-Quality 3D PBR Asset Generation via Primitive Diffusion
3d-generation cvpr cvpr2025 diffusion-models image-to-3d text-to-3d
Last synced: 25 Oct 2025
https://github.com/open-mmlab/pia
[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combining with Dreambooth, achieving stunning videos. PIA, your personalized image animator, uses text prompts to turn images into wonderful animations
aigc animation diffusion-models image-to-video image-to-video-generation personalized-generation stable-diffusion
Last synced: 16 May 2025
https://github.com/omerbt/MultiDiffusion
Official PyTorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" (ICML 2023)
diffusion-models generative-model icml image-generation multidiffusion stable-diffusion text-to-image
Last synced: 28 Mar 2025
https://github.com/3dtopia/3dtopia-xl
[CVPR 2025 Highlight] 3DTopia-XL: High-Quality 3D PBR Asset Generation via Primitive Diffusion
3d-generation cvpr cvpr2025 diffusion-models image-to-3d text-to-3d
Last synced: 15 May 2025
https://github.com/open-mmlab/PIA
[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combining with Dreambooth, achieving stunning videos. PIA, your personalized image animator, uses text prompts to turn images into wonderful animations
aigc animation diffusion-models image-to-video image-to-video-generation personalized-generation stable-diffusion
Last synced: 28 Mar 2025
https://github.com/UCSC-VLAA/story-iter
[ICLR 2026] A Training-free Iterative Framework for Long Story Visualization
diffusion-models generative-art generative-model image-generation storytelling visual-storytelling
Last synced: 08 Feb 2026
https://github.com/cure-lab/magicdrive
[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”
autonomous-vehicles deep-learning diffusion-models image-generation pytorch video-generation
Last synced: 15 May 2025
https://github.com/showlab/motiondirector
[ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
diffusion-models motion-customization text-to-motion text-to-video text-to-video-generation video-generation
Last synced: 12 Apr 2025
https://github.com/showlab/MotionDirector
[ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
diffusion-models motion-customization text-to-motion text-to-video text-to-video-generation video-generation
Last synced: 28 Mar 2025
https://showlab.github.io/MotionDirector/
[ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
diffusion-models motion-customization text-to-motion text-to-video text-to-video-generation video-generation
Last synced: 28 Mar 2025
https://github.com/phizaz/diffae
Official implementation of Diffusion Autoencoders
autoencoder cvpr2022 deep-learning diffusion-models ffhq latent-variable-models lsun
Last synced: 16 May 2025
https://github.com/liuyuan-pal/SyncDreamer
[ICLR 2024 Spotlight] SyncDreamer: Generating Multiview-consistent Images from a Single-view Image
3d-reconstruction diffusion-models generative-model single-view-reconstruction
Last synced: 07 Apr 2025
https://github.com/radames/real-time-latent-consistency-model
App showcasing multiple real-time diffusion model pipelines with Diffusers
diffusers diffusion-models latent-consistency-model machine-learning mjpeg mjpeg-stream real-time stable-diffusion
Last synced: 15 May 2025
https://github.com/horseee/deepcache
[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free
diffusion-models efficient-inference model-compression stable-diffusion training-free
Last synced: 15 May 2025
https://github.com/eric-ai-lab/MiniGPT-5
Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"
diffusion-models multimodal-generation multimodal-llm transformers
Last synced: 07 May 2025
https://liewfeng.github.io/TeaCache/
Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model
cogvideox diffusion-models hunyuan-video inference-acceleration latte open-sora open-sora-plan video-generation
Last synced: 01 Aug 2025
https://github.com/ali-vilab/videocomposer
Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability
diffusion-models video-generation video-synthesiswith
Last synced: 29 Dec 2025
https://github.com/NVlabs/ODISE
Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]
deep-learning diffusion-models instance-segmentation open-vocabulary open-vocabulary-segmentation open-vocabulary-semantic-segmentation open-world-classification open-world-object-detection panoptic-segmentation pytorch semantic-segmentation text-image-retrieval zero-shot-learning
Last synced: 28 Mar 2025
https://github.com/hkproj/pytorch-stable-diffusion
Stable Diffusion implemented from scratch in PyTorch
diffusion-models latent-diffusion-models paper-implementations pytorch pytorch-implementation stable-diffusion
Last synced: 15 May 2025
https://github.com/showlab/show-o
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
diffusion-models large-language-models multimodal
Last synced: 07 Apr 2025
https://github.com/SongweiGe/rich-text-to-image
Rich-Text-to-Image Generation
computer-vision diffusion-models pytorch rich-text text-to-image-generation
Last synced: 28 Mar 2025