Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with diffusion-models

A curated list of projects in awesome lists tagged with diffusion-models.

https://github.com/open-mmlab/mmagic

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awesome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.

aigc computer-vision deep-learning diffusion diffusion-models generative-adversarial-network generative-ai image-editing image-generation image-processing image-synthesis inpainting matting pytorch super-resolution text2image video-frame-interpolation video-interpolation video-super-resolution

Last synced: 16 Dec 2024

https://github.com/yl4579/styletts2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

adversarial-training deep-learning diffusion-models gan latent-diffusion latent-diffusion-models pytorch speaker-adaptation speech-synthesis text-to-speech tts wavlm

Last synced: 17 Dec 2024

https://github.com/yl4579/StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

adversarial-training deep-learning diffusion-models gan latent-diffusion latent-diffusion-models pytorch speaker-adaptation speech-synthesis text-to-speech tts wavlm

Last synced: 06 Nov 2024

https://github.com/fanghua-yu/supir

SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.

deep-learning diffusion-models llava pytorch pytorch-lightning restoration sdxl stable-diffusion super-resolution

Last synced: 17 Dec 2024

https://github.com/Fanghua-Yu/SUPIR

SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.

deep-learning diffusion-models llava pytorch pytorch-lightning restoration sdxl stable-diffusion super-resolution

Last synced: 30 Oct 2024

https://github.com/foundationvision/var

[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

auto-regressive-model autoregressive-models diffusion-models generative-ai generative-model gpt gpt-2 image-generation large-language-models neurips transformers vision-transformer

Last synced: 17 Dec 2024

https://github.com/FoundationVision/VAR

[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

auto-regressive-model autoregressive-models diffusion-models generative-ai generative-model gpt gpt-2 image-generation large-language-models neurips transformers vision-transformer

Last synced: 04 Nov 2024

https://github.com/ali-vilab/vgen

Official repo for VGen: a holistic video generation ecosystem built on diffusion models

diffusion-models video-synthesis

Last synced: 18 Dec 2024

https://github.com/ali-vilab/VGen

Official repo for VGen: a holistic video generation ecosystem built on diffusion models

diffusion-models video-synthesis

Last synced: 31 Oct 2024

https://github.com/ali-vilab/i2vgen-xl

Official repo for VGen: a holistic video generation ecosystem built on diffusion models

diffusion-models video-synthesis

Last synced: 31 Oct 2024

https://github.com/alpha-vllm/lumina-t2x

Lumina-T2X is a unified framework for Text to Any Modality Generation

aigc diffusion diffusion-model diffusion-models diffusion-transformer generation-models transformer transformers

Last synced: 19 Dec 2024

https://github.com/deepseek-ai/dreamcraft3d

[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior

3d-creation 3d-generation aigc diffusion-models generative-model image-to-3d

Last synced: 13 Dec 2024

https://github.com/andreas128/repaint

Official PyTorch Code and Models of "RePaint: Inpainting using Denoising Diffusion Probabilistic Models", CVPR 2022

cvpr2022 diffusion-models inpainting

Last synced: 21 Dec 2024

https://github.com/andreas128/RePaint

Official PyTorch Code and Models of "RePaint: Inpainting using Denoising Diffusion Probabilistic Models", CVPR 2022

cvpr2022 diffusion-models inpainting

Last synced: 28 Nov 2024

https://github.com/tencent/mimicmotion

High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance

diffusion-models video-generation

Last synced: 19 Dec 2024

https://github.com/open-mmlab/mmgeneration

MMGeneration is a powerful toolkit for generative models, based on PyTorch and MMCV.

diffusion-models gan generative generative-adversarial-network mmcv openmmlab pytorch

Last synced: 14 Dec 2024

https://github.com/adobe-research/custom-diffusion

Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)

computer-vision customization diffusion-models few-shot fine-tuning pytorch text-to-image-generation

Last synced: 21 Dec 2024

https://github.com/junshutang/make-it-3d

[ICCV 2023] Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior

3d-generation 3d-vision computer-vision deep-learning diffusion-models generative-art nerf

Last synced: 19 Dec 2024

https://github.com/sudo-ai-3d/zero123plus

Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.

3d 3d-graphics aigc diffusers diffusion-models image-to-3d research-project text-to-3d

Last synced: 21 Dec 2024

https://github.com/junshutang/Make-It-3D

[ICCV 2023] Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior

3d-generation 3d-vision computer-vision deep-learning diffusion-models generative-art nerf

Last synced: 14 Nov 2024

https://github.com/SUDO-AI-3D/zero123plus

Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.

3d 3d-graphics aigc diffusers diffusion-models image-to-3d research-project text-to-3d

Last synced: 30 Nov 2024

https://github.com/yang-song/score_sde_pytorch

PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

controllable-generation diffusion-models generative-models iclr-2021 inverse-problems pytorch score-based-generative-modeling score-matching stochastic-differential-equations

Last synced: 15 Dec 2024

https://github.com/bghira/simpletuner

A general fine-tuning kit geared toward diffusion models.

diffusers diffusion-models fine-tuning flux-dev machine-learning stable-diffusion

Last synced: 16 Dec 2024

https://github.com/bghira/SimpleTuner

A general fine-tuning kit geared toward diffusion models.

diffusers diffusion-models fine-tuning flux-dev machine-learning stable-diffusion

Last synced: 03 Dec 2024

https://github.com/luchengthu/dpm-solver

Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (NeurIPS 2022 Oral)

diffusion-models machine-learning score-based-generative-models stable-diffusion

Last synced: 21 Dec 2024

https://github.com/guochengqian/magic123

[ICLR24] Official PyTorch Implementation of Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors

3d-generation diffusion-models image-to-3d

Last synced: 21 Dec 2024

https://github.com/guochengqian/Magic123

[ICLR24] Official PyTorch Implementation of Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors

3d-generation diffusion-models image-to-3d

Last synced: 29 Nov 2024

https://github.com/LuChengTHU/dpm-solver

Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (NeurIPS 2022 Oral)

diffusion-models machine-learning score-based-generative-models stable-diffusion

Last synced: 05 Nov 2024

https://github.com/yang-song/score_sde

Official code for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

controllable-generation diffusion-models flax generative-models iclr-2021 inverse-problems jax score-based-generative-modeling score-matching stochastic-differential-equations

Last synced: 15 Dec 2024

https://github.com/tencentarc/brushnet

[ECCV 2024] The official implementation of the paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

diffusion diffusion-models eccv eccv2024 image-inpainting text-to-image

Last synced: 19 Dec 2024

https://github.com/TencentARC/BrushNet

[ECCV 2024] The official implementation of the paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

diffusion diffusion-models eccv eccv2024 image-inpainting text-to-image

Last synced: 31 Oct 2024

https://github.com/bloc97/crossattentioncontrol

Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion

cross-attention deep-learning diffusion-models stable-diffusion

Last synced: 15 Dec 2024

https://github.com/bloc97/CrossAttentionControl

Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion

cross-attention deep-learning diffusion-models stable-diffusion

Last synced: 04 Nov 2024

https://github.com/foundationvision/llamagen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

auto-regressive-model diffusion diffusion-models image-generation llama llm text2image

Last synced: 19 Dec 2024

https://github.com/FoundationVision/LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

auto-regressive-model diffusion diffusion-models image-generation llama llm text2image

Last synced: 14 Nov 2024

https://github.com/thudm/imagereward

[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation

diffusion-models generative-model human-preferences rlhf

Last synced: 18 Dec 2024

https://github.com/yujun-shi/dragdiffusion

[CVPR2024, Highlight] Official code for DragDiffusion

artificial-intelligence cvpr2024 diffusion-models dragdiffusion draggan image-editing

Last synced: 15 Dec 2024

https://github.com/Alpha-VLLM/Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

aigc diffusion diffusion-model diffusion-models diffusion-transformer generation-models transformer transformers

Last synced: 29 Oct 2024

https://github.com/wyhuai/ddnm

[ICLR 2023 Oral] Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model

diffusion-models iclr iclr2023 image-restoration zero-shot

Last synced: 15 Dec 2024

https://github.com/THUDM/ImageReward

[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation

diffusion-models generative-model human-preferences rlhf

Last synced: 31 Oct 2024

https://github.com/wyhuai/DDNM

[ICLR 2023 Oral] Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model

diffusion-models iclr iclr2023 image-restoration zero-shot

Last synced: 30 Oct 2024

https://github.com/Yujun-Shi/DragDiffusion

[CVPR2024, Highlight] Official code for DragDiffusion

artificial-intelligence cvpr2024 diffusion-models dragdiffusion draggan image-editing

Last synced: 30 Oct 2024

https://github.com/davidadsp/generative_deep_learning_2nd_edition

The official code repository for the second edition of the O'Reilly book Generative Deep Learning: Teaching Machines to Paint, Write, Compose and Play.

chatgpt dalle2 data-science deep-learning diffusion-models generative-adversarial-network gpt-3 machine-learning python stable-diffusion tensorflow

Last synced: 20 Dec 2024

https://github.com/gcorso/diffdock

Implementation of DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking

binding computational-biology diffusion-models docking equivariance machine-learning non-euclidean-geometry score-based-models

Last synced: 20 Dec 2024

https://github.com/openbmb/viscpm

[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | A series of Chinese-English bilingual multimodal large models based on the CPM foundation models

diffusion-models large-language-models multimodal transformers

Last synced: 20 Dec 2024

https://github.com/declare-lab/tango

A family of diffusion models for text-to-audio generation.

audio-generation diffusion diffusion-models language-models large-language-models text-to-audio

Last synced: 20 Dec 2024

https://github.com/OpenBMB/VisCPM

[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | A series of Chinese-English bilingual multimodal large models based on the CPM foundation models

diffusion-models large-language-models multimodal transformers

Last synced: 08 Nov 2024

https://github.com/gcorso/DiffDock

Implementation of DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking

binding computational-biology diffusion-models docking equivariance machine-learning non-euclidean-geometry score-based-models

Last synced: 01 Nov 2024

https://github.com/tencent/MimicMotion

High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance

diffusion-models video-generation

Last synced: 17 Nov 2024

https://github.com/hymie122/RAG-Survey

Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in the paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".

aigc diffusion-models llm multimodality rag survey

Last synced: 09 Nov 2024

https://github.com/3dtopia/3dtopia-xl

3DTopia-XL: High-Quality 3D PBR Asset Generation via Primitive Diffusion

3d-generation diffusion-models image-to-3d text-to-3d

Last synced: 20 Dec 2024

https://github.com/omerbt/multidiffusion

Official PyTorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" (ICML 2023)

diffusion-models generative-model icml image-generation multidiffusion stable-diffusion text-to-image

Last synced: 17 Dec 2024

https://github.com/lukasHoel/text2room

Text2Room generates textured 3D meshes from a given text prompt using 2D text-to-image models (ICCV2023).

3d-generation diffusion-models mesh-generation text-to-image

Last synced: 28 Oct 2024

https://github.com/omerbt/MultiDiffusion

Official PyTorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" (ICML 2023)

diffusion-models generative-model icml image-generation multidiffusion stable-diffusion text-to-image

Last synced: 31 Oct 2024

https://github.com/open-mmlab/pia

[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combining with Dreambooth, achieving stunning videos. PIA, your personalized image animator, turns images into wonderful animations using text prompts.

aigc animation diffusion-models image-to-video image-to-video-generation personalized-generation stable-diffusion

Last synced: 20 Dec 2024

https://github.com/phizaz/diffae

Official implementation of Diffusion Autoencoders

autoencoder cvpr2022 deep-learning diffusion-models ffhq latent-variable-models lsun

Last synced: 16 Dec 2024

https://github.com/Zheng-Chong/CatVTON

CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters in total), 2) Parameter-Efficient Training (49.57M trainable parameters) and 3) Simplified Inference (< 8G VRAM for 1024×768 resolution).

diffusion-models fashion try-on

Last synced: 30 Oct 2024

https://github.com/showlab/motiondirector

[ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.

diffusion-models motion-customization text-to-motion text-to-video text-to-video-generation video-generation

Last synced: 15 Dec 2024

https://github.com/liuyuan-pal/SyncDreamer

[ICLR 2024 Spotlight] SyncDreamer: Generating Multiview-consistent Images from a Single-view Image

3d-reconstruction diffusion-models generative-model single-view-reconstruction

Last synced: 06 Nov 2024

https://github.com/ali-vilab/videocomposer

Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability

diffusion-models video-generation video-synthesiswith

Last synced: 31 Oct 2024

https://github.com/open-mmlab/PIA

[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combining with Dreambooth, achieving stunning videos. PIA, your personalized image animator, turns images into wonderful animations using text prompts.

aigc animation diffusion-models image-to-video image-to-video-generation personalized-generation stable-diffusion

Last synced: 31 Oct 2024

https://github.com/eric-ai-lab/MiniGPT-5

Official implementation of the paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"

diffusion-models multimodal-generation multimodal-llm transformers

Last synced: 14 Nov 2024

https://github.com/horseee/deepcache

[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free

diffusion-models efficient-inference model-compression stable-diffusion training-free

Last synced: 19 Dec 2024

https://github.com/showlab/show-o

Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

diffusion-models large-language-models multimodal

Last synced: 15 Dec 2024

https://github.com/showlab/MotionDirector

[ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.

diffusion-models motion-customization text-to-motion text-to-video text-to-video-generation video-generation

Last synced: 31 Oct 2024

https://showlab.github.io/MotionDirector/

[ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.

diffusion-models motion-customization text-to-motion text-to-video text-to-video-generation video-generation

Last synced: 31 Oct 2024

https://github.com/dome272/Paella

Official Implementation of Paella https://arxiv.org/abs/2211.07292v2

diffusion-models generative-model

Last synced: 04 Nov 2024

https://github.com/Text-to-Audio/Make-An-Audio

PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model

diffusion-models latent-diffusion latent-space text-to-audio video-to-audio

Last synced: 08 Nov 2024

https://github.com/cure-lab/magicdrive

[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”

autonomous-vehicles deep-learning diffusion-models image-generation pytorch video-generation

Last synced: 21 Dec 2024

https://github.com/vicgalle/stable-diffusion-aesthetic-gradients

Personalization for Stable Diffusion via Aesthetic Gradients 🎨

diffusion-models laion stable-diffusion text-to-image text2image

Last synced: 05 Nov 2024

https://github.com/horseee/DeepCache

[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free

diffusion-models efficient-inference model-compression stable-diffusion training-free

Last synced: 30 Aug 2024

https://github.com/yuval-alaluf/attend-and-excite

Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)

diffusion-models stable-diffusion text-to-image

Last synced: 20 Dec 2024

https://github.com/hustvl/gaussiandreamer

GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models (CVPR 2024)

aigc computer-vision cvpr2024 diffusion-models dreamfusion gaussian-splatting nerf radiance-field smpl text-to-3d

Last synced: 21 Dec 2024

https://github.com/OpenTexture/Paint3D

[CVPR 2024] Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models, a generative model for textures without baked-in lighting

diffusion-models generative-ai generative-model stable-diffusion texture texture-synthesis

Last synced: 14 Nov 2024

https://github.com/yuval-alaluf/Attend-and-Excite

Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)

diffusion-models stable-diffusion text-to-image

Last synced: 31 Oct 2024

https://yuval-alaluf.github.io/Attend-and-Excite/

Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)

diffusion-models stable-diffusion text-to-image

Last synced: 31 Oct 2024

https://github.com/teticio/audio-diffusion

Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.

diffusion-models huggingface latent-diffusion music-generation

Last synced: 15 Dec 2024

https://boese0601.github.io/magicdance/

[ICML 2024] MagicPose (also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion

behavior-generation cartoon-animation diffusion-models generative-ai generative-model image-editing video-editing video-generation

Last synced: 31 Oct 2024

https://github.com/Boese0601/MagicDance

[ICML 2024] MagicPose (also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion

behavior-generation cartoon-animation diffusion-models generative-ai generative-model image-editing video-editing video-generation

Last synced: 31 Oct 2024

https://github.com/project-monai/generativemodels

MONAI Generative Models makes it easy to train, evaluate, and deploy generative models and related applications

anomaly-detection diffusion-models generative-adversarial-network generative-models image-synthesis image-translation medical-imaging monai mri-reconstruction

Last synced: 15 Dec 2024

https://github.com/hustvl/GaussianDreamer

GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models (CVPR 2024)

aigc computer-vision cvpr2024 diffusion-models dreamfusion gaussian-splatting nerf radiance-field smpl text-to-3d

Last synced: 09 Nov 2024

https://github.com/thu-ml/crm

[ECCV 2024] Single Image to 3D Textured Mesh in 10 seconds with Convolutional Reconstruction Model.

3d aigc diffusion-models generative-model multiview reconstruction

Last synced: 21 Dec 2024

https://github.com/mit-han-lab/distrifuser

[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

acceleration diffusion-models generative-ai generative-model parallelism

Last synced: 31 Oct 2024