Projects in Awesome Lists tagged with diffusion
A curated list of projects in awesome lists tagged with diffusion .
https://github.com/automatic1111/stable-diffusion-webui
Stable Diffusion web UI
ai ai-art deep-learning diffusion gradio image-generation image2image img2img pytorch stable-diffusion text2image torch txt2img unstable upscaling web
Last synced: 09 Sep 2025
https://github.com/AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
ai ai-art deep-learning diffusion gradio image-generation image2image img2img pytorch stable-diffusion text2image torch txt2img unstable upscaling web
Last synced: 14 Mar 2025
https://github.com/huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
deep-learning diffusion flax flux hacktoberfest image-generation image2image image2video jax latent-diffusion-models pytorch score-based-generative-modeling stable-diffusion stable-diffusion-diffusers text2image text2video video2video
Last synced: 12 May 2025
https://github.com/huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
adapter diffusion llm lora parameter-efficient-learning python pytorch transformers
Last synced: 12 May 2025
https://github.com/datawhalechina/leedl-tutorial
《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases
bert chatgpt cnn deep-learning diffusion gan leedl-tutorial machine-learning network-compression pruning reinforcement-learning rnn self-attention transfer-learning transformer tutorial
Last synced: 14 May 2025
https://github.com/easydiffusion/easydiffusion
An easy 1-click way to create beautiful artwork on your PC using AI, with no tech knowledge. Provides a browser UI for generating images from text prompts and images. Just enter your text prompt, and see the generated image.
art diffusion generative-art gui stable
Last synced: 11 May 2025
https://github.com/cloneofsimo/lora
Using Low-rank adaptation to quickly fine-tune diffusion models.
diffusion dreambooth fine-tuning lora stable-diffusion
Last synced: 14 May 2025
https://github.com/open-mmlab/mmagic
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.
aigc computer-vision deep-learning diffusion diffusion-models generative-adversarial-network generative-ai image-editing image-generation image-processing image-synthesis inpainting matting pytorch super-resolution text2image video-frame-interpolation video-interpolation video-super-resolution
Last synced: 13 Dec 2025
https://github.com/NVlabs/Sana
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
diffusion dit pytorch sana text-to-image-generation transformers
Last synced: 07 Aug 2025
https://github.com/leejet/stable-diffusion.cpp
Stable Diffusion and Flux in pure C/C++
ai cplusplus diffusion flux flux-dev flux-schnell ggml image-generation image2image img2img latent-diffusion stable-diffusion text2image txt2img
Last synced: 25 Jan 2026
https://github.com/riffusion/riffusion-hobby
Stable diffusion for real-time music generation
ai audio diffusers diffusion music stable-diffusion
Last synced: 12 Jan 2026
https://github.com/jina-ai/discoart
🪩 Create Disco Diffusion artworks in one line
clip-guided-diffusion creative-ai creative-art cross-modal dalle diffusion disco-diffusion discodiffusion generative-art imgen latent-diffusion midjourney multimodal prompts stable-diffusion
Last synced: 14 May 2025
https://github.com/williamyang1991/rerender_a_video
[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
controlnet diffusion video-processing
Last synced: 15 May 2025
https://github.com/williamyang1991/Rerender_A_Video
[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
controlnet diffusion video-processing
Last synced: 11 Apr 2025
https://github.com/openvpi/DiffSinger
An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
acoustic-model diffusion diffussion-model melody-frontend midi pitch-prediction rectified-flow singing-voice singing-voice-synthesis svs
Last synced: 02 Apr 2025
https://github.com/ai-forever/kandinsky-2
Kandinsky 2 — multilingual text2image latent diffusion model
diffusion image-generation image2image inpainting ipython-notebook kandinsky outpainting text-to-image text2image
Last synced: 15 May 2025
https://github.com/datawhalechina/tiny-universe
《大模型白盒子构建指南》:一个全手搓的Tiny-Universe
agent diffusion evaluation-metrics llama qwen rag transformers
Last synced: 14 May 2025
https://github.com/ai-forever/Kandinsky-2
Kandinsky 2 — multilingual text2image latent diffusion model
diffusion image-generation image2image inpainting ipython-notebook kandinsky outpainting text-to-image text2image
Last synced: 08 Apr 2025
https://github.com/openvpi/diffsinger
An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
acoustic-model diffusion diffussion-model melody-frontend midi pitch-prediction rectified-flow singing-voice singing-voice-synthesis svs
Last synced: 08 Jan 2026
https://github.com/playvoice/whisper-vits-svc
Core Engine of Singing Voice Conversion & Singing Voice Clone
change diff-svc diffusion diffusion-svc singing-voice-conversion sovits svc vits vits2 voice
Last synced: 14 May 2025
https://github.com/tmelyralab/musev
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
diffusion human-video-generation image2video infinite-length musev video-generation
Last synced: 26 Sep 2025
https://github.com/TMElyralab/MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
diffusion human-video-generation image2video infinite-length musev video-generation
Last synced: 11 Apr 2025
https://github.com/riffusion/riffusion-app-hobby
Stable diffusion for real-time music generation (web app)
ai audio diffusion music nextjs stable-diffusion threejs
Last synced: 15 May 2025
https://github.com/PlayVoice/whisper-vits-svc
Core Engine of Singing Voice Conversion & Singing Voice Clone
change diff-svc diffusion diffusion-svc singing-voice-conversion sovits svc vits vits2 voice
Last synced: 30 Aug 2025
https://github.com/prs-eth/marigold
[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
diffusion in-the-wild monocular-depth-estimation zero-shot
Last synced: 14 May 2025
https://github.com/prs-eth/Marigold
[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
diffusion in-the-wild monocular-depth-estimation zero-shot
Last synced: 28 Mar 2025
https://github.com/Alpha-VLLM/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
aigc diffusion diffusion-model diffusion-models diffusion-transformer generation-models transformer transformers
Last synced: 24 Mar 2025
https://github.com/alpha-vllm/lumina-t2x
Lumina-T2X is a unified framework for Text to Any Modality Generation
aigc diffusion diffusion-model diffusion-models diffusion-transformer generation-models transformer transformers
Last synced: 11 Apr 2025
https://github.com/rupeshs/fastsdcpu
Fast stable diffusion on CPU and AI PC
aipc api cli cpu desktopgui diffusers diffusion fastsdcpu flux gradio latentconsistencymodels lcmdiffusion openvino qt sdupcale sdxlturbo sdxs stablediffusion torch webui
Last synced: 12 Jan 2026
https://github.com/nunchaku-tech/ComfyUI-nunchaku
ComfyUI Plugin of Nunchaku
comfyui diffusion flux genai mlsys quantization
Last synced: 02 Sep 2025
https://github.com/pollinations/pollinations
Free Open-Source Image and Text Generation
colaboratory colaboratory-notebook diffusion gan generative ipfs javascript machinelearning nodejs python
Last synced: 13 May 2025
https://github.com/foundationvision/llamagen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
auto-regressive-model diffusion diffusion-models image-generation llama llm text2image
Last synced: 15 May 2025
https://github.com/FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
auto-regressive-model diffusion diffusion-models image-generation llama llm text2image
Last synced: 07 May 2025
https://github.com/varunshenoy/opendream
An extensible, easy-to-use, and portable diffusion web UI 👨🎨
ai automatic-1111 diffusion image-generation stable-diffusion
Last synced: 08 Apr 2025
https://github.com/maks-s/sd-akashic
A compendium of informations regarding Stable Diffusion (SD)
diffusion guide stable-diffusion
Last synced: 16 May 2025
https://github.com/Maks-s/sd-akashic
A compendium of informations regarding Stable Diffusion (SD)
diffusion guide stable-diffusion
Last synced: 24 Mar 2025
https://github.com/nvidia/cosmos-tokenizer
A suite of image and video neural tokenizers
diffusion tokenization transformers
Last synced: 30 Oct 2025
https://github.com/tencentarc/brushnet
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
diffusion diffusion-models eccv eccv2024 image-inpainting text-to-image
Last synced: 14 May 2025
https://github.com/intellabs/fastrag
Efficient Retrieval Augmentation and Generation Framework
benchmark colbert diffusion generative-ai information-retrieval knowledge-graph llm multi-modal nlp question-answering semantic-search sentence-transformers summarization transformers
Last synced: 14 May 2025
https://github.com/TencentARC/BrushNet
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
diffusion diffusion-models eccv eccv2024 image-inpainting text-to-image
Last synced: 28 Mar 2025
https://github.com/huggingface/finetrainers
Scalable and memory-optimized training of diffusion models
ai art artificial-intelligence diffusers diffusion diffusion-models pytorch transformers
Last synced: 14 Dec 2025
https://github.com/mini-sora/minisora
MiniSora: A community aims to explore the implementation path and future development direction of Sora.
diffusion sora video-generation
Last synced: 14 May 2025
https://github.com/0xCrunchyy/10x
Optimized inference and fine-tuning framework for diffusion (image & video) models. Up to 3x faster & 80% less VRAM.
artificial-inteligence diffusion diffusion-models fine-tuning flux gpt inference lora pytorch sdxl
Last synced: 09 Jan 2026
https://github.com/uminosachi/sd-webui-inpaint-anything
Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
ai-art anything diffusers diffusion extension generative-art gradio huggingface huggingface-diffusers image-generation image2image img2img inpaint inpaint-anything inpainting latent-diffusion segment segment-anything segmentation stable-diffusion
Last synced: 16 May 2025
https://github.com/River-Zhang/ICEdit
Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Training released! Surpasses GPT-4o in ID persistence! Official ComfyUI workflow release! Only 4GB VRAM is enough to run!
diffusion diffusion-models diffusion-transformer dit editing-image gpt4o gpt4oimage image-editing in-context
Last synced: 12 Jun 2025
https://github.com/a-r-r-o-w/finetrainers
Scalable and memory-optimized training of diffusion models
ai art artificial-intelligence diffusers diffusion diffusion-models pytorch transformers
Last synced: 14 May 2025
https://github.com/thu-lyj-lab/t3bench
T3Bench: Benchmarking Current Progress in Text-to-3D Generation
Last synced: 16 May 2025
https://github.com/declare-lab/tango
A family of diffusion models for text-to-audio generation.
audio-generation diffusion diffusion-models language-models large-language-models text-to-audio
Last synced: 16 May 2025
https://github.com/EdVince/Stable-Diffusion-NCNN
Stable Diffusion in NCNN with c++, supported txt2img and img2img
android clip cpp diffusion executable img2img mnn ncnn onnx stable-diffusion tensorrt tnn txt2img
Last synced: 13 Apr 2025
https://github.com/cloneofsimo/mindiffusion
Self-contained, minimalistic implementation of diffusion models with Pytorch.
Last synced: 13 Apr 2025
https://github.com/IntelLabs/fastRAG
Efficient Retrieval Augmentation and Generation Framework
benchmark colbert diffusion generative-ai information-retrieval knowledge-graph llm multi-modal nlp question-answering semantic-search sentence-transformers summarization transformers
Last synced: 24 Mar 2025
https://github.com/cloneofsimo/minDiffusion
Self-contained, minimalistic implementation of diffusion models with Pytorch.
Last synced: 27 Mar 2025
https://github.com/Uminosachi/sd-webui-inpaint-anything
Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
ai-art anything diffusers diffusion extension generative-art gradio huggingface huggingface-diffusers image-generation image2image img2img inpaint inpaint-anything inpainting latent-diffusion segment segment-anything segmentation stable-diffusion
Last synced: 16 Apr 2025
https://github.com/sail-sg/adan
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
adan artificial-intelligence bert-model convnext cuda-programming deep-learning diffusion dreamfusion fairseq gpt2 llm-training llms mae moe optimizer pytorch resnet timm transformer-xl vit
Last synced: 07 Jul 2025
https://github.com/sail-sg/Adan
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
adan artificial-intelligence bert-model convnext cuda-programming deep-learning diffusion dreamfusion fairseq gpt2 llm-training llms mae moe optimizer pytorch resnet timm transformer-xl vit
Last synced: 05 Apr 2025
https://github.com/castorini/daam
Diffusion attentive attribution maps for interpreting Stable Diffusion.
diffusion explainable-ai generative-ai huggingface pytorch stable-diffusion
Last synced: 16 May 2025
https://github.com/fboulnois/stable-diffusion-docker
Run the official Stable Diffusion releases in a Docker container with txt2img, img2img, depth2img, pix2pix, upscale4x, and inpaint.
dall-e dalle diffusion docker generative-art huggingface image-generation inpainting midjourney pytorch stable-diffusion tensorflow text-to-image
Last synced: 13 Apr 2025
https://github.com/pku-yuangroup/consisid
[CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition
diffusion diffusion-models identity-preserving text-to-video video-generation video-generation-dataset video-generator videogeneration
Last synced: 06 Jul 2025
https://github.com/thu-ml/riflex
Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025)
cogvideox diffusion diffusion-models diffusion-transformer dit extrapolation generative-model hunyuan-video long-video-generation position-embedding rope video-generation
Last synced: 01 Jul 2025
https://github.com/cloneofsimo/paint-with-words-sd
Implementation of Paint-with-words with Stable Diffusion : method from eDiff-I that let you generate image from text-labeled segmentation map.
diffusion generative-model stable-diffusion
Last synced: 05 Apr 2025
https://github.com/some9000/StylePile
A prompt generation helper script for AUTOMATIC1111/stable-diffusion-webui and compatible forks
diffusion generation generator promt stable
Last synced: 08 May 2025
https://github.com/omriav/blended-latent-diffusion
Official implementation for "Blended Latent Diffusion" [SIGGRAPH 2023]
computer-vision deep-learning diffusion diffusion-models generative-model image-generation multimodal multimodal-deep-learning pytorch text-driven-editing text-guided-manipulation text-to-image text-to-image-synthesis
Last synced: 28 Mar 2025
https://github.com/omriav/blended-diffusion
Official implementation for "Blended Diffusion for Text-driven Editing of Natural Images" [CVPR 2022]
blended-diffusion deep-learning diffusion multimodal openai openai-clip text-guided-manipulation text-to-image
Last synced: 28 Mar 2025
https://github.com/williamyang1991/fresco
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
controlnet diffusion video-processing
Last synced: 05 Apr 2025
https://github.com/williamyang1991/FRESCO
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
controlnet diffusion video-processing
Last synced: 28 Mar 2025
https://github.com/microsoft/foldingdiff
Diffusion models of protein structure; trigonometry and attention are all you need!
diffusion diffusion-models protein protein-structure-generation proteins transformer
Last synced: 30 Mar 2025
https://github.com/AspirinCode/papers-for-molecular-design-using-DL
List of molecular design using Generative AI and Deep Learning
deep-generative-models diffusion drug-design energy-based-model gan generative-ai gnns lstm molecular-design prompt-learning reinforcement-learning rnn score-based-generative-models transformer vae
Last synced: 14 Mar 2025
https://github.com/aspirincode/papers-for-molecular-design-using-dl
List of molecular design using Generative AI and Deep Learning
deep-generative-models diffusion drug-design energy-based-model gan generative-ai gnns lstm molecular-design prompt-learning reinforcement-learning rnn score-based-generative-models transformer vae
Last synced: 24 Mar 2025
https://github.com/dromara/omega-ai
Omega-AI:基于java打造的深度学习框架,帮助你快速搭建神经网络,实现模型推理与训练,引擎支持自动求导,多线程与GPU运算,GPU支持CUDA,CUDNN。
ai deeplearning diffusion llm neural-network yolo
Last synced: 08 Jul 2025
https://github.com/LeCAR-Lab/dial-mpc
Official implementation for the paper "Full-Order Sampling-Based MPC for Torque-Level Locomotion Control via Diffusion-Style Annealing". DIAL-MPC is a novel sampling-based MPC framework for legged robot full-order torque-level control with both precision and agility in a training-free manner.
diffusion humanoid legged-robots mpc online-control optimal-control quadruped sampling-based-control
Last synced: 18 Oct 2025
https://github.com/afiaka87/clip-guided-diffusion
A CLI tool/python module for generating images from text using guided diffusion and CLIP from OpenAI.
artificial-intelligence deep-learning diffusion image-generation multimodal multimodality openai openai-clip text-to-image text-to-image-synthesis
Last synced: 05 Aug 2025
https://github.com/Auto1111SDK/Auto1111SDK
An SDK/Python library for Automatic 1111 to run state-of-the-art diffusion models
ai ai-art api automatic1111 deep-learning diffusers diffusion image-generation image-to-image img2img python pytorch stable-diffusion stable-diffusion-webui text-to-image torch txt2img unstable upscaling web
Last synced: 29 Oct 2025
https://github.com/huggingface/open-muse
Open reproduction of MUSE for fast text2image generation.
cv deep-learning diffusion generative-art nlp text2image transformer
Last synced: 14 Oct 2025
https://github.com/scenediffuser/Scene-Diffuser
Official implementation of CVPR23 paper "Diffusion-based Generation, Optimization, and Planning in 3D Scenes"
3d-scene-understanding diffusion generative-model
Last synced: 27 Apr 2025
https://github.com/HaozheLiu-ST/T-GATE
T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!
cross-attention cross-attention-diffusers diffusers diffusion efficiency inference pytorch text2image training-free transformer
Last synced: 13 Mar 2025
https://github.com/ailab-cvc/freenoise
[ICLR 2024] Code for FreeNoise based on VideoCrafter
aigc diffusion generative-model video-diffusion-model
Last synced: 06 Apr 2025
https://github.com/AILab-CVC/FreeNoise
[ICLR 2024] Code for FreeNoise based on VideoCrafter
aigc diffusion generative-model video-diffusion-model
Last synced: 28 Mar 2025
https://github.com/p1atdev/leco
Low-rank adaptation for Erasing COncepts from diffusion models.
diffusion lora stable-diffusion
Last synced: 06 Apr 2025
https://github.com/woctezuma/stable-diffusion-colab
Colab notebook for Stable Diffusion Hyper-SDXL.
colab colab-notebook colaboratory deep-learning diffusers diffusion diffusion-models google-colab google-colab-notebook google-colaboratory huggingface-diffusers hyper-sd hyper-sdxl image-generation stable-diffusion stable-diffusion-xl text-to-image text-to-image-generation text-to-image-synthesis text2image
Last synced: 05 Apr 2025
https://github.com/rehglab/rave
RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models [CVPR 2024]
diffusion stable-diffusion video-editing
Last synced: 15 Sep 2025
https://github.com/nianticlabs/diffusionerf
[CVPR 2023] DiffusioNeRF: Regularizing Neural Radiance Fields with Denoising Diffusion Models
deep-learning diffusion diffusion-models nerf neuralradiance-fields radiance-field reconstruction regularization
Last synced: 07 Apr 2025
https://github.com/qitianwu/DIFFormer
The official implementation for ICLR23 spotlight paper "DIFFormer: Scalable (Graph) Transformers Induced by Energy Constrained Diffusion"
attention diffusion diffusion-equation geometric-deep-learning graph-neural-networks graph-transformer iclr2023 image-classification large-graph node-classification pytorch pytorch-geometric pytorch-geometric-temporal spatial-temporal-forecasting text-classification transformer
Last synced: 27 Mar 2025
https://github.com/zibojia/COCOCO
Video-Inpaint-Anything: This is the inference code for our paper CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility.
cococo diffusion inpainting pytorch sam2 segment segment-anything text-guided text-guided-video-inpainting video-inpainting video-inpainting-with-prompt video-sam2-inpaint
Last synced: 24 Jul 2025
https://github.com/dwctod/eccv2022-papers-with-code-demo
收集 ECCV 最新的成果,包括论文、代码和demo视频等,欢迎大家推荐!
ai computer-vision cv dataset diffusion eccv eccv2022 face-recognition image-segmentation multimodal-deep-learning nerf objection-detection vision-transformer
Last synced: 16 Mar 2025
https://github.com/yeungchenwa/FontDiffuser
[AAAI2024] FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning
deep-learning diffusers diffusion font-generation image-generation
Last synced: 28 Mar 2025
https://github.com/RehgLab/RAVE
RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models [CVPR 2024]
diffusion stable-diffusion video-editing
Last synced: 28 Mar 2025
https://github.com/joaolages/diffusers-interpret
Diffusers-Interpret 🤗🧨🕵️♀️: Model explainability for 🤗 Diffusers. Get explanations for your generated images.
computer-vision deep-learning diffusers diffusion explainable-ai image-generation interpretability model-explainability primary-attributions pytorch text2image transformers
Last synced: 05 Apr 2025
https://github.com/baaivision/diva
[ICLR 2025] Diffusion Feedback Helps CLIP See Better
clip diffusion visual-perception
Last synced: 08 Oct 2025
https://github.com/byliutao/1Prompt1Story
🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt
diffusion diffusion-model diffusion-models storytelling text-to-image
Last synced: 13 Oct 2025
https://github.com/RQ-Wu/LAMP
[CVPR 2024] | LAMP: Learn a Motion Pattern for Few-Shot Based Video Generation
aigc diffusion diffusion-model diffusion-models few-shot-learning stable-diffusion text-to-video video-editing
Last synced: 28 Mar 2025
https://github.com/ZichengDuan/TheChosenOne
Unofficial implementation of the paper "The Chosen One: Consistent Characters in Text-to-Image Diffusion Models"
deep-learning diffusion dinov2 generative-art generative-model
Last synced: 27 Mar 2025
https://github.com/mbodiai/embodied-agents
Seamlessly integrate state-of-the-art transformer models into robotics stacks
agents artificial-intelligence diffusion embodied embodied-agent embodied-agents generative-ai large-language-models llm mbodi mbodiai multimodal robotics transformer vision-language-model vlm
Last synced: 25 Sep 2025
https://sirui-xu.github.io/InterDiff/
[ICCV 2023] Official PyTorch implementation of the paper "InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion"
3d-human-pose 6d deep-learning diffusion diffusion-models generative-ai generative-model human-motion-prediction human-object-interaction human-scene-interaction motion-prediction object-pose
Last synced: 23 Apr 2025
https://github.com/Sirui-Xu/InterDiff
[ICCV 2023] Official PyTorch implementation of the paper "InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion"
3d-human-pose 6d deep-learning diffusion diffusion-models generative-ai generative-model human-motion-prediction human-object-interaction human-scene-interaction motion-prediction object-pose
Last synced: 23 Apr 2025
https://github.com/keonlee9420/diffsinger
PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)
ddpm diffsinger diffusion diffusion-models english fastspeech neural-tts non-autoregressive pytorch singing-voice speech-synthesis text-to-speech tts
Last synced: 12 Oct 2025
https://github.com/tensorstack-ai/onnxstack
C# Stable Diffusion using ONNX Runtime
csharp diffusion direct-ml img2img inpainting net7 onnx-model onnxruntime stable-diffusion txt2img
Last synced: 15 May 2025
https://github.com/jiauzhang/dragdiffusion
Implementation of DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing
diffusion draggan image-editing
Last synced: 07 Oct 2025
https://github.com/uminosachi/inpaint-anything
Inpaint Anything performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
ai-art anything diffusers diffusion generative-art gradio huggingface huggingface-diffusers image-generation image2image img2img inpaint inpaint-anything inpainting latent-diffusion segment segment-anything segmentation stable-diffusion
Last synced: 08 Apr 2025
https://github.com/mihirp1998/VADER
Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various reward models such as HPS, PickScore, VideoMAE, VJEPA, YOLO, Aesthetics etc.
alignment diffusion reinforcement-learning reinforcement-learning-human-feedback rl rlhf vader video-diffusion video-diffusion-alignment
Last synced: 28 Mar 2025
https://github.com/ssube/onnx-web
web UI for GPU-accelerated ONNX pipelines like Stable Diffusion, even on Windows and AMD
ai-art amd diffusion esrgan flask generative-art gfpgan image-generation linux nvidia python pytorch reactjs stable-diffusion super-resolution text2image upscaling web-app windows
Last synced: 16 May 2025