Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with diffusion
A curated list of projects in awesome lists tagged with diffusion.
https://github.com/automatic1111/stable-diffusion-webui
Stable Diffusion web UI
ai ai-art deep-learning diffusion gradio image-generation image2image img2img pytorch stable-diffusion text2image torch txt2img unstable upscaling web
Last synced: 16 Dec 2024
https://github.com/AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
ai ai-art deep-learning diffusion gradio image-generation image2image img2img pytorch stable-diffusion text2image torch txt2img unstable upscaling web
Last synced: 25 Oct 2024
https://github.com/huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
deep-learning diffusion flax hacktoberfest image-generation image2image jax latent-diffusion-models pytorch score-based-generative-modeling stable-diffusion stable-diffusion-diffusers text2image
Last synced: 16 Dec 2024
https://github.com/huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
adapter diffusion llm lora parameter-efficient-learning python pytorch transformers
Last synced: 16 Dec 2024
https://github.com/datawhalechina/leedl-tutorial
Hung-yi Lee's Deep Learning Tutorial (recommended by Prof. Hung-yi Lee 👍, nicknamed the "Apple Book" 🍎). PDF download: https://github.com/datawhalechina/leedl-tutorial/releases
bert chatgpt cnn deep-learning diffusion gan leedl-tutorial machine-learning network-compression pruning reinforcement-learning rnn self-attention transfer-learning transformer tutorial
Last synced: 16 Dec 2024
https://github.com/easydiffusion/easydiffusion
Easiest 1-click way to create beautiful artwork on your PC using AI, with no tech knowledge. Provides a browser UI for generating images from text prompts and images. Just enter your text prompt, and see the generated image.
art diffusion generative-art gui stable
Last synced: 18 Dec 2024
https://github.com/cloneofsimo/lora
Using Low-rank adaptation to quickly fine-tune diffusion models.
diffusion dreambooth fine-tuning lora stable-diffusion
Last synced: 19 Dec 2024
https://github.com/open-mmlab/mmagic
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awesome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.
aigc computer-vision deep-learning diffusion diffusion-models generative-adversarial-network generative-ai image-editing image-generation image-processing image-synthesis inpainting matting pytorch super-resolution text2image video-frame-interpolation video-interpolation video-super-resolution
Last synced: 16 Dec 2024
https://github.com/jina-ai/discoart
🪩 Create Disco Diffusion artworks in one line
clip-guided-diffusion creative-ai creative-art cross-modal dalle diffusion disco-diffusion discodiffusion generative-art imgen latent-diffusion midjourney multimodal prompts stable-diffusion
Last synced: 15 Dec 2024
https://github.com/leejet/stable-diffusion.cpp
Stable Diffusion and Flux in pure C/C++
ai cplusplus diffusion flux flux-dev flux-schnell ggml image-generation image2image img2img latent-diffusion stable-diffusion text2image txt2img
Last synced: 18 Dec 2024
https://github.com/riffusion/riffusion-hobby
Stable diffusion for real-time music generation
ai audio diffusers diffusion music stable-diffusion
Last synced: 18 Dec 2024
https://github.com/williamyang1991/rerender_a_video
[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
controlnet diffusion video-processing
Last synced: 20 Dec 2024
https://github.com/williamyang1991/Rerender_A_Video
[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
controlnet diffusion video-processing
Last synced: 07 Nov 2024
https://github.com/ai-forever/kandinsky-2
Kandinsky 2 — multilingual text2image latent diffusion model
diffusion image-generation image2image inpainting ipython-notebook kandinsky outpainting text-to-image text2image
Last synced: 20 Dec 2024
https://github.com/ai-forever/Kandinsky-2
Kandinsky 2 — multilingual text2image latent diffusion model
diffusion image-generation image2image inpainting ipython-notebook kandinsky outpainting text-to-image text2image
Last synced: 06 Nov 2024
https://github.com/openvpi/DiffSinger
An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
acoustic-model diffusion diffussion-model melody-frontend midi pitch-prediction rectified-flow singing-voice singing-voice-synthesis svs
Last synced: 03 Nov 2024
https://github.com/openvpi/diffsinger
An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
acoustic-model diffusion diffussion-model melody-frontend midi pitch-prediction rectified-flow singing-voice singing-voice-synthesis svs
Last synced: 30 Sep 2024
https://github.com/playvoice/whisper-vits-svc
Core Engine of Singing Voice Conversion & Singing Voice Clone
change diff-svc diffusion diffusion-svc singing-voice-conversion sovits svc vits vits2 voice
Last synced: 20 Dec 2024
https://github.com/riffusion/riffusion-app-hobby
Stable diffusion for real-time music generation (web app)
ai audio diffusion music nextjs stable-diffusion threejs
Last synced: 18 Dec 2024
https://github.com/PlayVoice/whisper-vits-svc
Core Engine of Singing Voice Conversion & Singing Voice Clone
change diff-svc diffusion diffusion-svc singing-voice-conversion sovits svc vits vits2 voice
Last synced: 03 Sep 2024
https://github.com/tmelyralab/musev
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
diffusion human-video-generation image2video infinite-length musev video-generation
Last synced: 18 Dec 2024
https://github.com/TMElyralab/MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
diffusion human-video-generation image2video infinite-length musev video-generation
Last synced: 07 Nov 2024
https://github.com/prs-eth/Marigold
[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
diffusion in-the-wild monocular-depth-estimation zero-shot
Last synced: 31 Oct 2024
https://github.com/prs-eth/marigold
[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
diffusion in-the-wild monocular-depth-estimation zero-shot
Last synced: 19 Dec 2024
https://github.com/alpha-vllm/lumina-t2x
Lumina-T2X is a unified framework for Text to Any Modality Generation
aigc diffusion diffusion-model diffusion-models diffusion-transformer generation-models transformer transformers
Last synced: 19 Dec 2024
https://github.com/varunshenoy/opendream
An extensible, easy-to-use, and portable diffusion web UI 👨‍🎨
ai automatic-1111 diffusion image-generation stable-diffusion
Last synced: 14 Dec 2024
https://github.com/maks-s/sd-akashic
A compendium of information regarding Stable Diffusion (SD)
diffusion guide stable-diffusion
Last synced: 30 Nov 2024
https://github.com/Maks-s/sd-akashic
A compendium of information regarding Stable Diffusion (SD)
diffusion guide stable-diffusion
Last synced: 29 Oct 2024
https://github.com/rupeshs/fastsdcpu
Fast stable diffusion on CPU
api cli cpu desktopgui diffusers diffusion edsr fastsdcpu flux gradio latentconsistencymodels lcmdiffusion openvino qt sdupcale sdxlturbo sdxs stablediffusion torch webui
Last synced: 19 Dec 2024
https://github.com/tencentarc/brushnet
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
diffusion diffusion-models eccv eccv2024 image-inpainting text-to-image
Last synced: 19 Dec 2024
https://github.com/amirhossein-kz/Awesome-Diffusion-Models-in-Medical-Imaging
Diffusion Models in Medical Imaging (Published in Medical Image Analysis Journal)
ddpm deep-learning denoising diffusion diffusion-models generation generative-models machine-learning medical-imaging ncsn reconstruction score-based score-matching sde segmentation vae
Last synced: 16 Nov 2024
https://github.com/TencentARC/BrushNet
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
diffusion diffusion-models eccv eccv2024 image-inpainting text-to-image
Last synced: 31 Oct 2024
https://github.com/foundationvision/llamagen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
auto-regressive-model diffusion diffusion-models image-generation llama llm text2image
Last synced: 19 Dec 2024
https://github.com/FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
auto-regressive-model diffusion diffusion-models image-generation llama llm text2image
Last synced: 14 Nov 2024
https://github.com/mini-sora/minisora
MiniSora: A community that aims to explore the implementation path and future development direction of Sora.
diffusion sora video-generation
Last synced: 20 Dec 2024
https://github.com/Alpha-VLLM/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
aigc diffusion diffusion-model diffusion-models diffusion-transformer generation-models transformer transformers
Last synced: 29 Oct 2024
https://github.com/thu-lyj-lab/t3bench
T3Bench: Benchmarking Current Progress in Text-to-3D Generation
Last synced: 17 Dec 2024
https://github.com/declare-lab/tango
A family of diffusion models for text-to-audio generation.
audio-generation diffusion diffusion-models language-models large-language-models text-to-audio
Last synced: 20 Dec 2024
https://github.com/nvidia/cosmos-tokenizer
A suite of image and video neural tokenizers
diffusion tokenization transformers
Last synced: 20 Dec 2024
https://github.com/EdVince/Stable-Diffusion-NCNN
Stable Diffusion in NCNN with C++, supporting txt2img and img2img
android clip cpp diffusion executable img2img mnn ncnn onnx stable-diffusion tensorrt tnn txt2img
Last synced: 07 Nov 2024
https://github.com/IntelLabs/fastRAG
Efficient Retrieval Augmentation and Generation Framework
benchmark colbert diffusion generative-ai information-retrieval knowledge-graph llm multi-modal nlp question-answering semantic-search sentence-transformers summarization transformers
Last synced: 28 Oct 2024
https://github.com/intellabs/fastrag
Efficient Retrieval Augmentation and Generation Framework
benchmark colbert diffusion generative-ai information-retrieval knowledge-graph llm multi-modal nlp question-answering semantic-search sentence-transformers summarization transformers
Last synced: 21 Dec 2024
https://github.com/Uminosachi/sd-webui-inpaint-anything
Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
ai-art anything diffusers diffusion extension generative-art gradio huggingface huggingface-diffusers image-generation image2image img2img inpaint inpaint-anything inpainting latent-diffusion segment segment-anything segmentation stable-diffusion
Last synced: 08 Nov 2024
https://github.com/uminosachi/sd-webui-inpaint-anything
Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
ai-art anything diffusers diffusion extension generative-art gradio huggingface huggingface-diffusers image-generation image2image img2img inpaint inpaint-anything inpainting latent-diffusion segment segment-anything segmentation stable-diffusion
Last synced: 15 Dec 2024
https://github.com/cloneofsimo/mindiffusion
Self-contained, minimalistic implementation of diffusion models with PyTorch.
Last synced: 20 Dec 2024
https://github.com/cloneofsimo/minDiffusion
Self-contained, minimalistic implementation of diffusion models with PyTorch.
Last synced: 30 Oct 2024
https://github.com/sail-sg/Adan
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
adan artificial-intelligence bert-model convnext cuda-programming deep-learning diffusion dreamfusion fairseq gpt2 llm-training llms mae moe optimizer pytorch resnet timm transformer-xl vit
Last synced: 05 Nov 2024
https://github.com/fboulnois/stable-diffusion-docker
Run the official Stable Diffusion releases in a Docker container with txt2img, img2img, depth2img, pix2pix, upscale4x, and inpaint.
dall-e dalle diffusion docker generative-art huggingface image-generation inpainting midjourney pytorch stable-diffusion tensorflow text-to-image
Last synced: 18 Dec 2024
https://github.com/wladradchenko/wunjo.wladradchenko.ru
Wunjo AI: Synthesize & clone voices in English, Russian & Chinese, real-time speech recognition, deepfake face & lips animation, face swap with one photo, change video by text prompts, segmentation, and retouching. Open-source, local & free.
controlnet deepfake deepfake-emotion deepfakes diffusion face-swap face-swapping free image-animation retouching-video segment-anything tacotron2 talking-face talking-face-generation talking-head tts vid2vid voice-cloning voice-recognition wunjo
Last synced: 17 Nov 2024
https://github.com/cloneofsimo/paint-with-words-sd
Implementation of Paint-with-words with Stable Diffusion: a method from eDiff-I that lets you generate an image from a text-labeled segmentation map.
diffusion generative-model stable-diffusion
Last synced: 21 Dec 2024
https://github.com/castorini/daam
Diffusion attentive attribution maps for interpreting Stable Diffusion.
diffusion explainable-ai generative-ai huggingface pytorch stable-diffusion
Last synced: 16 Dec 2024
https://github.com/some9000/StylePile
A prompt generation helper script for AUTOMATIC1111/stable-diffusion-webui and compatible forks
diffusion generation generator promt stable
Last synced: 14 Nov 2024
https://github.com/omriav/blended-latent-diffusion
Official implementation for "Blended Latent Diffusion" [SIGGRAPH 2023]
computer-vision deep-learning diffusion diffusion-models generative-model image-generation multimodal multimodal-deep-learning pytorch text-driven-editing text-guided-manipulation text-to-image text-to-image-synthesis
Last synced: 31 Oct 2024
https://github.com/omriav/blended-diffusion
Official implementation for "Blended Diffusion for Text-driven Editing of Natural Images" [CVPR 2022]
blended-diffusion deep-learning diffusion multimodal openai openai-clip text-guided-manipulation text-to-image
Last synced: 31 Oct 2024
https://github.com/williamyang1991/fresco
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
controlnet diffusion video-processing
Last synced: 21 Dec 2024
https://github.com/williamyang1991/FRESCO
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
controlnet diffusion video-processing
Last synced: 31 Oct 2024
https://github.com/AspirinCode/papers-for-molecular-design-using-DL
List of molecular design using Generative AI and Deep Learning
deep-generative-models diffusion drug-design energy-based-model gan generative-ai gnns lstm molecular-design prompt-learning reinforcement-learning rnn score-based-generative-models transformer vae
Last synced: 26 Oct 2024
https://github.com/aspirincode/papers-for-molecular-design-using-dl
List of molecular design using Generative AI and Deep Learning
deep-generative-models diffusion drug-design energy-based-model gan generative-ai gnns lstm molecular-design prompt-learning reinforcement-learning rnn score-based-generative-models transformer vae
Last synced: 01 Dec 2024
https://github.com/microsoft/foldingdiff
Diffusion models of protein structure; trigonometry and attention are all you need!
diffusion diffusion-models protein protein-structure-generation proteins transformer
Last synced: 01 Nov 2024
https://github.com/afiaka87/clip-guided-diffusion
A CLI tool/python module for generating images from text using guided diffusion and CLIP from OpenAI.
artificial-intelligence deep-learning diffusion image-generation multimodal multimodality openai openai-clip text-to-image text-to-image-synthesis
Last synced: 17 Nov 2024
https://github.com/Auto1111SDK/Auto1111SDK
An SDK/Python library for Automatic 1111 to run state-of-the-art diffusion models
ai ai-art api automatic1111 deep-learning diffusers diffusion image-generation image-to-image img2img python pytorch stable-diffusion stable-diffusion-webui text-to-image torch txt2img unstable upscaling web
Last synced: 12 Oct 2024
https://github.com/scenediffuser/Scene-Diffuser
Official implementation of CVPR23 paper "Diffusion-based Generation, Optimization, and Planning in 3D Scenes"
3d-scene-understanding diffusion generative-model
Last synced: 11 Nov 2024
https://github.com/pollinations/pollinations
Free Open-Source Image and Text Generation
colaboratory colaboratory-notebook diffusion gan generative ipfs javascript machinelearning nodejs python
Last synced: 16 Dec 2024
https://github.com/AILab-CVC/FreeNoise
[ICLR 2024] Code for FreeNoise based on VideoCrafter
aigc diffusion generative-model video-diffusion-model
Last synced: 31 Oct 2024
https://github.com/ailab-cvc/freenoise
[ICLR 2024] Code for FreeNoise based on VideoCrafter
aigc diffusion generative-model video-diffusion-model
Last synced: 16 Dec 2024
https://github.com/woctezuma/stable-diffusion-colab
Colab notebook for Stable Diffusion Hyper-SDXL.
colab colab-notebook colaboratory deep-learning diffusers diffusion diffusion-models google-colab google-colab-notebook google-colaboratory huggingface-diffusers hyper-sd hyper-sdxl image-generation stable-diffusion stable-diffusion-xl text-to-image text-to-image-generation text-to-image-synthesis text2image
Last synced: 20 Dec 2024
https://github.com/nianticlabs/diffusionerf
[CVPR 2023] DiffusioNeRF: Regularizing Neural Radiance Fields with Denoising Diffusion Models
deep-learning diffusion diffusion-models nerf neuralradiance-fields radiance-field reconstruction regularization
Last synced: 17 Dec 2024
https://github.com/qitianwu/DIFFormer
The official implementation for ICLR23 spotlight paper "DIFFormer: Scalable (Graph) Transformers Induced by Energy Constrained Diffusion"
attention diffusion diffusion-equation geometric-deep-learning graph-neural-networks graph-transformer iclr2023 image-classification large-graph node-classification pytorch pytorch-geometric pytorch-geometric-temporal spatial-temporal-forecasting text-classification transformer
Last synced: 30 Oct 2024
https://github.com/dwctod/eccv2022-papers-with-code-demo
A collection of the latest ECCV work, including papers, code, and demo videos. Recommendations are welcome!
ai computer-vision cv dataset diffusion eccv eccv2022 face-recognition image-segmentation multimodal-deep-learning nerf objection-detection vision-transformer
Last synced: 21 Nov 2024
https://github.com/RehgLab/RAVE
RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models - CVPR 2024 - Official Repo
diffusion stable-diffusion video-editing
Last synced: 31 Oct 2024
https://github.com/RQ-Wu/LAMP
Official implementation of LAMP: Learn a Motion Pattern by Few-Shot Tuning a Text-to-Image Diffusion Model (Few-shot-based text-to-video diffusion)
aigc diffusion diffusion-model diffusion-models few-shot-learning stable-diffusion text-to-video video-editing
Last synced: 31 Oct 2024
https://github.com/yeungchenwa/FontDiffuser
[AAAI2024] FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning
deep-learning diffusers diffusion font-generation image-generation
Last synced: 31 Oct 2024
https://github.com/ZichengDuan/TheChosenOne
Unofficial implementation of the paper "The Chosen One: Consistent Characters in Text-to-Image Diffusion Models"
deep-learning diffusion dinov2 generative-art generative-model
Last synced: 30 Oct 2024
https://github.com/jiauzhang/dragdiffusion
Implementation of DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing
diffusion draggan image-editing
Last synced: 18 Dec 2024
https://github.com/keonlee9420/diffsinger
PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)
ddpm diffsinger diffusion diffusion-models english fastspeech neural-tts non-autoregressive pytorch singing-voice speech-synthesis text-to-speech tts
Last synced: 02 Oct 2024
https://github.com/baaivision/diva
Diffusion Feedback Helps CLIP See Better
clip diffusion visual-perception
Last synced: 21 Dec 2024
https://github.com/uminosachi/inpaint-anything
Inpaint Anything performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
ai-art anything diffusers diffusion generative-art gradio huggingface huggingface-diffusers image-generation image2image img2img inpaint inpaint-anything inpainting latent-diffusion segment segment-anything segmentation stable-diffusion
Last synced: 20 Dec 2024
https://github.com/Sirui-Xu/InterDiff
[ICCV 2023] Official PyTorch implementation of the paper "InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion"
3d-human-pose 6d deep-learning diffusion diffusion-models generative-ai generative-model human-motion-prediction human-object-interaction human-scene-interaction motion-prediction object-pose
Last synced: 10 Nov 2024
https://sirui-xu.github.io/InterDiff/
[ICCV 2023] Official PyTorch implementation of the paper "InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion"
3d-human-pose 6d deep-learning diffusion diffusion-models generative-ai generative-model human-motion-prediction human-object-interaction human-scene-interaction motion-prediction object-pose
Last synced: 10 Nov 2024
https://github.com/ssube/onnx-web
web UI for GPU-accelerated ONNX pipelines like Stable Diffusion, even on Windows and AMD
ai-art amd diffusion esrgan flask generative-art gfpgan image-generation linux nvidia python pytorch reactjs stable-diffusion super-resolution text2image upscaling web-app windows
Last synced: 15 Dec 2024
https://github.com/zhanghm1995/Forge_VFM4AD
A comprehensive survey of forging vision foundation models for autonomous driving, including challenges, methodologies, and opportunities.
3dgs adaptation autonomous-driving diffusion end-to-end-autonomous-driving foundation-model large-language-models nerf pre-training survey world-models
Last synced: 30 Nov 2024
https://github.com/pansanity666/Awesome-Avatars
List of recent advances for human avatars, including generation, reconstruction, and editing, etc.
3dreconstruction aigc avatar diffusion humannerf image-to-3d motion-generation nerf neural-rendering sdf smpl stable-diffusion t23d text-to-3d tt3d
Last synced: 12 Sep 2024
https://github.com/benedekrozemberczki/GraphWaveMachine
A scalable implementation of "Learning Structural Node Embeddings Via Diffusion Wavelets (KDD 2018)".
diffusion embedding factorization graph-embedding graph-wavelet graphwave heat-kernel kdd laplacian machine-learning node2vec refex rolx spectral struc2vec structural-embedding structural-role unsupervised-learning wavelet word2vec
Last synced: 11 Nov 2024
https://github.com/benedekrozemberczki/graphwavemachine
A scalable implementation of "Learning Structural Node Embeddings Via Diffusion Wavelets (KDD 2018)".
diffusion embedding factorization graph-embedding graph-wavelet graphwave heat-kernel kdd laplacian machine-learning node2vec refex rolx spectral struc2vec structural-embedding structural-role unsupervised-learning wavelet word2vec
Last synced: 19 Dec 2024
https://github.com/rose-stl-lab/dyffusion
[NeurIPS 2023] A Dynamics-informed Diffusion Model for Spatiotemporal Forecasting
deep-learning diffusion diffusion-models ensemble-forecasts machine-learning neurips neurips-2023 probabilistic-forecasting pytorch pytorch-lightning spatiotemporal-forecasting
Last synced: 15 Dec 2024
https://github.com/mbodiai/embodied-agents
Seamlessly integrate state-of-the-art transformer models into robotics stacks
agents artificial-intelligence diffusion embodied embodied-agent embodied-agents generative-ai large-language-models llm mbodi mbodiai multimodal robotics transformer vision-language-model vlm
Last synced: 05 Dec 2024
https://github.com/rainbowluocs/diffusiontrack
[AAAI 2024] DiffusionTrack: Diffusion Model For Multi-Object Tracking. DiffusionTrack is the first work to employ the diffusion model for multi-object tracking by formulating it as a generative noise-to-tracking diffusion process.
diffusion multi-object-tracking object-detection tracking
Last synced: 31 Oct 2024
https://barquerogerman.github.io/FlowMDM/
[CVPR 2024] Official Implementation of "Seamless Human Motion Composition with Blended Positional Encodings".
cvpr cvpr2024 diffusion generative-model human-motion human-motion-composition human-motion-extrapolation motion-generation
Last synced: 03 Dec 2024
https://github.com/L-YeZhu/CDCD
[ICLR2023] Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation (CDCD).
contrastive diffusion music-generation text2image
Last synced: 10 Nov 2024
https://github.com/ozanciga/diffusion-for-beginners
denoising diffusion models, as simple as possible
dall-e diffusion imagen midjourney pytorch scheduler stable-diffusion
Last synced: 17 Nov 2024
https://github.com/safreita1/TIGER
Python toolbox to evaluate graph vulnerability and robustness (CIKM 2021)
adversarial-attacks attack cascading-failures data-mining data-science defense diffusion epidemics graph graph-attack graph-mining machine-learning netshield network-attack networks robustness simulation vulnerability
Last synced: 12 Nov 2024
https://github.com/iamncj/dilightnet
Official Code Release for [SIGGRAPH 2024] DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation
controlnet diffusers diffusion diffusion-models image-generation lighting-control relighting siggraph stable-diffusion
Last synced: 15 Dec 2024
https://github.com/happylittlecat2333/Auffusion
Official code and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation"
audio-generation diffusion diffusion-models large-language-models text-to-audio
Last synced: 14 Nov 2024
https://github.com/mihirp1998/VADER
Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various reward models such as HPS, PickScore, VideoMAE, VJEPA, YOLO, Aesthetics etc.
alignment diffusion reinforcement-learning reinforcement-learning-human-feedback rl rlhf vader video-diffusion video-diffusion-alignment
Last synced: 31 Oct 2024
https://github.com/ducha-aiki/manifold-diffusion
Diffusion on manifolds for image retrieval
diffusion image-retrieval manifold python reranking
Last synced: 01 Nov 2024
https://github.com/benedekrozemberczki/diff2vec
Reference implementation of Diffusion2Vec (Complenet 2018) built on Gensim and NetworkX.
complex-networks deep-learning deepwalk diff2vec diffusion embedding embeddings factorization gensim graph-embedding implicit-factorization machine-learning network-embedding neural-network node-embedding node2vec semisupervised-learning struc2vec tensorflow unsupervised-learning
Last synced: 14 Nov 2024
https://github.com/zibojia/COCOCO
Video-Inpaint-Anything: This is the inference code for our paper CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility.
cococo diffusion inpainting pytorch sam2 segment segment-anything text-guided text-guided-video-inpainting video-inpainting video-sam2-inpaint
Last synced: 30 Nov 2024
https://github.com/eliahuhorwitz/conffusion
Official Implementation for the "Conffusion: Confidence Intervals for Diffusion Models" paper.
computer-vision conformal-prediction deep-learning diffusion diffusion-models generative-model image-generation image-to-image inpainting prediction-intervals pytorch quantile-regression super-resolution uncertainty-estimation uncertainty-quantification
Last synced: 10 Nov 2024
https://github.com/juliahealth/komamri.jl
Koma is a Pulseq-compatible framework to efficiently simulate Magnetic Resonance Imaging (MRI) acquisitions. The main focus of this package is to simulate general scenarios that could arise in pulse sequence development.
cardiac diffusion diffusion-mri gpu-acceleration mri simulation
Last synced: 25 Nov 2024