Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists tagged with diffusion

A curated list of projects in awesome lists tagged with diffusion .

https://github.com/huggingface/peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

adapter diffusion llm lora parameter-efficient-learning python pytorch transformers

Last synced: 16 Dec 2024

https://github.com/datawhalechina/leedl-tutorial

《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases

bert chatgpt cnn deep-learning diffusion gan leedl-tutorial machine-learning network-compression pruning reinforcement-learning rnn self-attention transfer-learning transformer tutorial

Last synced: 16 Dec 2024

https://github.com/easydiffusion/easydiffusion

Easiest 1-click way to create beautiful artwork on your PC using AI, with no tech knowledge. Provides a browser UI for generating images from text prompts and images. Just enter your text prompt, and see the generated image.

art diffusion generative-art gui stable

Last synced: 18 Dec 2024

https://github.com/cloneofsimo/lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

diffusion dreambooth fine-tuning lora stable-diffusion

Last synced: 19 Dec 2024

https://github.com/open-mmlab/mmagic

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.

aigc computer-vision deep-learning diffusion diffusion-models generative-adversarial-network generative-ai image-editing image-generation image-processing image-synthesis inpainting matting pytorch super-resolution text2image video-frame-interpolation video-interpolation video-super-resolution

Last synced: 16 Dec 2024

https://github.com/riffusion/riffusion-hobby

Stable diffusion for real-time music generation

ai audio diffusers diffusion music stable-diffusion

Last synced: 18 Dec 2024

https://github.com/williamyang1991/rerender_a_video

[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation

controlnet diffusion video-processing

Last synced: 20 Dec 2024

https://github.com/williamyang1991/Rerender_A_Video

[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation

controlnet diffusion video-processing

Last synced: 07 Nov 2024

https://github.com/ai-forever/kandinsky-2

Kandinsky 2 — multilingual text2image latent diffusion model

diffusion image-generation image2image inpainting ipython-notebook kandinsky outpainting text-to-image text2image

Last synced: 20 Dec 2024

https://github.com/ai-forever/Kandinsky-2

Kandinsky 2 — multilingual text2image latent diffusion model

diffusion image-generation image2image inpainting ipython-notebook kandinsky outpainting text-to-image text2image

Last synced: 06 Nov 2024

https://github.com/openvpi/DiffSinger

An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism

acoustic-model diffusion diffussion-model melody-frontend midi pitch-prediction rectified-flow singing-voice singing-voice-synthesis svs

Last synced: 03 Nov 2024

https://github.com/openvpi/diffsinger

An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism

acoustic-model diffusion diffussion-model melody-frontend midi pitch-prediction rectified-flow singing-voice singing-voice-synthesis svs

Last synced: 30 Sep 2024

https://github.com/playvoice/whisper-vits-svc

Core Engine of Singing Voice Conversion & Singing Voice Clone

change diff-svc diffusion diffusion-svc singing-voice-conversion sovits svc vits vits2 voice

Last synced: 20 Dec 2024

https://github.com/riffusion/riffusion-app-hobby

Stable diffusion for real-time music generation (web app)

ai audio diffusion music nextjs stable-diffusion threejs

Last synced: 18 Dec 2024

https://github.com/PlayVoice/whisper-vits-svc

Core Engine of Singing Voice Conversion & Singing Voice Clone

change diff-svc diffusion diffusion-svc singing-voice-conversion sovits svc vits vits2 voice

Last synced: 03 Sep 2024

https://github.com/tmelyralab/musev

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

diffusion human-video-generation image2video infinite-length musev video-generation

Last synced: 18 Dec 2024

https://github.com/TMElyralab/MuseV

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

diffusion human-video-generation image2video infinite-length musev video-generation

Last synced: 07 Nov 2024

https://github.com/prs-eth/Marigold

[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

diffusion in-the-wild monocular-depth-estimation zero-shot

Last synced: 31 Oct 2024

https://github.com/prs-eth/marigold

[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

diffusion in-the-wild monocular-depth-estimation zero-shot

Last synced: 19 Dec 2024

https://github.com/alpha-vllm/lumina-t2x

Lumina-T2X is a unified framework for Text to Any Modality Generation

aigc diffusion diffusion-model diffusion-models diffusion-transformer generation-models transformer transformers

Last synced: 19 Dec 2024

https://github.com/varunshenoy/opendream

An extensible, easy-to-use, and portable diffusion web UI 👨‍🎨

ai automatic-1111 diffusion image-generation stable-diffusion

Last synced: 14 Dec 2024

https://github.com/maks-s/sd-akashic

A compendium of informations regarding Stable Diffusion (SD)

diffusion guide stable-diffusion

Last synced: 30 Nov 2024

https://github.com/Maks-s/sd-akashic

A compendium of informations regarding Stable Diffusion (SD)

diffusion guide stable-diffusion

Last synced: 29 Oct 2024

https://github.com/tencentarc/brushnet

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

diffusion diffusion-models eccv eccv2024 image-inpainting text-to-image

Last synced: 19 Dec 2024

https://github.com/TencentARC/BrushNet

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

diffusion diffusion-models eccv eccv2024 image-inpainting text-to-image

Last synced: 31 Oct 2024

https://github.com/foundationvision/llamagen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

auto-regressive-model diffusion diffusion-models image-generation llama llm text2image

Last synced: 19 Dec 2024

https://github.com/FoundationVision/LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

auto-regressive-model diffusion diffusion-models image-generation llama llm text2image

Last synced: 14 Nov 2024

https://github.com/mini-sora/minisora

MiniSora: A community aims to explore the implementation path and future development direction of Sora.

diffusion sora video-generation

Last synced: 20 Dec 2024

https://github.com/Alpha-VLLM/Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

aigc diffusion diffusion-model diffusion-models diffusion-transformer generation-models transformer transformers

Last synced: 29 Oct 2024

https://github.com/thu-lyj-lab/t3bench

T3Bench: Benchmarking Current Progress in Text-to-3D Generation

3d diffusion nerf text-to-3d

Last synced: 17 Dec 2024

https://github.com/declare-lab/tango

A family of diffusion models for text-to-audio generation.

audio-generation diffusion diffusion-models language-models large-language-models text-to-audio

Last synced: 20 Dec 2024

https://github.com/nvidia/cosmos-tokenizer

A suite of image and video neural tokenizers

diffusion tokenization transformers

Last synced: 20 Dec 2024

https://github.com/EdVince/Stable-Diffusion-NCNN

Stable Diffusion in NCNN with c++, supported txt2img and img2img

android clip cpp diffusion executable img2img mnn ncnn onnx stable-diffusion tensorrt tnn txt2img

Last synced: 07 Nov 2024

https://github.com/cloneofsimo/mindiffusion

Self-contained, minimalistic implementation of diffusion models with Pytorch.

diffusion pytorch

Last synced: 20 Dec 2024

https://github.com/cloneofsimo/minDiffusion

Self-contained, minimalistic implementation of diffusion models with Pytorch.

diffusion pytorch

Last synced: 30 Oct 2024

https://github.com/fboulnois/stable-diffusion-docker

Run the official Stable Diffusion releases in a Docker container with txt2img, img2img, depth2img, pix2pix, upscale4x, and inpaint.

dall-e dalle diffusion docker generative-art huggingface image-generation inpainting midjourney pytorch stable-diffusion tensorflow text-to-image

Last synced: 18 Dec 2024

https://github.com/wladradchenko/wunjo.wladradchenko.ru

Wunjo AI: Synthesize & clone voices in English, Russian & Chinese, real-time speech recognition, deepfake face & lips animation, face swap with one photo, change video by text prompts, segmentation, and retouching. Open-source, local & free.

controlnet deepfake deepfake-emotion deepfakes diffusion face-swap face-swapping free image-animation retouching-video segment-anything tacotron2 talking-face talking-face-generation talking-head tts vid2vid voice-cloning voice-recognition wunjo

Last synced: 17 Nov 2024

https://github.com/cloneofsimo/paint-with-words-sd

Implementation of Paint-with-words with Stable Diffusion : method from eDiff-I that let you generate image from text-labeled segmentation map.

diffusion generative-model stable-diffusion

Last synced: 21 Dec 2024

https://github.com/castorini/daam

Diffusion attentive attribution maps for interpreting Stable Diffusion.

diffusion explainable-ai generative-ai huggingface pytorch stable-diffusion

Last synced: 16 Dec 2024

https://github.com/some9000/StylePile

A prompt generation helper script for AUTOMATIC1111/stable-diffusion-webui and compatible forks

diffusion generation generator promt stable

Last synced: 14 Nov 2024

https://github.com/omriav/blended-diffusion

Official implementation for "Blended Diffusion for Text-driven Editing of Natural Images" [CVPR 2022]

blended-diffusion deep-learning diffusion multimodal openai openai-clip text-guided-manipulation text-to-image

Last synced: 31 Oct 2024

https://github.com/williamyang1991/fresco

[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation

controlnet diffusion video-processing

Last synced: 21 Dec 2024

https://github.com/williamyang1991/FRESCO

[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation

controlnet diffusion video-processing

Last synced: 31 Oct 2024

https://github.com/microsoft/foldingdiff

Diffusion models of protein structure; trigonometry and attention are all you need!

diffusion diffusion-models protein protein-structure-generation proteins transformer

Last synced: 01 Nov 2024

https://github.com/afiaka87/clip-guided-diffusion

A CLI tool/python module for generating images from text using guided diffusion and CLIP from OpenAI.

artificial-intelligence deep-learning diffusion image-generation multimodal multimodality openai openai-clip text-to-image text-to-image-synthesis

Last synced: 17 Nov 2024

https://github.com/scenediffuser/Scene-Diffuser

Official implementation of CVPR23 paper "Diffusion-based Generation, Optimization, and Planning in 3D Scenes"

3d-scene-understanding diffusion generative-model

Last synced: 11 Nov 2024

https://github.com/AILab-CVC/FreeNoise

[ICLR 2024] Code for FreeNoise based on VideoCrafter

aigc diffusion generative-model video-diffusion-model

Last synced: 31 Oct 2024

https://github.com/ailab-cvc/freenoise

[ICLR 2024] Code for FreeNoise based on VideoCrafter

aigc diffusion generative-model video-diffusion-model

Last synced: 16 Dec 2024

https://github.com/nianticlabs/diffusionerf

[CVPR 2023] DiffusioNeRF: Regularizing Neural Radiance Fields with Denoising Diffusion Models

deep-learning diffusion diffusion-models nerf neuralradiance-fields radiance-field reconstruction regularization

Last synced: 17 Dec 2024

https://github.com/RehgLab/RAVE

RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models - CVPR 2024 - Official Repo

diffusion stable-diffusion video-editing

Last synced: 31 Oct 2024

https://github.com/RQ-Wu/LAMP

Official implement code of LAMP: Learn a Motion Pattern by Few-Shot Tuning a Text-to-Image Diffusion Model (Few-shot-based text-to-video diffusion)

aigc diffusion diffusion-model diffusion-models few-shot-learning stable-diffusion text-to-video video-editing

Last synced: 31 Oct 2024

https://github.com/yeungchenwa/FontDiffuser

[AAAI2024] FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning

deep-learning diffusers diffusion font-generation image-generation

Last synced: 31 Oct 2024

https://github.com/ZichengDuan/TheChosenOne

Unofficial implementation of the paper "The Chosen One: Consistent Characters in Text-to-Image Diffusion Models"

deep-learning diffusion dinov2 generative-art generative-model

Last synced: 30 Oct 2024

https://github.com/jiauzhang/dragdiffusion

Implementation of DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing

diffusion draggan image-editing

Last synced: 18 Dec 2024

https://github.com/keonlee9420/diffsinger

PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)

ddpm diffsinger diffusion diffusion-models english fastspeech neural-tts non-autoregressive pytorch singing-voice speech-synthesis text-to-speech tts

Last synced: 02 Oct 2024

https://github.com/baaivision/diva

Diffusion Feedback Helps CLIP See Better

clip diffusion visual-perception

Last synced: 21 Dec 2024

https://github.com/Sirui-Xu/InterDiff

[ICCV 2023] Official PyTorch implementation of the paper "InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion"

3d-human-pose 6d deep-learning diffusion diffusion-models generative-ai generative-model human-motion-prediction human-object-interaction human-scene-interaction motion-prediction object-pose

Last synced: 10 Nov 2024

https://sirui-xu.github.io/InterDiff/

[ICCV 2023] Official PyTorch implementation of the paper "InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion"

3d-human-pose 6d deep-learning diffusion diffusion-models generative-ai generative-model human-motion-prediction human-object-interaction human-scene-interaction motion-prediction object-pose

Last synced: 10 Nov 2024

https://github.com/ssube/onnx-web

web UI for GPU-accelerated ONNX pipelines like Stable Diffusion, even on Windows and AMD

ai-art amd diffusion esrgan flask generative-art gfpgan image-generation linux nvidia python pytorch reactjs stable-diffusion super-resolution text2image upscaling web-app windows

Last synced: 15 Dec 2024

https://github.com/zhanghm1995/Forge_VFM4AD

A comprehensive survey of forging vision foundation models for autonomous driving, including challenges, methodologies, and opportunities.

3dgs adaptation autonomous-driving diffusion end-to-end-autonomous-driving foundation-model large-language-models nerf pre-training survey world-models

Last synced: 30 Nov 2024

https://github.com/pansanity666/Awesome-Avatars

List of recent advances for human avatars, including generation, reconstruction, and editing, etc.

3dreconstruction aigc avatar diffusion humannerf image-to-3d motion-generation nerf neural-rendering sdf smpl stable-diffusion t23d text-to-3d tt3d

Last synced: 12 Sep 2024

https://github.com/rainbowluocs/diffusiontrack

[AAAI 2024] DiffusionTrack: Diffusion Model For Multi-Object Tracking. DiffusionTrack is the first work to employ the diffusion model for multi-object tracking by formulating it as a generative noise-to-tracking diffusion process.

diffusion multi-object-tracking object-detection tracking

Last synced: 31 Oct 2024

https://barquerogerman.github.io/FlowMDM/

[CVPR 2024] Official Implementation of "Seamless Human Motion Composition with Blended Positional Encodings".

cvpr cvpr2024 diffusion generative-model human-motion human-motion-composition human-motion-extrapolation motion-generation

Last synced: 03 Dec 2024

https://github.com/L-YeZhu/CDCD

[ICLR2023] Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation (CDCD).

contrastive diffusion music-generation text2image

Last synced: 10 Nov 2024

https://github.com/ozanciga/diffusion-for-beginners

denoising diffusion models, as simple as possible

dall-e diffusion imagen midjourney pytorch scheduler stable-diffusion

Last synced: 17 Nov 2024

https://github.com/iamncj/dilightnet

Official Code Release for [SIGGRAPH 2024] DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation

controlnet diffusers diffusion diffusion-models image-generation lighting-control relighting siggraph stable-diffusion

Last synced: 15 Dec 2024

https://github.com/happylittlecat2333/Auffusion

Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation"

audio-generation diffusion diffusion-models large-language-models text-to-audio

Last synced: 14 Nov 2024

https://github.com/mihirp1998/VADER

Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various reward models such as HPS, PickScore, VideoMAE, VJEPA, YOLO, Aesthetics etc.

alignment diffusion reinforcement-learning reinforcement-learning-human-feedback rl rlhf vader video-diffusion video-diffusion-alignment

Last synced: 31 Oct 2024

https://github.com/ducha-aiki/manifold-diffusion

Diffusion on manifolds for image retrieval

diffusion image-retrieval manifold python reranking

Last synced: 01 Nov 2024

https://github.com/intellabs/mmpano

Official implementation of L-MAGIC

diffusion llms

Last synced: 16 Dec 2024

https://github.com/zibojia/COCOCO

Video-Inpaint-Anything: This is the inference code for our paper CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility.

cococo diffusion inpainting pytorch sam2 segment segment-anything text-guided text-guided-video-inpainting video-inpainting video-sam2-inpaint

Last synced: 30 Nov 2024

https://github.com/juliahealth/komamri.jl

Koma is a Pulseq-compatible framework to efficiently simulate Magnetic Resonance Imaging (MRI) acquisitions. The main focus of this package is to simulate general scenarios that could arise in pulse sequence development.

cardiac diffusion diffusion-mri gpu-acceleration mri simulation

Last synced: 25 Nov 2024