https://github.com/moatifbutt/awesome-diffusion-eccv-2024
List of diffusion papers accepted in ECCV 2024.
List: awesome-diffusion-eccv-2024
- Host: GitHub
- URL: https://github.com/moatifbutt/awesome-diffusion-eccv-2024
- Owner: moatifbutt
- Created: 2024-10-04T19:36:26.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-10-17T20:48:58.000Z (over 1 year ago)
- Last Synced: 2025-11-05T22:01:59.677Z (8 months ago)
- Topics: accepted-papers, diffusion, diffusion-models, eccv, eccv-2024, eccv2024, t2i, text-to-image
- Size: 281 KB
- Stars: 15
- Watchers: 1
- Forks: 1
- Open Issues: 0
Metadata Files:
- Readme: README.md
# Diffusion papers in ECCV 2024
List of diffusion papers accepted at ECCV 2024.
#### SMooDi: Stylized Motion Diffusion Model
Lei Zhong, Yiming Xie, Varun Jampani, Deqing Sun, Huaizu Jiang*
 [[arXiv](https://arxiv.org/abs/2407.12783)] [[Project](https://neu-vi.github.io/SMooDi/)] [[Code](https://github.com/neu-vi/SMooDi)] [[Slides](https://eccv.ecva.net/media/eccv-2024/Slides/1010.pdf)]
#### SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion
Vikram Voleti*, Chun-Han Yao, Mark Boss, Adam Letts, David Pankratz, Dmitrii Tochilkin, Christian Laforte, Robin Rombach, Varun Jampani*
 [[arXiv](https://arxiv.org/abs/2403.12008)] [[Project](https://sv3d.github.io/)] [[Model](https://huggingface.co/stabilityai/sv3d)]
#### EMDM: Efficient Motion Diffusion Model for Fast, High-Quality Human Motion Generation
Wenyang Zhou, Zhiyang Dou*, Zeyu Cao, Zhouyingcheng Liao, Jingbo Wang, Wenjia Wang, Yuan Liu, Taku Komura, Wenping Wang, Lingjie Liu
 [[arXiv](https://arxiv.org/abs/2312.02256)] [[Project](https://frank-zy-dou.github.io/projects/EMDM/index.html)] [[Code](https://github.com/Frank-ZY-Dou/EMDM)] [[Demo Video](https://www.youtube.com/watch?v=1SyCXbnol_g&ab_channel=FrankZhiyangDou)]
#### Diffusion Bridges for 3D Point Cloud Denoising
Mathias Vogel Hüni, Keisuke Tateno, Marc Pollefeys, Federico Tombari, Marie-Julie Rakotosaona, Francis Engelmann*
 [[arXiv](https://arxiv.org/abs/2408.16325)] [[Project](https://p2p-bridge.github.io/)] [[Code](https://github.com/matvogel/P2P-Bridge)] [[Poster](https://p2p-bridge.github.io/static/images/poster.png)]
#### VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models
Junlin Han*, Filippos Kokkinos, Philip Torr
 [[arXiv](https://arxiv.org/abs/2403.12034)] [[Project](https://junlinhan.github.io/projects/vfusion3d.html)] [[Code](https://github.com/facebookresearch/vfusion3d)] [[Poster](https://junlinhan.github.io/projects/resources/paper16/vfusion3d_poster.pdf)] [[Huggingface Demo](https://huggingface.co/spaces/facebook/VFusion3D)]
#### Beta-Tuned Timestep Diffusion Model
Tianyi Zheng*, Peng-Tao Jiang, Ben Wan, Hao Zhang, Jinwei Chen, Jia Wang*, Bo Li*
 [[paper](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/00328.pdf)]
#### Taming Latent Diffusion Model for Neural Radiance Field Inpainting
Chieh Hubert Lin*, Changil Kim, Jia-Bin Huang, Qinbo Li, Chih-Yao Ma, Johannes Kopf, Ming-Hsuan Yang, Hung-Yu Tseng
 [[arXiv](https://arxiv.org/abs/2404.09995)] [[Project](https://hubert0527.github.io/MALD-NeRF/)]
#### FreeInit: Bridging Initialization Gap in Video Diffusion Models
Tianxing Wu*, Chenyang Si, Yuming Jiang, Ziqi Huang, Ziwei Liu
 [[arXiv](https://arxiv.org/abs/2312.07537)] [[Project](https://tianxingwu.github.io/pages/FreeInit/)] [[Code](https://github.com/TianxingWu/FreeInit)] [[Huggingface Demo](https://huggingface.co/spaces/TianxingWu/FreeInit)] [[Video](https://youtu.be/lS5IYbAqriI)]
#### LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation
Yushi Lan, Fangzhou Hong, Shuai Yang, Shangchen Zhou, Xuyi Meng, Bo Dai, Xingang Pan, Chen Change Loy*
 [[arXiv](https://arxiv.org/abs/2403.12019)] [[Project](https://nirvanalan.github.io/projects/ln3diff/)] [[Code](https://github.com/NIRVANALAN/LN3Diff)] [[Gradio Demo](https://huggingface.co/spaces/yslan/LN3Diff_I23D)]
#### UniDream: Unifying Diffusion Priors for Relightable Text-to-3D Generation
Zexiang Liu, Yangguang Li, Youtian Lin, Xin Yu, Sida Peng, Yan-Pei Cao, Xiaojuan Qi, Xiaoshui Huang, Ding Liang*, Wanli Ouyang
 [[arXiv](https://arxiv.org/abs/2312.08754)] [[Project](https://yg256li.github.io/UniDream/)] [[Code](https://github.com/YG256Li/UniDream)]
#### FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models
Wei Wu*, Qingnan Fan, Shuai Qin, Hong Gu, Ruoyu Zhao, Antoni Chan*
 [[arXiv](https://arxiv.org/abs/2404.11895)] [[Code](https://github.com/Thermal-Dynamics/FreeDiff)]
#### Synchronous Diffusion for Unsupervised Smooth Non-Rigid 3D Shape Matching
Dongliang Cao*, Zorah Laehner, Florian Bernard
 [[arXiv](https://arxiv.org/abs/2407.08244)]
#### Diffusion Models for Open-Vocabulary Segmentation
Laurynas Karazija*, Iro Laina, Andrea Vedaldi, Christian Rupprecht
 [[arXiv](https://arxiv.org/abs/2306.09316)] [[Project](https://www.robots.ox.ac.uk/~vgg/research/ovdiff/)] [[Video](https://youtu.be/OSDtkp7Ta-8)]
#### AccDiffusion: An Accurate Method for Higher-Resolution Image Generation
Zhihang Lin, Mingbao Lin, Meng Zhao, Rongrong Ji*
 [[arXiv](https://arxiv.org/abs/2407.10738)] [[Project](https://lzhxmu.github.io/accdiffusion/accdiffusion.html)] [[Code](https://github.com/lzhxmu/AccDiffusion)]
#### Learning Differentially Private Diffusion Models via Stochastic Adversarial Distillation
Bochao Liu, Pengju Wang, Shiming Ge*
 [[arXiv](https://arxiv.org/abs/2408.14738)]
#### Prompting Future Driven Diffusion Model for Hand Motion Prediction
Bowen Tang*, Kaihao Zhang*, Wenhan Luo*, Wei Liu, Hongdong Li
 [[paper](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/01102.pdf)] [[Poster](https://eccv.ecva.net/media/PosterPDFs/ECCV%202024/653.png?t=1725929440.8208065)]
#### ColorPeel: Color Prompt Learning with Diffusion Models via Color and Shape Disentanglement
Muhammad Atif Butt*, Kai Wang, Javier Vazquez-Corral, Joost van de Weijer
 [[arXiv](https://arxiv.org/abs/2407.07197)] [[Project](https://moatifbutt.github.io/colorpeel/)] [[Code](https://github.com/moatifbutt/color-peel)] [[Poster](https://github.com/moatifbutt/color-peel/blob/main/assets/ECCV2024_ColorPeel_.pdf)]
#### DiffiT: Diffusion Vision Transformers for Image Generation
Ali Hatamizadeh*, Jiaming Song, Guilin Liu, Jan Kautz, Arash Vahdat
 [[arXiv](https://arxiv.org/abs/2312.02139)] [[Code](https://github.com/NVlabs/DiffiT)]
#### MoE-DiffIR: Task-customized Diffusion Priors for Universal Compressed Image Restoration
Yulin Ren, Xin Li*, Bingchen Li, Xingrui Wang, Mengxi China Guo, Shijie Zhao, Li Zhang, Zhibo Chen*
 [[arXiv](https://arxiv.org/abs/2407.10833)] [[Project](https://renyulin-f.github.io/MoE-DiffIR.github.io/)] [[Code](https://github.com/renyulin-f/MoE-DiffIR)] [[Data](https://drive.google.com/drive/folders/1Kn8SjJWpHITHlg5kuL1Ur7Ml-WNJJ064)]
#### MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection
Youngmin Oh, Hyung-Il Kim, Seong Tae Kim*, Jung Uk Kim*
 [[arXiv](https://arxiv.org/abs/2407.16448)] [[Code](https://github.com/VisualAIKHU/MonoWAD)] [[Data](https://drive.google.com/file/d/1iOpoZ-QbJdU2ytRmd9wPxH0RNjZ6KNdQ/view)]
#### Pixel-Aware Stable Diffusion for Realistic Image Super-Resolution and Personalized Stylization
Tao Yang*, Rongyuan Wu, Peiran Ren, Xuansong Xie, Lei Zhang
 [[arXiv](https://arxiv.org/abs/2308.14469)] [[Code](https://github.com/yangxy/PASD/)] [[Data](https://huggingface.co/datasets/yangtao9009/PASD_dataset)] [[Demo](https://colab.research.google.com/drive/1lZ_-rSGcmreLCiRniVT973x6JLjFiC-b?usp=sharing)]
#### XPSR: Cross-modal Priors for Diffusion-based Image Super-Resolution
Qu Yunpeng*, Kun Yuan, Kai Zhao, Qizhi Xie, Jinhua Hao, Ming Sun, Chao Zhou
 [[arXiv](https://arxiv.org/abs/2403.05049)] [[Code](https://github.com/qyp2000/XPSR)]
#### DiffusionDepth: Diffusion Denoising Approach for Monocular Depth Estimation
Yiqun Duan*, Xianda Guo*, Zheng Zhu
 [[arXiv](https://arxiv.org/abs/2303.05021)] [[Code](https://github.com/duanyiqun/DiffusionDepth)]
#### DC-Solver: Improving Predictor-Corrector Diffusion Sampler via Dynamic Compensation
Wenliang Zhao, Haolin Wang, Jie Zhou, Jiwen Lu*
 [[arXiv](https://arxiv.org/abs/2409.03755)] [[Code](https://github.com/wl-zhao/DC-Solver)]
#### Enhancing Perceptual Quality in Video Super-Resolution through Temporally-Consistent Detail Synthesis using Diffusion Models
Claudio Rota*, Marco Buzzelli, Joost van de Weijer
 [[arXiv](https://arxiv.org/abs/2311.15908)] [[Code](https://github.com/claudiom4sir/StableVSR)]
#### DreamDissector: Learning Disentangled Text-to-3D Generation from 2D Diffusion Priors
Zizheng Yan*, Jiapeng Zhou, Fanpeng Meng, Yushuang Wu, Lingteng Qiu, Zisheng Ye, Shuguang Cui, Guanying Chen, Xiaoguang Han*
 [[arXiv](https://arxiv.org/abs/2407.16260)] [[Project](https://chester256.github.io/dreamdissector/)] [[Video](https://youtu.be/qHiEoio7SJ0)]
#### Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion
Xiang Fan*, Anand Bhattad, Ranjay Krishna
 [[arXiv](https://arxiv.org/abs/2403.14617)] [[Project](https://videoshop-editing.github.io/)] [[Code](https://github.com/sfanxiang/videoshop)] [[Supplementary](https://videoshop-editing.github.io/static/supplementary/)] [[Video](https://videoshop-editing.github.io/static/supplementary/assets/intro.mp4)]
#### Diffusion-Based Image-to-Image Translation by Noise Correction via Prompt Interpolation
Junsung Lee, Minsoo Kang, Bohyung Han*
 [[arXiv](https://arxiv.org/abs/2409.08077)] [[Code](https://github.com/JS-Lee525/PIC)]
#### RadEdit: stress-testing biomedical vision models via diffusion image editing
Fernando Pérez-García, Sam Bond-Taylor, Pedro Sanchez, Boris van Breugel, Daniel Coelho de Castro, Harshita Sharma, Valentina Salvatelli, Maria Teodora A Wetscherek, Hannah CM Richardson, Matthew Lungren, Aditya Nori, Javier Alvarez-Valle, Ozan Oktay, Maximilian Ilse*
 [[arXiv](https://arxiv.org/abs/2312.12865)] [[Project](https://huggingface.co/microsoft/radedit)]
#### AdaDiffSR: Adaptive Region-aware Dynamic acceleration Diffusion Model for Real-World Image Super-Resolution
Yuanting Fan, Chengxu Liu, Nengzhong Yin, Changlong Gao, Xueming Qian*
 [[paper](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/01944.pdf)] [[Video](https://www.youtube.com/watch?v=UcmJI3Cd9UM)]
#### Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation
Xuelu Feng, Dongdong Chen, Junsong Yuan, Chunming Qiao, Gang Hua, Zixin Zhu*
 [[arXiv](https://arxiv.org/abs/2403.12042)] [[Code](https://github.com/buxiangzhiren/VD-IT)] [[Video](https://youtu.be/da-Fs5-ZyLc)]
#### Co-synthesis of Histopathology Nuclei Image-Label Pairs using a Context-Conditioned Joint Diffusion Model
Seonghui Min, Hyun-Jic Oh, Won-Ki Jeong*
 [[arXiv](https://arxiv.org/abs/2407.14434)]
#### MVDD: Multi-View Depth Diffusion Models
Zhen Wang*, Qiangeng Xu, Feitong Tan, Menglei Chai, Shichen Liu, Rohit Pandey, Sean Fanello, Achuta Kadambi, Yinda Zhang
 [[arXiv](https://arxiv.org/abs/2312.04875)] [[Project](https://mvdepth.github.io/)]
#### EBDM: Exemplar-guided Image Translation with Brownian-bridge Diffusion Models
Eungbean Lee, Somi Jeong, Kwanghoon Sohn*
 [[arXiv](https://arxiv.org/abs/2205.07680)]
#### DreamDrone: Text-to-Image Diffusion Models are Zero-shot Perpetual View Generators
Hanyang Kong*, Dongze Lian, Michael Bi Mi, Xinchao Wang*
 [[arXiv](https://arxiv.org/html/2312.08746v3)] [[Project](https://hyokong.github.io/dreamdrone-page/)] [[Code](https://github.com/HyoKong/DreamDrone)] [[Demo](https://huggingface.co/spaces/imsuperkong/dreamdrone)]
#### Harnessing Text-to-Image Diffusion Models for Category-Agnostic Pose Estimation
Duo Peng, Zhengbo Zhang, Ping Hu, Qiuhong Ke, David Yau, Jun Liu*
 [[paper](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/02103.pdf)]
#### M2D2M: Multi-Motion Generation from Text with Discrete Diffusion Models
Seunggeun Chi*, Hyung-gun Chi, Hengbo Ma, Nakul Agarwal, Faizan Siddiqui, Karthik Ramani*, Kwonjoon Lee*
 [[arXiv](https://arxiv.org/abs/2407.14502)] [[Video](https://www.youtube.com/watch?v=DERy31VEK2g)]
#### Shapefusion: 3D localized human diffusion models
Rolandos Alexandros Potamias*, Michael Tarasiou, Stylianos Ploumpis, Stefanos Zafeiriou
 [[arXiv](https://arxiv.org/abs/2403.19773)] [[Project](https://rolpotamias.github.io/Shapefusion/)]
#### Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing
Wonjun Kang, Kevin Galim, Hyung Il Koo*
 [[arXiv](https://arxiv.org/abs/2403.09468)] [[Code](https://github.com/furiosa-ai/eta-inversion)] [[Video](https://www.youtube.com/watch?v=NwqK9p4GKlo)]
#### MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization
Tianchen Zhao*, Xuefei Ning, Tongcheng Fang, Enshu Liu, Guyue Huang, Zinan Lin, Shengen Yan, Guohao Dai, Yu Wang
 [[arXiv](https://arxiv.org/abs/2405.17873)] [[Project](https://a-suozhang.xyz/mixdq.github.io/)] [[Code](https://github.com/A-suozhang/MixDQ)] [[Huggingface](https://huggingface.co/nics-efc/MixDQ)]
#### RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models
Bowen Zhang, Yiji Cheng, Chunyu Wang*, Ting Zhang, Jiaolong Yang, Yansong Tang, Feng Zhao, Dong Chen, Baining Guo
 [[arXiv](https://www.arxiv.org/abs/2407.06938)] [[Project](https://rodinhd.github.io/)] [[Code](https://github.com/RodinHD/RodinHD)]
#### A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting
Wouter Van Gansbeke*, Bert De Brabandere
 [[arXiv](https://arxiv.org/abs/2401.10227)] [[Code](https://github.com/segments-ai/latent-diffusion-segmentation)]
#### Lego: Learning to Disentangle and Invert Personalized Concepts Beyond Object Appearance in Text-to-Image Diffusion Models
Saman Motamed*, Danda Pani Paudel, Luc Van Gool
 [[arXiv](https://arxiv.org/abs/2311.13833)] [[Project](https://sam-motamed.github.io/projects/lego)]
#### IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation
Yuanhao Zhai*, Kevin Lin, Linjie Li, Chung-Ching Lin, Jianfeng Wang, Zhengyuan Yang, David Doermann, Junsong Yuan, Zicheng Liu, Lijuan Wang
 [[arXiv](https://arxiv.org/abs/2407.10937)] [[Code](https://github.com/yhZhai/idol)]
#### DCDM: Diffusion-Conditioned-Diffusion Model for Scene Text Image Super-Resolution
Shrey Singh*, Prateek Keserwani, Masakazu Iwamura*, Partha Pratim Roy
 [[paper](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/02357.pdf)] [[Code](https://github.com/shreygithub/DCDM)]
#### DreamMover: Leveraging the Prior of Diffusion Models for Image Interpolation with Large Motion
Liao Shen, Tianqi Liu, Huiqiang Sun, Xinyi Ye, Baopu Li, Jianming Zhang, Zhiguo Cao*
 [[arXiv](https://www.arxiv.org/abs/2409.09605)] [[Code](https://github.com/leoShen917/DreamMover)]
#### Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators
Yifan Pu*, Zhuofan Xia, Jiayi Guo, Dongchen Han, Qixiu Li, Duo Li, Yuhui Yuan, Ji Li, Yizeng Han, Shiji Song, Gao Huang*, Xiu Li*
 [[arXiv](https://arxiv.org/abs/2408.05710)] [[Code](https://github.com/LeapLabTHU/Attention-Mediators)]
#### Diffusion Model is a Good Pose Estimator from 3D RF-Vision
Junqiao Fan, Jianfei Yang*, Yuecong Xu, Lihua Xie
 [[arXiv](https://arxiv.org/abs/2403.16198)]
#### MVDiffHD: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction
Shitao Tang*, Jiacheng Chen, Dilin Wang, Chengzhou Tang, Fuyang Zhang, Yuchen Fan, Vikas Chandra, Yasutaka Furukawa, Rakesh Ranjan
 [[arXiv](https://arxiv.org/abs/2402.12712)] [[Project](https://mvdiffusion-plusplus.github.io/)] [[Code](https://github.com/Tangshitao/MVDiffusion_plusplus)]
#### Diffusion-Generated Pseudo-Observations for High-Quality Sparse-View Reconstruction
Xinhang Liu*, Jiaben Chen, Shiu-Hong Kao, Yu-Wing Tai, Chi-Keung Tang
 [[arXiv](https://arxiv.org/abs/2305.15171)] [[Project](https://xinhangliu.com/deceptive-nerf-3dgs)]
#### Memory-Efficient Fine-Tuning for Quantized Diffusion Model
Hyogon Ryu, Seohyun Lim, Hyunjung Shim*
 [[arXiv](https://arxiv.org/abs/2401.04339)] [[Code](https://github.com/ugonfor/TuneQDM)] [[Poster](https://eccv.ecva.net/media/PosterPDFs/ECCV%202024/1781.png?t=1727550795.317845)]
#### COIN: Control-Inpainting Diffusion Prior for Human and Camera Motion Estimation
Jiefeng Li*, Ye Yuan, Davis Rempe, Haotian Zhang, Pavlo Molchanov, Cewu Lu, Jan Kautz, Umar Iqbal*
 [[arXiv](https://arxiv.org/abs/2408.16426)]
#### FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior
Zhekai Chen, Wen Wang, Zhen Yang, Zeqing Yuan, Hao Chen*, Chunhua Shen*
 [[arXiv](https://arxiv.org/abs/2407.04947)] [[Code](https://github.com/aim-uofa/FreeCompose)] [[Poster](https://eccv.ecva.net/media/PosterPDFs/ECCV%202024/297.png?t=1725802844.8353653)] [[Slides](https://eccv.ecva.net/media/eccv-2024/Slides/297.pdf)]
#### WildVidFit: Video Virtual Try-On in the Wild via Image-Based Controlled Diffusion Models
Zijian He, Peixin Chen, Guangrun Wang, Guanbin Li*, Philip Torr, Liang Lin
 [[arXiv](https://arxiv.org/abs/2407.10625)] [[Project](https://wildvidfit-project.github.io/)] [[Poster](https://eccv.ecva.net/media/PosterPDFs/ECCV%202024/2494.png?t=1726192744.396735)]
#### RegionDrag: Fast Region-Based Image Editing with Diffusion Models
Jingyi Lu, Xinghui Li, Kai Han*
 [[arXiv](https://arxiv.org/abs/2407.18247)] [[Project](https://visual-ai.github.io/regiondrag/)] [[Demo](https://colab.research.google.com/drive/1pnq9t_1zZ8yL_Oba20eBLVZLp3glniBR?usp=sharing)] [[Code](https://github.com/Visual-AI/RegionDrag)] [[Slides](https://eccv.ecva.net/media/eccv-2024/Slides/1756_57T8SZT.pdf)] [[Poster](https://eccv.ecva.net/media/PosterPDFs/ECCV%202024/1756.png?t=1726153953.1186402)] [[Dataset](https://visual-ai.github.io/regiondrag/#dataset)]
#### MagDiff: Multi-Alignment Diffusion for High-Fidelity Video Generation and Editing
Haoyu Zhao, Tianyi Lu, Jiaxi Gu, Xing Zhang, Qingping Zheng, Zuxuan Wu*, Hang Xu, Yu-Gang Jiang
 [[arXiv](https://arxiv.org/abs/2311.17338)] [[Poster](https://eccv.ecva.net/media/PosterPDFs/ECCV%202024/665.png?t=1726062032.621387)]
#### Mutual Learning for Acoustic Matching and Dereverberation via Visual Scene-driven Diffusion
Jian Ma, Wenguan Wang*, Yi Yang, Feng Zheng
 [[arXiv](https://www.arxiv.org/abs/2407.10373)] [[Project](https://hechang25.github.io/MVSD/)] [[Poster](https://eccv.ecva.net/media/PosterPDFs/ECCV%202024/1096.png?t=1726061514.9066596)] [[Code](https://github.com/hechang25/MVSD)]
#### SEDiff: Structure Extraction for Domain Adaptive Depth Estimation via Denoising Diffusion Models
Dongseok Shim*, Hyoun Jin Kim*
 [[paper](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/02829.pdf)] [[Poster](https://eccv.ecva.net/media/PosterPDFs/ECCV%202024/973.png?t=1726087766.2618341)]
#### MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model
Muyao Niu, Xiaodong Cun*, Xintao Wang, Yong Zhang, Ying Shan, Yinqiang Zheng*
 [[arXiv](https://arxiv.org/abs/2405.20222)]
#### RoofDiffusion: Constructing Roofs from Severely Corrupted Point Data via Diffusion
Kyle Shih-Huang Lo*, Jorg Peters, Eric Spellman
 [[arXiv](https://arxiv.org/abs/2404.09290)]
#### L-DiffER: Single Image Reflection Removal with Language-based Diffusion Model
Yuchen Hong*, Haofeng Zhong*, Shuchen Weng, Jinxiu S Liang, Boxin Shi
 [[paper](https://assets.ctfassets.net/yreyglvi5sud/4uhN2PF7UyMGgiWQgCMSgi/41f4f9f46fbfa370b3ccd8fbcadbc2b3/2024______Hong_ECCV.pdf)]
#### BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion
Xuan Ju*, Xian Liu, Xintao Wang*, Yuxuan Bian, Ying Shan, Qiang Xu*
 [[arXiv](https://arxiv.org/abs/2403.06976)]
#### Realistic Human Motion Generation with Cross-Diffusion Models
Zeping Ren, Shaoli Huang*, Xiu Li*
 [[arXiv](https://arxiv.org/abs/2312.10993)]
#### ZigMa: A DiT-style Zigzag Mamba Diffusion Model
Vincent Tao Hu*, Stefan A Baumann, Ming Gui, Olga Grebenkova, Pingchuan Ma, Johannes S Fischer, Bjorn Ommer
 [[arXiv](https://arxiv.org/abs/2403.13802)]
#### EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion
Guangyao Zhai*, Evin Pınar Örnek, Dave Zhenyu Chen, Ruotong Liao, Yan Di, Nassir Navab, Federico Tombari, Benjamin Busam
 [[arXiv](https://arxiv.org/abs/2405.00915)]
#### Safe-Sim: Safety-Critical Closed-Loop Traffic Simulation with Diffusion-Controllable Adversaries
Wei-Jer Chang*, Francesco Pittaluga, Masayoshi Tomizuka, Wei Zhan, Manmohan Chandraker
 [[arXiv](https://arxiv.org/abs/2401.00391)]
#### Implicit Concept Removal of Diffusion Models
Zhili Liu*, Kai Chen, Yifan Zhang, Jianhua Han, Lanqing Hong, Hang Xu, Zhenguo Li, Dit-Yan Yeung, James Kwok
 [[arXiv](https://arxiv.org/abs/2310.05873)]
#### GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image
Xiao Fu*, Wei Yin, Mu Hu, Kaixuan Wang, Yuexin Ma, Ping Tan, Shaojie Shen, Dahua Lin, Xiaoxiao Long
 [[arXiv](https://arxiv.org/abs/2403.12013)]
#### Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions
Fabio Tosi, Pierluigi Zama Ramirez, Matteo Poggi*
 [[arXiv](https://arxiv.org/abs/2407.16698)]
#### Lazy Diffusion Transformer for Interactive Image Editing
Yotam Nitzan*, Zongze Wu, Richard Zhang, Eli Shechtman, Danny Cohen-Or, Taesung Park, Michaël Gharbi
 [[arXiv](https://arxiv.org/abs/2404.12382)]
#### ComboVerse: Compositional 3D Assets Creation Using Spatially-Aware Diffusion Guidance
Yongwei Chen, Tengfei Wang, Tong Wu, Xingang Pan, Kui Jia*, Ziwei Liu
 [[arXiv](https://arxiv.org/abs/2403.12409)]
#### 4Diff: 3D-Aware Diffusion Model for Third-to-First Viewpoint Translation
Feng Cheng*, Mi Luo*, Huiyu Wang, Alex Dimakis, Lorenzo Torresani, Gedas Bertasius, Kristen Grauman
 [[ECCV page](https://eccv.ecva.net/virtual/2024/poster/1665)]
#### Enhancing Diffusion Models with Text-Encoder Reinforcement Learning
Chaofeng Chen*, Annan Wang, Haoning Wu, Liang Liao, Wenxiu Sun, Qiong Yan, Weisi Lin*
 [[arXiv](https://arxiv.org/abs/2311.15657)]
#### Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation
Peng Jin*, Hao Li, Zesen Cheng, Kehan Li, Runyi Yu, Chang Liu*, Xiangyang Ji, Li Yuan*, Jie Chen
 [[arXiv](https://arxiv.org/abs/2407.10528)]
#### MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion
Lehong Wu*, Lilang Lin, Jiahang Zhang, Yiyang Ma, Jiaying Liu*
 [[arXiv](https://www.arxiv.org/abs/2409.10473)]
#### Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models
Ruibin Li*, Ruihuang Li, Song Guo, Lei Zhang
 [[arXiv](https://arxiv.org/abs/2403.11105)]
#### StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models
Wen Li*, Muyuan Fang, Cheng Zou, Biao Gong, Ruobing Zheng, Meng Wang, Jingdong Chen, Ming Yang
 [[arXiv](https://arxiv.org/abs/2409.02543)]
#### NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model
Zhongqun Zhang*, Hengfei Wang, Ziwei Yu, Yihua Cheng*, Angela Yao, Hyung Jin Chang
 [[arXiv](https://arxiv.org/abs/2407.12727)]
#### Diff-Tracker: Text-to-Image Diffusion Models are Unsupervised Trackers
Zhengbo Zhang*, Li Xu, Duo Peng, Hossein Rahmani, Jun Liu*
 [[arXiv](https://arxiv.org/abs/2407.08394)]
#### Transferable 3D Adversarial Shape Completion using Diffusion Models
Xuelong Dai*, Bin Xiao
 [[arXiv](http://arxiv.org/abs/2407.10077)]
#### Distilling Diffusion Models into Conditional GANs
MinGuk Kang*, Richard Zhang, Connelly Barnes, Sylvain Paris, Suha Kwak, Jaesik Park, Eli Shechtman, Jun-Yan Zhu, Taesung Park*
 [[arXiv](https://arxiv.org/abs/2405.05967)]
#### You Only Need One Step: Fast Super-Resolution with Stable Diffusion via Scale Distillation
Mehdi Noroozi*, Isma Hadji*, Brais Martinez*, Adrian Bulat*, Georgios Tzimiropoulos*
 [[arXiv](https://arxiv.org/abs/2401.17258)]
#### Optimizing Diffusion Models for Joint Trajectory Prediction and Controllable Generation
Yixiao Wang*, Chen Tang, Lingfeng Sun, Simone Rossi, Yichen Xie, Chensheng Peng, Thomas Hannagan, Stefano Sabatini, Nicola Poerio, Masayoshi Tomizuka, Wei Zhan
 [[arXiv](https://arxiv.org/abs/2408.00766)]
#### Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models
Xiaoyu Zhu*, Hao Zhou, Pengfei Xing, Long Zhao, Hao Xu, Junwei Liang, Alexander G. Hauptmann, Ting Liu, Andrew Gallagher
 [[arXiv](https://arxiv.org/abs/2407.13642)]
#### D-SCo: Dual-Stream Conditional Diffusion for Monocular Hand-Held Object Reconstruction
Bowen Fu*, Gu Wang*, Chenyangguang Zhang, Yan Di, Ziqin Huang, Zhiying Leng, Fabian Manhardt, Xiangyang Ji*, Federico Tombari*
 [[arXiv](https://arxiv.org/abs/2311.14189)]
#### Probabilistic Weather Forecasting with Deterministic Guidance-based Diffusion Model
Donggeun Yoon, Minseok Seo, Doyi Kim, Yeji Choi, Donghyeon Cho*
 [[paper](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/04326.pdf)]
#### Diffusion-Driven Data Replay: A Novel Approach to Combat Forgetting in Federated Class Continual Learning
Jinglin Liang, Jin Zhong, Hanlin Gu, Zhongqi Lu, Xingxing Tang, Gang Dai, Shuangping Huang*, Lixin Fan, Qiang Yang
 [[arXiv](https://arxiv.org/abs/2409.01128)]
#### View Selection for 3D Captioning via Diffusion Ranking
Tiange Luo*, Justin Johnson, Honglak Lee
 [[arXiv](https://arxiv.org/abs/2404.07984)]
#### OmniSSR: Zero-shot Omnidirectional Image Super-Resolution using Stable Diffusion Model
Runyi Li*, Xuhan Sheng, Weiqi Li, Jian Zhang*
 [[arXiv](https://arxiv.org/abs/2404.10312)]
#### UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models
Yiming Zhao*, Zhouhui Lian*
 [[arXiv](https://arxiv.org/abs/2312.04884)]
#### OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models
Zhe Kong*, Yong Zhang*, Tianyu Yang, Tao Wang, Kaihao Zhang, Bizhu Wu, Guanying Chen, Wei Liu, Wenhan Luo*
 [[arXiv](https://arxiv.org/abs/2403.10983)]
#### CloudFixer: Test-Time Adaptation for 3D Point Clouds via Diffusion-Guided Geometric Transformation
Hajin Shim, Changhun Kim, Eunho Yang*
 [[arXiv](https://arxiv.org/abs/2407.16193)]
#### DreamDiffusion: High-Quality EEG-to-Image Generation with Temporal Masked Signal Modeling and CLIP Alignment
Yunpeng Bai*, Xintao Wang, Yan-Pei Cao, Yixiao Ge, Chun Yuan, Ying Shan
 [[arXiv](https://arxiv.org/abs/2306.16934)]
#### SCP-Diff: Spatial-Categorical Joint Prior for Diffusion Based Semantic Image Synthesis
Huan-ang Gao, Mingju Gao, Jiaju Li, Wenyi Li, Rong Zhi, Hao Tang, Hao Zhao*
[[arXiv](https://arxiv.org/abs/2403.09638)]
#### PixArt-Sigma: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Junsong Chen, Chongjian Ge, Enze Xie*, Yue Wu, Lewei Yao, Xiaozhe Ren, Zhongdao Wang, Ping Luo, Huchuan Lu, Zhenguo Li
[[arXiv](https://arxiv.org/abs/2403.04692)]
#### Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models
Yixuan Ren*, Yang Zhou, Jimei Yang, Jing Shi, Difan Liu, Feng Liu, Mingi Kwon, Abhinav Shrivastava
[[arXiv](https://arxiv.org/abs/2402.14780)]
#### ∞-Brush: Controllable Large Image Synthesis with Diffusion Models in Infinite Dimensions
Minh-Quan Le*, Alexandros Graikos, Srikar Yellapragada, Rajarsi Gupta, Joel Saltz, Dimitris Samaras
[[arXiv](https://arxiv.org/abs/2407.14709)]
#### ProTIP: Probabilistic Robustness Verification on Text-to-Image Diffusion Models against Stochastic Perturbation
Yi Zhang, Yun Tang, Wenjie Ruan, Xiaowei Huang, Siddartha Khastgir, Paul A Jennings, Xingyu Zhao*
[[arXiv](https://arxiv.org/abs/2402.15429)]
#### Latent Diffusion Prior Enhanced Deep Unfolding for Snapshot Spectral Compressive Imaging
Zongliang Wu*, Ruiying Lu, Ying Fu, Xin Yuan
[[arXiv](https://arxiv.org/abs/2311.14280)]
#### Learning Diffusion Models for Multi-View Anomaly Detection
Chieh Liu*, Yu-Min Chu*, Ting-I Hsieh*, Hwann-Tzong Chen*, Tyng-Luh Liu*
[[paper](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/04907.pdf)]
#### Diff3DETR: Agent-based Diffusion Model for Semi-supervised 3D Object Detection
Jiacheng Deng*, Jiahao Lu, Tianzhu Zhang
[[arXiv](https://arxiv.org/abs/2408.00286)]
#### Layout-Corrector: Alleviating Layout Sticking Phenomenon in Discrete Diffusion Model
Shoma Iwai*, Atsuki Osanai, Shunsuke Kitada, Shinichiro Omachi
[[arXiv](https://arxiv.org/abs/2409.16689)]
#### Kinetic Typography Diffusion Model
Seonmi Park, Inhwan Bae, Seunghyun Shin, Hae-Gon Jeon*
[[arXiv](https://arxiv.org/abs/2407.10476)]
#### GroupDiff: Diffusion-based Group Portrait Editing
Yuming Jiang, Nanxuan Zhao*, Qing Liu, Krishna Kumar Singh, Shuai Yang, Chen Change Loy, Ziwei Liu
[[arXiv](https://arxiv.org/abs/2409.14379)]
#### TransFusion -- A Transparency-Based Diffusion Model for Anomaly Detection
Matic Fučka*, Vitjan Zavrtanik, Danijel Skočaj
[[arXiv](https://arxiv.org/abs/2311.09999)]
#### Test-Time Stain Adaptation with Diffusion Models for Histopathology Image Classification
Cheng-Chang Tsai*, Yuan-Chih Chen, Chun-Shien Lu*
[[paper](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/05175.pdf)]
#### Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation
Lanqing Guo, Yingqing He, Haoxin Chen, Menghan Xia, Xiaodong Cun, Yufei Wang, Siyu Huang, Yong Zhang, Xintao Wang, Qifeng Chen, Ying Shan, Bihan Wen*
[[arXiv](https://arxiv.org/abs/2402.10491)]
#### R3D-AD: Reconstruction via Diffusion for 3D Anomaly Detection
Zheyuan Zhou, Le Wang, Naiyu Fang, Zili Wang, Lemiao Qiu*, Shuyou Zhang
[[arXiv](https://arxiv.org/abs/2407.10862)]
#### Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models
Qinyu Yang, Haoxin Chen, Yong Zhang*, Menghan Xia, Xiaodong Cun, Zhixun Su*, Ying Shan
[[arXiv](https://www.arxiv.org/abs/2407.10285)]
#### Revisiting Feature Disentanglement Strategy in Diffusion Training and Breaking Conditional Independence Assumption in Sampling
Wonwoong Cho*, Hareesh Ravi*, Midhun Harikumar, Vinh Khuc, Krishna Kumar Singh, Jingwan Lu, David Iseri Inouye*, Ajinkya Kale*
[[paper](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/05452.pdf)]
#### MaxFusion: Plug&Play Multi-Modal Generation in Text-to-Image Diffusion Models
Nithin Gopalakrishnan Nair*, Jeya Maria Jose Valanarasu, Vishal Patel
[[arXiv](https://arxiv.org/abs/2404.09977)]
#### DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control
Yuru Jia, Lukas Hoyer, Shengyu Huang, Tianfu Wang, Luc Van Gool, Konrad Schindler, Anton Obukhov*
[[arXiv](https://arxiv.org/abs/2312.03048)]
#### Surf-D: Generating High-Quality Surfaces of Arbitrary Topologies Using Diffusion Models
Zhengming Yu*, Zhiyang Dou, Xiaoxiao Long, Cheng Lin, Zekun Li, Yuan Liu, Norman Müller, Taku Komura, Marc Habermann, Christian Theobalt, Xin Li, Wenping Wang*
[[arXiv](https://arxiv.org/abs/2311.17050)]
#### Diffusion-Refined VQA Annotations for Semi-Supervised Gaze Following
Qiaomu Miao*, Alexandros Graikos, Jingwei Zhang, Sounak Mondal, Minh Hoai, Dimitris Samaras
[arXiv](https://arxiv.org/abs/2406.02774) [Code]()
#### Concept Sliders: LoRA Adaptors for Precise Control in Diffusion Models
Rohit Gandikota*, Joanna Materzynska, Tingrui Zhou, Antonio Torralba, David Bau
[arXiv](https://arxiv.org/abs/2311.12092) [Code]()
#### AutoDIR: Automatic All-in-One Image Restoration with Latent Diffusion
Yitong Jiang*, Zhaoyang Zhang, Tianfan Xue, Jinwei Gu*
[arXiv](https://arxiv.org/abs/2310.10123) [Code]()
#### Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers
Chi-Pin Huang*, Kai-Po Chang, Chung-Ting Tsai, Yung-Hsuan Lai, Fu-En Yang, Yu-Chiang Frank Wang
[arXiv](https://arxiv.org/abs/2311.17717) [Code]()
#### Temporal Residual Guided Diffusion Framework for Event-Driven Video Reconstruction
Lin Zhu*, Yunlong Zheng, Yijun Zhang, Xiao Wang, Lizhi Wang, Hua Huang
[arXiv](https://arxiv.org/abs/2407.10636) [Code]()
#### Free-ATM: Harnessing Free Attention Masks for Representation Learning on Diffusion-Generated Images
David Junhao Zhang*, Mutian Xu, Jay Zhangjie Wu, Chuhui Xue, Wenqing Zhang, Xiaoguang Han, Song Bai, Mike Zheng Shou*
[arXiv](https://arxiv.org/abs/2308.06739) [Code]()
#### AlignDiff: Aligning Diffusion Models for General Few-Shot Segmentation
Ri-Zhao Qiu*, Yu-Xiong Wang, Kris Hauser
[Paper](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/05794.pdf) [Code]()
#### Tree-D Fusion: Simulation-Ready Tree Dataset from Single Images with Diffusion Priors
Jae Joong Lee, Bosheng Li, Sara M Beery, Jonathan Huang, Songlin Fei, Raymond A. Yeh, Bedrich Benes*
[arXiv](https://www.arxiv.org/abs/2407.10330) [Code]()
#### DomainFusion: Generalizing To Unseen Domains with Latent Diffusion Models
Yuyang Huang, Yabo Chen, Yuchen Liu, Xiaopeng Zhang*, Wenrui Dai*, Hongkai Xiong, Qi Tian
[Paper](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/05806.pdf) [Code]()
#### Object-Conditioned Energy-Based Attention Map Alignment in Text-to-Image Diffusion Models
Yasi Zhang*, Peiyu Yu, Ying Nian Wu
[arXiv](https://arxiv.org/abs/2404.07389) [Code]()
#### Three Things We Need to Know About Transferring Stable Diffusion to Visual Dense Prediction Tasks
Manyuan Zhang*, Guanglu Song, Xiaoyu Shi, Yu Liu, Hongsheng Li
[Paper](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/05837.pdf) [Code]()
#### SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models
Yuwei Guo, Ceyuan Yang*, Anyi Rao, Maneesh Agrawala, Dahua Lin*, Bo Dai*
[arXiv](https://arxiv.org/abs/2311.16933) [Code]()
#### Diffusion Reward: Learning Rewards via Conditional Video Diffusion
Tao Huang*, Guangqi Jiang, Yanjie Ze, Huazhe Xu*
[arXiv](https://arxiv.org/abs/2312.14134) [Code]()
#### SpeedUpNet: A Plug-and-Play Adapter Network for Accelerating Text-to-Image Diffusion Models
Weilong Chai*, Dandan Zheng, Jiajiong Cao, Zhiquan Chen, Changbao Wang, Chenguang Ma
[arXiv](https://arxiv.org/abs/2312.08887) [Code]()
#### DECap: Towards Generalized Explicit Caption Editing via Diffusion Mechanism
Zhen Wang, Xinyun Jiang, Jun Xiao, Tao Chen, Long Chen*
[arXiv](https://arxiv.org/abs/2311.14920) [Code]()
#### DiffuX2CT: Diffusion Learning to Reconstruct CT Images from Biplanar X-Rays
Xuhui Liu, Zhi Qiao, Runkun Liu, Hong Li, Xiantong Zhen*, Zhen Qian, Juan Zhang*, Baochang Zhang
[arXiv](https://arxiv.org/abs/2407.13545) [Code]()
#### MoVideo: Motion-Aware Video Generation with Diffusion Models
Jingyun Liang*, Yuchen Fan, Kai Zhang*, Radu Timofte, Luc Van Gool, Rakesh Ranjan
[arXiv](https://arxiv.org/abs/2311.11325) [Code]()
#### Learn to Optimize Denoising Scores: A Unified and Improved Diffusion Prior for 3D Generation
Xiaofeng Yang*, Yiwen Chen, Cheng Chen, Chi Zhang, Yi Xu, Xulei Yang, Fayao Liu, Guosheng Lin
[arXiv](https://arxiv.org/abs/2312.04820) [Code]()
#### Motion-Guided Latent Diffusion for Temporally Consistent Real-world Video Super-resolution
Xi Yang*, Chenhang He, Jianqi Ma, Lei Zhang
[arXiv](https://arxiv.org/abs/2312.00853) [Code]()
#### DeCo: Decoupled Human-Centered Diffusion Video Editing with Motion Consistency
Xiaojing Zhong, Xinyi Huang, Xiaofeng Yang, Guosheng Lin*, Qingyao Wu*
[arXiv](https://arxiv.org/abs/2408.07481) [Code]()
#### Unleashing the Potential of the Semantic Latent Space in Diffusion Models for Image Dehazing
Zizheng Yang, Hu Yu, Bing Li, Jinghao Zhang, Jie Huang, Feng Zhao*
[Paper](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/06072.pdf) [Code]()
#### PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion
Guansong Lu*, Yuanfan Guo, Jianhua Han, Minzhe Niu, Yihan Zeng, Songcen Xu, Zeyi Huang, Zhao Zhong, Wei Zhang, Hang Xu
[arXiv](https://arxiv.org/abs/2312.16486) [Code]()
#### Closed-Loop Unsupervised Representation Disentanglement with $\beta$-VAE Distillation and Diffusion Probabilistic Feedback
Xin Jin*, Bohan Li*, Baao Xie, Wenyao Zhang, Jinming Liu, Ziqiang Li, Tao Yang, Wenjun Zeng
[arXiv](https://arxiv.org/abs/2402.02346) [Code]()
#### Rethinking Video Deblurring with Wavelet-Aware Dynamic Transformer and Diffusion Model
Chen Rao, Guangyuan Li, Zehua Lan, Jiakai Sun, Junsheng Luan, Wei Xing*, Lei Zhao*, Huaizhong Lin*, Jianfeng Dong, Dalong Zhang
[arXiv](https://arxiv.org/abs/2408.13459) [Code]()
#### D4-VTON: Dynamic Semantics Disentangling for Differential Diffusion based Virtual Try-On
Zhaotong Yang, Zicheng Jiang, Xinzhe Li, Huiyu Zhou, Junyu Dong, Huaidong Zhang, Yong Du*
[arXiv](https://arxiv.org/abs/2407.15111) [Code]()
#### AdvDiff: Generating Unrestricted Adversarial Examples using Diffusion Models
Xuelong Dai*, Kaisheng Liang, Bin Xiao
[arXiv](https://arxiv.org/abs/2307.12499) [Code]()
#### DiffPMAE: Diffusion Masked Autoencoders for Point Cloud Reconstruction
Yanlong Li*, Chamara Madarasingha, Kanchana Thilakarathna
[arXiv](https://arxiv.org/abs/2312.03298) [Code]()
#### DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Jinbo Xing*, Menghan Xia, Yong Zhang, Haoxin Chen, Wangbo Yu, Hanyuan Liu, Gongye Liu, Xintao Wang, Ying Shan, Tien-Tsin Wong
[arXiv](https://arxiv.org/abs/2310.12190) [Code]()
#### Text-Anchored Score Composition: Tackling Condition Misalignment in Text-to-Image Diffusion Models
Luozhou Wang*, Guibao Shen, Wenhang Ge, Guangyong Chen, Yijun Li, Yingcong Chen*
[arXiv](https://arxiv.org/abs/2306.14408) [Code]()
#### LightenDiffusion: Unsupervised Low-Light Image Enhancement with Latent-Retinex Diffusion Models
Hai Jiang, Ao Luo, Xiaohong Liu, Songchen Han, Shuaicheng Liu*
[arXiv](https://arxiv.org/abs/2407.08939) [Code]()
#### DNI: Dilutional Noise Initialization for Diffusion Video Editing
Sunjae Yoon, Gwanhyeong Koo, Ji Woo Hong, Chang D. Yoo*
[arXiv](https://www.arxiv.org/abs/2409.13037) [Code]()
#### Diffusion-Guided Weakly Supervised Semantic Segmentation
Sung-Hoon Yoon, Hoyong Kwon, Jaeseok Jeong, Daehee Park, Kuk-Jin Yoon*
[Paper](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/06482.pdf) [Code]()
#### Improving Virtual Try-On with Garment-focused Diffusion Models
Siqi Wan, Yehao Li, Jingwen Chen, Yingwei Pan*, Ting Yao, Yang Cao, Tao Mei
[arXiv](https://arxiv.org/abs/2409.08258) [Code]()
#### Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control
Yue Han*, Junwei Zhu, Keke He, Xu Chen, Yanhao Ge, Wei Li, Xiangtai Li, Jiangning Zhang, Chengjie Wang, Yong Liu
[arXiv](https://arxiv.org/abs/2405.12970) [Code]()
#### Diffusion Models as Optimizers for Efficient Planning in Offline RL
Renming Huang, Yunqiang Pei, Guoqing Wang*, Yangming Zhang, Yang Yang, Peng Wang, Heng Tao Shen
[arXiv](https://arxiv.org/abs/2407.16142) [Code]()
#### HiDiffusion: Unlocking Higher-Resolution Creativity and Efficiency in Pretrained Diffusion Models
Shen Zhang, Zhaowei Chen, Zhenyu Zhao, Yuhao Chen, Yao Tang, Jiajun Liang*
[arXiv](https://arxiv.org/abs/2311.17528) [Code]()
#### Dolfin: Diffusion Layout Transformers without Autoencoder
Yilin Wang, Zeyuan Chen, Liangjun Zhong, Zheng Ding, Zhuowen Tu*
[arXiv](https://arxiv.org/abs/2310.16305) [Code]()
#### StructLDM: Structured Latent Diffusion for 3D Human Generation
Tao Hu, Fangzhou Hong, Ziwei Liu*
[arXiv](https://arxiv.org/abs/2404.01241) [Code]()
#### Beyond the Contact: Discovering Comprehensive Affordance for 3D Objects from Pre-trained 2D Diffusion Models
Hyeonwoo Kim, Sookwan Han, Patrick Kwon, Hanbyul Joo*
[arXiv](https://arxiv.org/abs/2401.12978) [Code]()
#### DIFFender: Diffusion-Based Adversarial Defense against Patch Attacks
Caixin Kang*, Yinpeng Dong, Zhengyi Wang, Shouwei Ruan, Yubo Chen, Hang Su*, Xingxing Wei*
[arXiv](https://arxiv.org/abs/2306.09124) [Code]()
#### Hybrid Video Diffusion Models with 2D Triplane and 3D Wavelet Representation
Kihong Kim, Haneol Lee, Jihye Park, Seyeon Kim, Kwang Hee Lee, Seungryong Kim*, Jaejun Yoo*
[arXiv](https://arxiv.org/abs/2402.13729) [Code]()
#### Efficient Diffusion-Driven Corruption Editor for Test-Time Adaptation
Yeongtak Oh, Jonghyun Lee, Jooyoung Choi, Dahuin Jung, Uiwon Hwang*, Sungroh Yoon*
[arXiv](https://arxiv.org/abs/2403.10911) [Code]()
#### Adaptive Multi-modal Fusion of Spatially Variant Kernel Refinement with Diffusion Model for Blind Image Super-Resolution
Junxiong Lin*, Yan Wang, Zeng Tao, Boyang Wang, Qing Zhao, Haoran Wang, Xuan Tong, Xinji Mai, Yuxuan Lin, Wei Song, Jiawen Yu, Shaoqi Yan, Wenqiang Zhang
[arXiv](https://arxiv.org/abs/2403.05808) [Code]()
#### Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models
Chao Gong*, Kai Chen, Zhipeng Wei, Jingjing Chen*, Yu-Gang Jiang
[arXiv](https://arxiv.org/abs/2407.12383) [Code]()
#### Length-Aware Motion Synthesis via Latent Diffusion
Alessio Sampieri*, Alessio Palma, Indro Spinelli, Fabio Galasso
[arXiv](https://arxiv.org/abs/2407.11532) [Code]()
#### Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model
Danni Yang, Ruohan Dong, Jiayi Ji, Yiwei Ma, Haowei Wang, Xiaoshuai Sun*, Rongrong Ji
[arXiv](https://arxiv.org/abs/2407.05352) [Code]()
#### Improving image synthesis with diffusion-negative sampling
Alakh Desai*, Nuno Vasconcelos
[arXiv]() [Code]()
#### SignGen: End-to-End Sign Language Video Generation with Latent Diffusion
Fan Qi*, Yu Duan, Changsheng Xu, Huaiwen Zhang*
[Paper](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/06988.pdf) [Code]()
#### Diffusion Prior-Based Amortized Variational Inference for Noisy Inverse Problems
Sojin Lee, Dogyun Park, Inho Kong, Hyunwoo J. Kim*
[arXiv](https://arxiv.org/abs/2407.16125) [Code]()
#### TetraDiffusion: Tetrahedral Diffusion Models for 3D Shape Generation
Nikolai Kalischek*, Torben Peters, Jan Dirk Wegner, Konrad Schindler
[arXiv](https://arxiv.org/abs/2211.13220) [Code]()
#### Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts
Byeongjun Park, Hyojun Go, Jin-Young Kim, Sangmin Woo, Seokil Ham, Changick Kim*
[arXiv](https://arxiv.org/abs/2403.09176) [Code]()
#### DiffFAS: Face Anti-Spoofing via Generative Diffusion Models
Xinxu Ge, Xin Liu*, Zitong Yu*, Jingang Shi, Chun Qi, Jie Li, Heikki Kälviäinen
[arXiv](https://arxiv.org/abs/2409.08572) [Code]()
#### BK-SDM: A Lightweight, Fast, and Cheap Version of Stable Diffusion
Bo-Kyeong Kim*, Hyoung-Kyu Song, Thibault Castells, Shinkook Choi
[arXiv](https://arxiv.org/abs/2305.15798) [Code]()
#### CLIFF: Continual Latent Diffusion for Open-Vocabulary Object Detection
Wuyang Li, Xinyu Liu, Jiayi Ma, Yixuan Yuan*
[Paper](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/07221.pdf) [Code]()
#### Gated Temporal Diffusion for Stochastic Long-term Dense Anticipation
Olga Zatsarynna*, Emad Bahrami*, Yazan Abu Farha, Gianpiero Francesca, Jürgen Gall*
[arXiv](https://arxiv.org/abs/2407.11954) [Code]()
#### MotionDirector: Motion Customization of Text-to-Video Diffusion Models
Rui Zhao, Yuchao Gu, Jay Zhangjie Wu, David Junhao Zhang, Jia-Wei Liu, Weijia Wu, Jussi Keppo, Mike Zheng Shou*
[arXiv](https://arxiv.org/abs/2310.08465) [Code]()
#### Post-training Quantization with Progressive Calibration and Activation Relaxing for Text-to-Image Diffusion Models
Siao Tang, Xin Wang*, Hong Chen, Chaoyu Guan, Zewen Wu, Yansong Tang, Wenwu Zhu*
[arXiv](https://arxiv.org/abs/2311.06322) [Code]()
#### Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors
Ruicheng Wang*, Jianfeng Xiang, Jiaolong Yang, Xin Tong
[arXiv](https://arxiv.org/abs/2403.11503) [Code]()
#### Exact Diffusion Inversion via Bidirectional Integration Approximation
Guoqiang Zhang*, J.P. Lewis, W. Bastiaan Kleijn
[arXiv](https://arxiv.org/abs/2307.10829) [Code]()
#### Object-Centric Diffusion for Efficient Video Editing
Kumara Kahatapitiya*, Adil Karjauv, Davide Abati*, Fatih Porikli, Yuki M Asano, Amirhossein Habibian
[arXiv](https://arxiv.org/abs/2401.05735) [Code]()
#### Diffusion for Natural Image Matting
Yihan Hu*, Yiheng Lin, Wei Wang, Yao Zhao, Yunchao Wei*, Humphrey Shi
[arXiv](https://arxiv.org/abs/2312.05915) [Code]()
#### Unleashing Text-to-Image Diffusion Prior for Zero-Shot Image Captioning
Jianjie Luo, Jingwen Chen, Yehao Li, Yingwei Pan*, Jianlin Feng, Hongyang Chao, Ting Yao
[Paper](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/07445.pdf) [Code]()
#### Factorized Diffusion: Perceptual Illusions by Noise Decomposition
Daniel Geng*, Inbum Park, Andrew Owens
[arXiv](https://arxiv.org/abs/2404.11615) [Code]()
#### To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Unsafe Images ... For Now
Yimeng Zhang*, Jinghan Jia, Xin Chen, Aochuan Chen, Yihua Zhang, Jiancheng Liu, Ke Ding, Sijia Liu
[arXiv](https://arxiv.org/abs/2310.11868) [Code]()
#### FontStudio: Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation
Xinzhi Mu*, Li Chen, Bohan Chen, Shuyang Gu, Jianmin Bao, Dong Chen, Ji Li, Yuhui Yuan
[arXiv](https://arxiv.org/abs/2406.08392) [Code]()
#### One-Shot Diffusion Mimicker for Handwritten Text Generation
Gang Dai, Yifan Zhang, Quhui Ke, Qiangya Guo, Shuangping Huang*
[arXiv](https://www.arxiv.org/abs/2409.04004) [Code]()
#### Kernel Diffusion: An Alternate Approach to Blind Deconvolution
Yash Sanghvi*, Yiheng Chi, Stanley Chan
[arXiv](https://arxiv.org/abs/2312.02319) [Code]()
#### ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction
Shaozhe Hao*, Kai Han*, Zhengyao Lv, Shihao Zhao, Kwan-Yee K. Wong*
[arXiv](https://arxiv.org/abs/2407.07077) [Code]()
#### TCAN: Animating Human Images with Temporally Consistent Pose Guidance using Diffusion Models
Jeongho Kim*, Min-Jung Kim*, Junsoo Lee, Jaegul Choo*
[arXiv](http://arxiv.org/abs/2407.09012) [Code]()
#### DiffBIR: Toward Blind Image Restoration with Generative Diffusion Prior
Xinqi Lin*, Jingwen He, Ziyan Chen, Zhaoyang Lyu, Bo Dai, Fanghua Yu, Yu Qiao, Wanli Ouyang, Chao Dong*
[arXiv](https://arxiv.org/abs/2308.15070) [Code]()
#### Do text-free diffusion models learn discriminative visual representations?
Soumik Mukhopadhyay*, Matthew A Gwilliam*, Yosuke Yamaguchi, Vatsal Agarwal, Namitha Padmanabhan, Archana Swaminathan, Tianyi Zhou, Jun Ohya, Abhinav Shrivastava
[arXiv](https://arxiv.org/abs/2311.17921) [Code]()
#### LogoSticker: Inserting Logos into Diffusion Models for Customized Generation
Mingkang Zhu, Xi Chen, Zhongdao Wang, Hengshuang Zhao*, Jiaya Jia*
[arXiv](https://arxiv.org/abs/2407.13752) [Code]()
#### ProCreate, Don't Reproduce! Propulsive Energy Diffusion for Creative Generation
Jack Lu*, Ryan Teehan*, Mengye Ren*
[arXiv](https://arxiv.org/abs/2408.02226) [Code]()
#### IntrinsicAnything: Learning Diffusion Priors for Inverse Rendering Under Unknown Illumination
Xi Chen*, Sida Peng, Dongchen Yang, Yuan Liu, Bowen Pan, Chengfei Lyu, Xiaowei Zhou*
[arXiv](https://arxiv.org/abs/2404.11593) [Code]()
#### Mixture of Efficient Diffusion Experts Through Automatic Interval and Sub-Network Selection
Alireza Ganjdanesh*, Yan Kang, Yuchen Liu, Richard Zhang, Zhe Lin, Heng Huang
[arXiv](https://arxiv.org/abs/2409.15557) [Code]()
#### Compensation Sampling for Improved Convergence in Diffusion Models
Hui Lu*, Albert Ali Salah, Ronald Poppe
[arXiv](https://arxiv.org/abs/2312.06285) [Code]()
#### Lossy Image Compression with Foundation Diffusion Models
Lucas Relic*, Roberto Azevedo, Markus Gross, Christopher Schroers*
[arXiv](https://arxiv.org/abs/2404.08580) [Code]()
#### FMBoost: Boosting Latent Diffusion with Flow Matching
Johannes S Fischer*, Ming Gui, Pingchuan Ma, Nick Stracke, Stefan Andreas Baumann, Vincent Tao Hu, Björn Ommer
[arXiv](https://arxiv.org/abs/2312.07360) [Code]()
#### Diffusion Models as Data Mining Tools
Ioannis Siglidis*, Aleksander Holynski, Alexei A. Efros, Mathieu Aubry, Shiry Ginosar
[arXiv](https://arxiv.org/abs/2408.02752) [Code]()
#### Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering
Ruofan Liang, Zan Gojcic, Merlin Nimier-David, David Acuna, Nandita Vijaykumar, Sanja Fidler, Zian Wang*
[arXiv](https://arxiv.org/abs/2408.09702) [Code]()
#### MobileDiffusion: Instant Text-to-Image Generation on Mobile Devices
Yang Zhao*, Zhisheng Xiao*, Yanwu Xu, Haolin Jia, Tingbo Hou
[arXiv](https://arxiv.org/abs/2311.16567) [Code]()
#### Osmosis: RGBD Diffusion Prior for Underwater Image Restoration
Opher Bar Nathan*, Deborah Levy, Tali Treibitz, Dan Rosenbaum
[arXiv](https://arxiv.org/abs/2403.14837) [Code]()
#### Large-scale Reinforcement Learning for Diffusion Models
Yinan Zhang*, Eric Tzeng, Yilun Du, Dmitry Kislyuk*
[arXiv](https://arxiv.org/abs/2401.12244) [Code]()
#### CoMusion: Towards Consistent Stochastic Human Motion Prediction via Motion Diffusion
Jiarui Sun*, Girish Chowdhary*
[arXiv](https://arxiv.org/abs/2305.12554) [Code]()
#### EditShield: Protecting Unauthorized Image Editing by Instruction-guided Diffusion Models
Ruoxi Chen, Haibo Jin, Yixin Liu, Jinyin Chen*, Haohan Wang, Lichao Sun
[arXiv](https://arxiv.org/abs/2311.12066) [Code]()
#### Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities
Lorenzo Baraldi*, Federico Cocchi, Marcella Cornia, Lorenzo Baraldi, Alessandro Nicolosi, Rita Cucchiara
[arXiv](https://arxiv.org/abs/2407.20337) [Code]()
#### Diffusion Soup: Model Merging for Text-to-Image Diffusion Models
Benjamin J Biggs*, Arjun Seshadri, Yang Zou, Achin Jain, Aditya Golatkar, Yusheng Xie, Alessandro Achille, Ashwin Swaminathan, Stefano Soatto
[arXiv](https://arxiv.org/abs/2406.08431) [Code]()
#### DEPICT: Diffusion-Enabled Permutation Importance for Image Classification Tasks
Sarah Jabbour*, Gregory Kondas, Ella Kazerooni, Michael Sjoding, David Fouhey, Jenna Wiens
[arXiv](https://arxiv.org/abs/2407.14509) [Code]()
#### BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion
Gwanghyun Kim, Hayeon Kim, Hoigi Seo, Dong Un Kang, Se Young Chun*
[arXiv](https://arxiv.org/abs/2404.04544) [Code]()
#### Viewpoint textual inversion: discovering scene representations and 3D view control in 2D diffusion models
James Burgess*, Kuan-Chieh Wang, Serena Yeung-Levy
[arXiv](https://arxiv.org/abs/2309.07986) [Code]()
#### Loc3Diff: Local Diffusion for 3D Human Head Synthesis and Editing
Yushi Lan*, Feitong Tan, Qiangeng Xu, Di Qiu, Kyle Genova, Zeng Huang, Rohit Pandey, Sean Fanello, Thomas Funkhouser, Chen Change Loy, Yinda Zhang*
[Paper](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/08166.pdf) [Code]()
#### Diff-Reg: Diffusion Model in Doubly Stochastic Matrix Space for Registration Problem
Qianliang Wu*, Haobo Jiang*, Lei Luo, Jun Li, Yaqing Ding*, Jin Xie*, Jian Yang*
[arXiv](https://arxiv.org/pdf/2403.19919) [Code]()
#### Investigating Style Similarity in Diffusion Models
Gowthami Somepalli*, Anubhav Gupta, Kamal Gupta, Shramay Palta, Micah Goldblum, Jonas A. Geiping, Abhinav Shrivastava, Tom Goldstein
[arXiv](https://arxiv.org/abs/2404.01292) [Code]()
#### Timestep-Aware Correction for Quantized Diffusion Models
Yuzhe Yao, Feng Tian, Jun Chen*, Haonan Lin, Guang Dai, Yong Liu, Jingdong Wang
[arXiv](https://arxiv.org/abs/2407.03917) [Code]()
#### VQA-Diff: Exploiting VQA and Diffusion for Zero-Shot Image-to-3D Vehicle Asset Generation in Autonomous Driving
Yibo Liu*, Zheyuan Yang, Guile Wu, Yuan Ren, Kejian Lin, Liu Bingbing, Yang Liu, Jinjun Shan
[arXiv](https://arxiv.org/abs/2407.06516) [Code]()
#### Unmasking Bias in Diffusion Model Training
Hu Yu, Li Shen, Jie Huang, Hongsheng Li, Feng Zhao*
[arXiv](https://arxiv.org/abs/2310.08442) [Code]()
#### Layered Rendering Diffusion Model for Controllable Zero-Shot Image Synthesis
Zipeng Qi, Guoxi Huang*, Chenyang Liu, Fei Ye
[arXiv](https://arxiv.org/abs/2311.18435) [Code]()
#### A Simple Background Augmentation Method for Object Detection with Diffusion Model
Yuhang Li, Xin Dong, Chen Chen, Weiming Zhuang, Lingjuan Lyu*
[arXiv](https://arxiv.org/abs/2408.00350) [Code]()
#### Safeguard Text-to-Image Diffusion Models with Human Feedback Inversion
Sanghyun Kim*, Seohyeon Jung, Balhae Kim, Moonseok Choi, Jinwoo Shin, Juho Lee*
[arXiv](https://arxiv.org/abs/2407.21032) [Code]()
#### An Explainable Vision Question Answer Model via Diffusion Chain-of-Thought
Chunhao Lu, Qiang Lu*, Jake Luo
[Paper](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/08395.pdf) [Code]()
#### FrePolad: Frequency-Rectified Point Latent Diffusion for Point Cloud Generation
Chenliang Zhou*, Fangcheng Zhong, Param Hanji, Zhilin Guo, Kyle Thomas Fogarty, Alejandro Sztrajman, Hongyun Gao, A. Cengiz Oztireli
[arXiv](https://arxiv.org/abs/2311.12090) [Code]()
#### GAMMA-FACE: GAussian Mixture Models Amend Diffusion Models for Bias Mitigation in Face Images
Basudha Pal*, Arunkumar Kannan*, Ram Prabhakar Kathirvel, Alice O'Toole, Rama Chellappa
[Project](https://bas-2k.github.io/gamma-face/) [Code]()
#### PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation
Jian Ma, Chen Chen*, Qingsong Xie, Haonan Lu*
[arXiv](https://arxiv.org/abs/2311.17086) [Code]()
#### Diffusion Model for Robust Multi-Sensor Fusion in 3D Object Detection and BEV Segmentation
Duy Tho Le*, Hengcan Shi*, Jianfei Cai, Hamid Rezatofighi
[arXiv](https://arxiv.org/html/2404.04629v1) [Code]()
#### Self-Guided Generation of Minority Samples Using Diffusion Models
Soobin Um, Jong Chul Ye*
[arXiv](https://arxiv.org/abs/2407.11555) [Code]()
#### Pyramid Diffusion for Fine 3D Large Scene Generation
Yuheng Liu*, Xinke Li, Xueting Li, Lu Qi*, Chongshou Li, Ming-Hsuan Yang
[arXiv](https://arxiv.org/abs/2311.12085) [Code]()
#### ShoeModel: Learning to Wear on the User-specified Shoes via Diffusion Model
Wenyu Li*, Binghui Chen, Yifeng Geng, Xuansong Xie, Wangmeng Zuo
[arXiv](https://arxiv.org/abs/2404.04833) [Code]()
#### A Watermark-Conditioned Diffusion Model for IP Protection
Rui Min*, Sen Li*, Hongyang Chen*, Minhao Cheng*
[arXiv](https://arxiv.org/abs/2403.10893) [Code]()
#### Lost in Translation: Latent Concept Misalignment in Text-to-Image Diffusion Models
Juntu Zhao, Junyu Deng, Yixin Ye, Chongxuan Li, Zhijie Deng*, Dequan Wang*
[arXiv](https://arxiv.org/abs/2408.00230) [Code]()
#### Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression
Animesh Sinha*, Bo Sun, Anmol Kalia, Arantxa Casanova, Elliot Blanchard, David Yan, Winnie Zhang, Tony Nelli, Jiahui Chen, Hardik Shah, Licheng Yu, Mitesh Kumar Singh, Ankit Ramchandani, Maziar Sanjabi, Sonal Gupta, Amy L Bearman, Dhruv Mahajan
[arXiv](https://arxiv.org/abs/2311.10794) [Code]()
#### GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection
Hang Yao, Ming Liu*, Zhicun Yin, Zifei Yan, Xiaopeng Hong, Wangmeng Zuo
[arXiv](https://arxiv.org/abs/2406.07487) [Code]()
#### CipherDM: Secure Three-Party Inference for Diffusion Model Sampling
Xin Zhao, Xiaojun Chen*, Xudong Chen, He Li, Tingyu Fan, Zhendong Zhao
[arXiv](https://www.arxiv.org/abs/2409.05414) [Code]()
#### Time-Efficient and Identity-Consistent Virtual Try-On Using A Variant of Altered Diffusion Models
Phuong Hoang Dam*, Jihoon Jeong*, Anh T Tran*, Daeyoung Kim*
[arXiv](https://arxiv.org/abs/2403.07371) [Code]()
#### Self-Rectifying Diffusion Sampling with Perturbed-Attention Guidance
Donghoon Ahn, Hyoungwon Cho, Jaewon Min, Jungwoo Kim, Wooseok Jang, SeonHwa Kim, Hyun Hee Park, Kyong Hwan Jin*, Seungryong Kim*
[arXiv](https://arxiv.org/abs/2403.17377) [Code]()
#### FRDiff: Feature Reuse for Universal Training-free Acceleration of Diffusion Models
Junhyuk So, Jungwon Lee, Eunhyeok Park*
[arXiv](https://arxiv.org/abs/2312.03517) [Code]()
#### Diffusion for Out-of-Distribution Detection on Road Scenes and Beyond
Silvio Galesso*, Philipp Schröppel*, Hssan Driss, Thomas Brox
[arXiv](https://arxiv.org/abs/2407.15739) [Code]()
#### MONTAGE: Monitoring Training for Attribution of Generative Diffusion Models
Jonathan Brokman*, Omer Hofman, Roman Vainshtein, Amit Giloni, Toshiya Shimizu, Inderjeet Singh, Oren Rachmil, Alon Zolfi, Asaf Shabtai, Yuki Unno, Hisashi Kojima
[Paper](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/09513.pdf) [Code]()
#### Style-Extracting Diffusion Models for Semi-Supervised Histopathology Segmentation
Mathias Öttl*, Frauke Wilm, Jana Steenpass, Jingna Qiu, Matthias Rübner, Prof Arndt Hartmann, Matthias W. Beckmann, Peter Fasching, Andreas K Maier, Ramona Erber, Bernhard Kainz, Katharina Breininger
[arXiv](https://arxiv.org/abs/2403.14429) [Code]()
#### Deep Diffusion Image Prior for Efficient OOD Adaptation in 3D Inverse Problems
Hyungjin Chung, Jong Chul Ye*
[arXiv](https://arxiv.org/abs/2407.10641) [Code]()
#### LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model
Runhui Huang, Kaixin Cai, Jianhua Han, Xiaodan Liang*, Renjing Pei, Guansong Lu, Songcen Xu, Wei Zhang, Hang Xu
[arXiv](https://arxiv.org/abs/2403.11929) [Code]()
#### UpFusion: Novel View Diffusion from Unposed Sparse View Observations
Bharath Raj Nagoor Kani*, Hsin-Ying Lee, Sergey Tulyakov, Shubham Tulsiani
[arXiv](https://arxiv.org/abs/2312.06661) [Code]()
#### Video Editing via Factorized Diffusion Distillation
Uriel Singer*, Amit Zohar*, Yuval Kirstain, Shelly Sheynin, Adam Polyak, Devi Parikh, Yaniv Taigman
[arXiv](https://arxiv.org/abs/2403.09334) [Code]()
#### CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion
Wendi Zheng*, Jiayan Teng, Zhuoyi Yang, Weihan Wang, Jidong Chen, Xiaotao Gu, Yuxiao Dong*, Ming Ding*, Jie Tang*
[arXiv](https://arxiv.org/abs/2403.05121) [Code]()
#### SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers
Nanye Ma*, Mark Goldstein, Michael Albergo, Nicholas M Boffi, Eric Vanden-Eijnden*, Saining Xie*
[arXiv](https://arxiv.org/abs/2401.08740) [Code]()
#### Curved Diffusion: A Generative Model With Optical Geometry Control
Andrey Voynov*, Amir Hertz, Moab Arar, Shlomi Fruchter, Daniel Cohen-Or
[arXiv](https://arxiv.org/abs/2311.17609) [Code]()
#### AnimateMe: 4D Facial Expressions via Diffusion Models
Dimitrios Gerogiannis*, Foivos Paraperas Papantoniou, Rolandos Alexandros Potamias, Alexandros Lattas, Stylianos Moschoglou, Stylianos Ploumpis, Stefanos Zafeiriou
[arXiv](https://arxiv.org/abs/2403.17213) [Code]()
#### Unveiling and Mitigating Memorization in Text-to-image Diffusion Models through Cross Attention
Jie Ren*, Yaxin Li, Shenglai Zeng, Han Xu, Lingjuan Lyu, Yue Xing, Jiliang Tang
[arXiv](https://arxiv.org/abs/2403.11052) [Code]()
#### Context Diffusion: In-Context Aware Image Generation
Ivona Najdenkoska*, Animesh Sinha, Abhimanyu Dubey, Dhruv Mahajan, Vignesh Ramanathan, Filip Radenovic
[arXiv](https://arxiv.org/abs/2312.03584) [Code]()
#### Adaptive Compressed Sensing with Diffusion-Based Posterior Sampling
Noam Elata*, Tomer Michaeli, Michael Elad
[arXiv](https://arxiv.org/abs/2407.08256) [Code]()
#### Data Augmentation via Latent Diffusion for Saliency Prediction
Bahar Aydemir*, Deblina Bhattacharjee, Tong Zhang, Mathieu Salzmann, Sabine Süsstrunk
[arXiv](https://arxiv.org/abs/2409.07307) [Code]()
#### A Diffusion Model for Simulation Ready Coronary Anatomy with Morpho-skeletal Control
Karim Kadry*, Shreya Gupta, Jonas Sogbadji, Michiel Schaap, Kersten Petersen, Takuya Mizukami, Carlos Collet, Farhad R. Nezami, Elazer R Edelman
[arXiv](https://arxiv.org/abs/2407.15631) [Code]()
#### DrivingDiffusion: Layout-Guided Multi-View Driving Scenarios Video Generation with Latent Diffusion Model
Li Xiaofan*, Zhang Yifu*, Ye Xiaoqing*
[arXiv](https://arxiv.org/abs/2310.07771) [Code]()
#### GSD: View-Guided Gaussian Splatting Diffusion for 3D Reconstruction
Yuxuan Mu*, Xinxin Zuo, Chuan Guo, Yilin Wang, Juwei Lu, Xiaofei Wu, Songcen Xu, Peng Dai, Youliang Yan, Li Cheng
[arXiv](https://arxiv.org/abs/2407.04237) [Code]()
#### AdaDiff: Accelerating Diffusion Models through Step-Wise Adaptive Computation
Shengkun Tang*, Yaqing Wang, Caiwen Ding, Yi Liang, Yao Li, Dongkuan Xu
[arXiv](https://arxiv.org/abs/2309.17074) [Code]()
#### Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas
Fabio Quattrini*, Vittorio Pippi, Silvia Cascianelli*, Rita Cucchiara
[arXiv](https://arxiv.org/abs/2408.15660) [Code]()
#### Photorealistic Video Generation with Diffusion Models
Agrim Gupta*, Lijun Yu, Kihyuk Sohn, Xiuye Gu, Meera Hahn, Li Fei-Fei, Irfan Essa, Lu Jiang, Jose Lezama
[arXiv](https://arxiv.org/abs/2312.06662) [Code]()
#### WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation
Jiachen Lu, Ze Huang, Zeyu Yang, Zhang Jiahui, Li Zhang*
[arXiv](https://arxiv.org/abs/2312.02934) [Code]()
#### Soft Shadow Diffusion (SSD): Physics-inspired Learning for 3D Computational Periscopy
Fadlullah A Raji*, John Murray-Bruce*
[Paper](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/10427.pdf) [Code]()
#### Tackling Structural Hallucination in Image Translation with Local Diffusion
Seunghoi Kim*, Chen Jin, Tom Diethe, Matteo Figini, Henry FJ Tregidgo, Asher Mullokandov, Philip A Teare, Daniel Alexander
[arXiv](https://arxiv.org/abs/2404.05980) [Code]()
#### Adversarial Robustification via Text-to-Image Diffusion Models
Daewon Choi, Jongheon Jeong, Huiwon Jang, Jinwoo Shin*
[arXiv](https://arxiv.org/abs/2407.18658) [Code]()
#### Learning Quantized Adaptive Conditions for Diffusion Models
Yuchen Liang*, Yuchuan Tian, Lei Yu, Huaao Tang, Jie Hu, Xiangzhong Fang, Hanting Chen*
[arXiv](https://arxiv.org/abs/2409.17487) [Code]()
#### SwiftBrush v2: Make Your One-step Diffusion Model Better Than Its Teacher
Trung Tuan Dao*, Thuan Hoang Nguyen, Thanh Van Le, Duc H Vu, Khoi Nguyen, Cuong Pham, Anh T Tran*
[arXiv](https://arxiv.org/abs/2408.14176) [Code]()
#### DiffSurf: A Transformer-based Diffusion Model for Generating and Reconstructing 3D Surfaces in Pose
Yusuke Yoshiyasu*, Leyuan Sun
[arXiv](https://www.arxiv.org/abs/2408.14860) [Code]()
#### SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow
Yuanzhi Zhu*, Xingchao Liu, Qiang Liu*
[arXiv](https://arxiv.org/abs/2407.12718) [Code]()
#### DreamSampler: Unifying Diffusion Sampling and Score Distillation for Image Manipulation
Jeongsol Kim, Geon Yeong Park, Jong Chul Ye*
[arXiv](https://arxiv.org/abs/2403.11415) [Code]()
#### PreciseControl: Enhancing Text-To-Image Diffusion Models with Fine-Grained Attribute Control
Rishubh Parihar*, Sachidanand VS, Sabariswaran Mani, Tejan Karmali, Venkatesh Babu Radhakrishnan
[arXiv](https://www.arxiv.org/abs/2408.05083) [Code]()
#### Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models
Xiaoshi Wu, Yiming Hao, Manyuan Zhang*, Keqiang Sun, Zhaoyang Huang, Guanglu Song, Yu Liu, Hongsheng Li*
[arXiv](https://arxiv.org/abs/2405.00760) [Code]()
#### Inf-DiT: Upsampling any-resolution image with memory-efficient diffusion transformer
Zhuoyi Yang*, Heyang Jiang, Wenyi Hong, Jiayan Teng, Wendi Zheng, Yuxiao Dong, Ming Ding, Jie Tang
[arXiv](https://arxiv.org/abs/2405.04312) [Code]()
#### EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Linrui Tian*, Qi Wang*, Bang Zhang*, Liefeng Bo*
[arXiv](https://arxiv.org/abs/2402.17485) [Code]()
#### Zero-Shot Adaptation for Approximate Posterior Sampling of Diffusion Models in Inverse Problems
Yasar U Alcalar*, Mehmet Akcakaya
[arXiv](https://arxiv.org/abs/2407.11288) [Code]()
#### R.A.C.E.: Robust Adversarial Concept Erasure for Secure Text-to-Image Diffusion Model
Changhoon Kim*, Kyle Min*, Yezhou Yang
[arXiv](https://arxiv.org/abs/2405.16341) [Code]()
#### Few-Shot Image Generation by Conditional Relaxing Diffusion Inversion
Yu Cao*, Shaogang Gong
[arXiv](https://arxiv.org/abs/2407.07249) [Code]()
#### A High-Quality Robust Diffusion Framework for Corrupted Dataset
Quan Dao*, Binh Ta, Tung Pham, Anh Tran
[arXiv](https://arxiv.org/abs/2311.17101) [Code]()
#### Identity-Consistent Diffusion Network for Grading Knee Osteoarthritis Progression in Radiographic Imaging
Wenhua Wu, Kun Hu*, Wenxi Yue, Wei Li, Milena Simic, Changyang Li, Wei Xiang, Zhiyong Wang
[arXiv](https://arxiv.org/abs/2407.21381) [Code]()
#### Fast Training of Diffusion Transformer with Extreme Masking for 3D Point Clouds Generation
Shentong Mo, Enze Xie*, Yue Wu, Junsong Chen, Matthias Niessner, Zhenguo Li
[arXiv](https://arxiv.org/abs/2312.07231) [Code]()
#### Pix2Gif: Motion-Guided Diffusion for GIF Generation
Hitesh Kandala*, Jianfeng Gao, Jianwei Yang
[arXiv](https://arxiv.org/abs/2403.04634) [Code]()
#### T2IShield: Defending Against Backdoors on Text-to-Image Diffusion Models
Zhongqi Wang, Jie Zhang*, Shiguang Shan, Xilin Chen
[arXiv](https://arxiv.org/abs/2407.04215) [Code]()
#### DiffusionPen: Towards Controlling the Style of Handwritten Text Generation
Konstantina Nikolaidou*, George Retsinas, Giorgos Sfikas, Marcus Liwicki
[arXiv](https://arxiv.org/abs/2409.06065) [Code]()
#### Learning Pseudo 3D Guidance for View-consistent Texturing with 2D Diffusion
Kehan Li, Yanbo Fan*, Yang Wu, Zhongqian Sun, Wei Yang, Xiangyang Ji, Li Yuan, Jie Chen*
[Paper](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/11528.pdf) [Code]()
#### Enhancing Semantic Fidelity in Text-to-Image Synthesis: Attention Regulation in Diffusion Models
Yang Zhang*, Tze Tzun Teoh, Wei Hern Lim, Kenji Kawaguchi
[arXiv](https://arxiv.org/abs/2403.06381) [Code]()
#### Adversarial Diffusion Distillation
Axel Sauer*, Dominik Lorenz, Andreas Blattmann, Robin Rombach
[arXiv](https://arxiv.org/abs/2311.17042) [Code]()
#### Improving Diffusion Models for Authentic Virtual Try-on in the Wild
Yisol Choi*, Sangkyung Kwak, Kyungmin Lee, Hyungwon Choi, Jinwoo Shin*
[arXiv](https://arxiv.org/abs/2403.05139) [Code]()
#### Fast Diffusion-Based Counterfactuals for Shortcut Removal and Generation
Nina Weng*, Paraskevas Pegios, Eike Petersen, Aasa Feragen, Siavash Arjomand Bigdeli
[arXiv](https://arxiv.org/abs/2312.14223) [Code]()
#### Iterative Ensemble Training with Anti-Gradient Control for Mitigating Memorization in Diffusion Models
Xiao Liu, Xiaoliu Guan, Yu Wu*, Jiaxu Miao*
[arXiv](https://arxiv.org/abs/2407.15328) [Code]()
#### DiffClass: Diffusion-Based Class Incremental Learning
Zichong Meng, Jie Zhang, Changdi Yang, Zheng Zhan, Pu Zhao*, Yanzhi Wang*
[arXiv](https://arxiv.org/abs/2403.05016) [Code]()
#### Instant 3D Human Avatar Generation using Image Diffusion Models
Nikos Kolotouros*, Thiemo Alldieck, Enric Corona, Eduard Gabriel Bazavan, Cristian Sminchisescu
[arXiv](https://arxiv.org/abs/2406.07516) [Code]()
#### Concept Arithmetics for Circumventing Concept Inhibition in Diffusion Models
Vitali Petsiuk*, Kate Saenko
[arXiv](https://arxiv.org/abs/2404.13706) [Code]()
#### ControlNet-XS: Rethinking the Control of Text-to-Image Diffusion Models as Feedback-Control Systems
Denis Zavadski*, Johann-Friedrich Feiden, Carsten Rother
[arXiv](https://arxiv.org/abs/2312.06573) [Code]()
#### Hierarchical Conditioning of Diffusion Models Using Tree-of-Life for Studying Species Evolution
Mridul Khurana*, Arka Daw, M. Maruf, Josef C. Uyeda, Wasila Dahdul, Caleb Charpentier, Yasin Bakış, Henry L. Bart Jr., Paula M. Mabee, Hilmar Lapp, James P. Balhoff, Wei-Lun Chao, Charles Stewart, Tanya Berger-Wolf, Anuj Karpatne*
[arXiv](https://arxiv.org/abs/2408.00160) [Code]()