An open API service indexing awesome lists of open source software.

https://github.com/moatifbutt/awesome-diffusion-eccv-2024

List of diffusion papers accepted in ECCV 2024.
https://github.com/moatifbutt/awesome-diffusion-eccv-2024

List: awesome-diffusion-eccv-2024

accepted-papers diffusion diffusion-models eccv eccv-2024 eccv2024 t2i text-to-image

Last synced: 5 months ago
JSON representation

List of diffusion papers accepted in ECCV 2024.

Awesome Lists containing this project

README

          

# Diffusion papers in ECCV 2024
List of papers accepted in ECCV 2024.


#### SMooDi: Stylized Motion Diffusion Model
Lei Zhong, Yiming Xie, Varun Jampani, Deqing Sun, Huaizu Jiang*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2407.12783)] [[Project](https://neu-vi.github.io/SMooDi/)] [[Code](https://github.com/neu-vi/SMooDi)] [[Slides](https://eccv.ecva.net/media/eccv-2024/Slides/1010.pdf)]

#### SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion
Vikram Voleti*, Chun-Han Yao, Mark Boss, Adam Letts, David Pankratz, Dmitrii Tochilkin, Christian Laforte, Robin Rombach, Varun Jampani*

![Oral Badge](https://img.shields.io/badge/Oral-blue) [[arXiv](https://arxiv.org/abs/2403.12008)] [[Project](https://sv3d.github.io/)] [[Model](https://huggingface.co/stabilityai/sv3d)]

#### EMDM: Efficient Motion Diffusion Model for Fast, High-Quality Human Motion Generation
Wenyang Zhou, Zhiyang Dou*, Zeyu Cao, Zhouyingcheng Liao, Jingbo Wang, Wenjia Wang, Yuan Liu, Taku Komura, Wenping Wang, Lingjie Liu

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2312.02256)] [[Project](https://frank-zy-dou.github.io/projects/EMDM/index.html)] [[Code](https://github.com/Frank-ZY-Dou/EMDM)] [[Demo Video](https://www.youtube.com/watch?v=1SyCXbnol_g&ab_channel=FrankZhiyangDou)]

#### Diffusion Bridges for 3D Point Cloud Denoising
Mathias Vogel Hüni, Keisuke Tateno, Marc Pollefeys, Federico Tombari, Marie-Julie Rakotosaona, Francis Engelmann*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2408.16325)] [[Project](https://p2p-bridge.github.io/)] [[Code](https://github.com/matvogel/P2P-Bridge)] [[Poster](https://p2p-bridge.github.io/static/images/poster.png)]

#### VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models
Junlin Han*, Filippos Kokkinos, Philip Torr

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2403.12034)] [[Project](https://junlinhan.github.io/projects/vfusion3d.html)] [[Code](https://github.com/facebookresearch/vfusion3d)] [[Poster](https://junlinhan.github.io/projects/resources/paper16/vfusion3d_poster.pdf)] [[Huggingface Demo](https://huggingface.co/spaces/facebook/VFusion3D)]

#### Beta-Tuned Timestep Diffusion Model
Tianyi Zheng*, Peng-Tao Jiang, Ben Wan, Hao Zhang, Jinwei Chen, Jia Wang*, Bo Li*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/00328.pdf)]

#### Taming Latent Diffusion Model for Neural Radiance Field Inpainting
Chieh Hubert Lin*, Changil Kim, Jia-Bin Huang, Qinbo Li, Chih-Yao Ma, Johannes Kopf, Ming-Hsuan Yang, Hung-Yu Tseng

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2404.09995)] [[Project](https://hubert0527.github.io/MALD-NeRF/)]

#### FreeInit: Bridging Initialization Gap in Video Diffusion Models
Tianxing Wu*, Chenyang Si, Yuming Jiang, Ziqi Huang, Ziwei Liu

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2312.07537)] [[Project](https://tianxingwu.github.io/pages/FreeInit/)] [[Code](https://github.com/TianxingWu/FreeInit)] [[Huggingface Demo](https://huggingface.co/spaces/TianxingWu/FreeInit)] [[Video](https://youtu.be/lS5IYbAqriI)]

#### LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation
Yushi Lan, Fangzhou Hong, Shuai Yang, Shangchen Zhou, Xuyi Meng, Bo Dai, Xingang Pan, Chen Change Loy*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2403.12019)] [[Project](https://nirvanalan.github.io/projects/ln3diff/)] [[Code](https://github.com/NIRVANALAN/LN3Diff)] [[Gradio Demo](https://huggingface.co/spaces/yslan/LN3Diff_I23D)]

#### UniDream: Unifying Diffusion Priors for Relightable Text-to-3D Generation
Zexiang Liu, Yangguang Li, Youtian Lin, Xin Yu, Sida Peng, Yan-Pei Cao, Xiaojuan Qi, Xiaoshui Huang, Ding Liang*, Wanli Ouyang

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2312.08754)] [[Project](https://yg256li.github.io/UniDream/)] [[Code](https://github.com/YG256Li/UniDream)]

#### FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models
Wei WU*, Qingnan Fan, Shuai Qin, Hong Gu, Ruoyu Zhao, Antoni Chan*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2404.11895)] [[Code](https://github.com/Thermal-Dynamics/FreeDiff)]

#### Synchronous Diffusion for Unsupervised Smooth Non-Rigid 3D Shape Matching
Dongliang Cao*, Zorah Laehner, Florian Bernard

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2407.08244)]

#### Diffusion Models for Open-Vocabulary Segmentation
Laurynas Karazija*, Iro Laina, Andrea Vedaldi, Christian Rupprecht

![Oral Badge](https://img.shields.io/badge/Oral-blue) [[arXiv](https://arxiv.org/abs/2306.09316)] [[Project](https://www.robots.ox.ac.uk/~vgg/research/ovdiff/)] [[Video](https://youtu.be/OSDtkp7Ta-8)]

#### AccDiffusion: An Accurate Method for Higher-Resolution Image Generation
Zhihang Lin, Mingbao Lin, Meng Zhao, Rongrong Ji*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2407.10738)] [[Project](https://lzhxmu.github.io/accdiffusion/accdiffusion.html)] [[Code](https://github.com/lzhxmu/AccDiffusion)]

#### Learning Differentially Private Diffusion Models via Stochastic Adversarial Distillation
Bochao Liu, Pengju Wang, Shiming Ge*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2408.14738)]

#### Prompting Future Driven Diffusion Model for Hand Motion Prediction
Bowen Tang*, Kaihao Zhang*, Wenhan Luo*, Wei Liu, HONGDONG LI

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/01102.pdf)] [[Poster](https://eccv.ecva.net/media/PosterPDFs/ECCV%202024/653.png?t=1725929440.8208065)]

#### ColorPeel: Color Prompt Learning with Diffusion Models via Color and Shape Disentanglement
Muhammad Atif Butt*, Kai Wang, Javier Vazquez-Corral, Joost van de Weijer

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2407.07197)] [[Project](https://moatifbutt.github.io/colorpeel/)] [[Code](https://github.com/moatifbutt/color-peel)] [[Poster](https://github.com/moatifbutt/color-peel/blob/main/assets/ECCV2024_ColorPeel_.pdf)]

#### DiffiT: Diffusion Vision Transformers for Image Generation
Ali Hatamizadeh*, Jiaming Song, Guilin Liu, Jan Kautz, Arash Vahdat

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2312.02139)] [[Code](https://github.com/NVlabs/DiffiT)]

#### MoE-DiffIR: Task-customized Diffusion Priors for Universal Compressed Image Restoration
Yulin Ren, Xin Li*, Bingchen Li, Xingrui Wang, Mengxi China Guo, Shijie Zhao, Li Zhang, Zhibo Chen*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2407.10833)] [[Project](https://renyulin-f.github.io/MoE-DiffIR.github.io/)] [[Code](https://github.com/renyulin-f/MoE-DiffIR)] [[Data](https://drive.google.com/drive/folders/1Kn8SjJWpHITHlg5kuL1Ur7Ml-WNJJ064)]

#### MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection
Youngmin Oh, Hyung-Il Kim, Seong Tae Kim*, Jung Uk Kim*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2407.16448)] [[Code](https://github.com/VisualAIKHU/MonoWAD)] [[Data](https://drive.google.com/file/d/1iOpoZ-QbJdU2ytRmd9wPxH0RNjZ6KNdQ/view)]

#### Pixel-Aware Stable Diffusion for Realistic Image Super-Resolution and Personalized Stylization
Tao Yang*, Rongyuan Wu, Peiran Ren, Xuansong Xie, Lei Zhang

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2308.14469)] [[Code](https://github.com/yangxy/PASD/)] [[Data](https://huggingface.co/datasets/yangtao9009/PASD_dataset)] [[Demo](https://colab.research.google.com/drive/1lZ_-rSGcmreLCiRniVT973x6JLjFiC-b?usp=sharing)]

#### XPSR: Cross-modal Priors for Diffusion-based Image Super-Resolution
Qu Yunpeng*, Kun Yuan, Kai Zhao, Qizhi Xie, Jinhua Hao, Ming Sun, Chao Zhou

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2403.05049)] [[Code](https://github.com/qyp2000/XPSR)]

#### DiffusionDepth: Diffusion Denoising Approach for Monocular Depth Estimation
Yiqun Duan*, Xianda Guo*, Zheng Zhu

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2303.05021)] [[Code](https://github.com/duanyiqun/DiffusionDepth)]

#### DC-Solver: Improving Predictor-Corrector Diffusion Sampler via Dynamic Compensation
Wenliang Zhao, Haolin Wang, Jie Zhou, Jiwen Lu*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2409.03755)] [[Code](https://github.com/wl-zhao/DC-Solver)]

#### Enhancing Perceptual Quality in Video Super-Resolution through Temporally-Consistent Detail Synthesis using Diffusion Models
Claudio Rota*, Marco Buzzelli, Joost van de Weijer

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2311.15908)] [[Code](https://github.com/claudiom4sir/StableVSR)]

#### DreamDissector: Learning Disentangled Text-to-3D Generation from 2D Diffusion Priors
Zizheng Yan*, Jiapeng Zhou, Fanpeng Meng, Yushuang Wu, Lingteng Qiu, Zisheng Ye, Shuguang Cui, Guanying CHEN, Xiaoguang Han*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2407.16260)] [[Project](https://chester256.github.io/dreamdissector/)] [[Video](https://youtu.be/qHiEoio7SJ0)]

#### Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion
Xiang Fan*, Anand Bhattad, Ranjay Krishna

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2403.14617)] [[Project](https://videoshop-editing.github.io/)] [[Code](https://github.com/sfanxiang/videoshop)] [[Supplementary](https://videoshop-editing.github.io/static/supplementary/)] [[Video](https://videoshop-editing.github.io/static/supplementary/assets/intro.mp4)]

#### Diffusion-Based Image-to-Image Translation by Noise Correction via Prompt Interpolation
Junsung Lee, Minsoo Kang, Bohyung Han*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2409.08077)] [[Code](https://github.com/JS-Lee525/PIC)]

#### RadEdit: stress-testing biomedical vision models via diffusion image editing
Fernando Pérez-García, Sam Bond-Taylor, Pedro Sanchez, Boris van Breugel, Daniel Coelho de Castro, Harshita Sharma, Valentina Salvatelli, Maria Teodora A Wetscherek, Hannah CM Richardson, Lungren Matthew, Aditya Nori, Javier Alvarez-Valle, Ozan Oktay, Maximilian Ilse*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2312.12865)] [[Project](https://huggingface.co/microsoft/radedit)]

#### AdaDiffSR: Adaptive Region-aware Dynamic acceleration Diffusion Model for Real-World Image Super-Resolution
Yuanting Fan, Chengxu Liu, Nengzhong Yin, Changlong Gao, Xueming Qian*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[paper](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/01944.pdf)] [[Video](https://www.youtube.com/watch?v=UcmJI3Cd9UM)]

#### Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation
Xuelu Feng, Dongdong Chen, Junsong Yuan, Chunming Qiao, Gang Hua, Zixin Zhu*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2403.12042)] [[Code](https://github.com/buxiangzhiren/VD-IT)] [[Video](https://youtu.be/da-Fs5-ZyLc)]

#### Co-synthesis of Histopathology Nuclei Image-Label Pairs using a Context-Conditioned Joint Diffusion Model
Seonghui Min, Hyun-Jic Oh, Won-Ki Jeong*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2407.14434)]

#### MVDD: Multi-View Depth Diffusion Models
Zhen Wang*, Qiangeng Xu, Feitong Tan, Menglei Chai, Shichen Liu, Rohit Pandey, Sean Fanello, Achuta Kadambi, Yinda Zhang

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2312.04875)] [[Project](https://mvdepth.github.io/)]

#### EBDM: Exemplar-guided Image Translation with Brownian-bridge Diffusion Models
Eungbean Lee, Somi Jeong, Kwanghoon Sohn*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2205.07680)]

#### DreamDrone: Text-to-Image Diffusion Models are Zero-shot Perpetual View Generators
Hanyang Kong*, Dongze Lian, Michael Bi Mi, Xinchao Wang*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/html/2312.08746v3)] [[Project](https://hyokong.github.io/dreamdrone-page/)] [[Code](https://github.com/HyoKong/DreamDrone)] [[Demo](https://huggingface.co/spaces/imsuperkong/dreamdrone)]

#### Harnessing Text-to-Image Diffusion Models for Category-Agnostic Pose Estimation
Duo Peng, Zhengbo Zhang, Ping Hu, Qiuhong Ke, David Yau, Jun Liu*

![Oral Badge](https://img.shields.io/badge/Oral-blue) [[paper](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/02103.pdf)]

#### M2D2M: Multi-Motion Generation from Text with Discrete Diffusion Models
Seunggeun Chi*, Hyung-gun Chi, Hengbo Ma, Nakul Agarwal, Faizan Siddiqui, Karthik Ramani*, Kwonjoon Lee*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2407.14502)] [[Video](https://www.youtube.com/watch?v=DERy31VEK2g)]

#### Shapefusion: 3D localized human diffusion models
Rolandos Alexandros Potamias*, Michael Tarasiou, Stylianos Ploumpis, Stefanos Zafeiriou

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2403.19773) [[Project](https://rolpotamias.github.io/Shapefusion/)]

#### Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing
Wonjun Kang, Kevin Galim, Hyung Il Koo*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2403.09468) [[Code](https://github.com/furiosa-ai/eta-inversion)] [[Video](https://www.youtube.com/watch?v=NwqK9p4GKlo)]

#### MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization
Tianchen Zhao*, Xuefei Ning, Tongcheng Fang, Enshu Liu, Guyue Huang, Zinan Lin, Shengen Yan, Guohao Dai, Yu Wang

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2405.17873) [[Project](https://a-suozhang.xyz/mixdq.github.io/)] [[Code](https://github.com/A-suozhang/MixDQ)] [[Huggingface](https://huggingface.co/nics-efc/MixDQ)]

#### RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models
Bowen Zhang, Yiji Cheng, Chunyu Wang*, Ting Zhang, Jiaolong Yang, Yansong Tang, Feng Zhao, Dong Chen, Baining Guo

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://www.arxiv.org/abs/2407.06938) [[Project](https://rodinhd.github.io/)] [[Code](https://github.com/RodinHD/RodinHD)]

#### A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting
Wouter Van Gansbeke*, Bert De Brabandere

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2401.10227) [[Code](https://github.com/segments-ai/latent-diffusion-segmentation)]

#### Lego: Learning to Disentangle and Invert Personalized Concepts Beyond Object Appearance in Text-to-Image Diffusion Models
Saman Motamed*, Danda Pani Paudel, Luc Van Gool

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2311.13833) [[Project](https://sam-motamed.github.io/projects/lego)]

#### IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation
Yuanhao Zhai*, Kevin Lin, Linjie Li, Chung-Ching Lin, Jianfeng Wang, Zhengyuan Yang, David Doermann, Junsong Yuan, Zicheng Liu, Lijuan Wang

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2407.10937) [[Code](https://github.com/yhZhai/idol)]

#### DCDM: Diffusion-Conditioned-Diffusion Model for Scene Text Image Super-Resolution
Shrey Singh*, Prateek Keserwani, Masakazu Iwamura*, Partha Pratim Roy

![Poster Badge](https://img.shields.io/badge/Poster-purple) [paper]([DCDM: Diffusion-Conditioned-Diffusion Model for Scene Text Image Super-Resolution](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/02357.pdf)) [[Code](https://github.com/shreygithub/DCDM)]

#### DreamMover: Leveraging the Prior of Diffusion Models for Image Interpolation with Large Motion
Liao Shen, Tianqi Liu, Huiqiang Sun, Xinyi Ye, Baopu Li, Jianming Zhang, Zhiguo Cao*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://www.arxiv.org/abs/2409.09605) [Code](https://github.com/leoShen917/DreamMover)

#### Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators
Yifan Pu*, Zhuofan Xia, Jiayi Guo, Dongchen Han, Qixiu Li, Duo Li, Yuhui Yuan, Ji Li, Yizeng Han, Shiji Song, Gao Huang*, Xiu Li*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2408.05710) [Code](https://github.com/LeapLabTHU/Attention-Mediators)

#### Diffusion Model is a Good Pose Estimator from 3D RF-Vision
Junqiao Fan, Jianfei Yang*, Yuecong Xu, Lihua Xie

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2403.16198)]

#### MVDiffHD: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction
Shitao Tang*, Jiacheng Chen, Dilin Wang, Chengzhou Tang, Fuyang Zhang, Yuchen Fan, Vikas Chandra, Yasutaka Furukawa, Rakesh Ranjan

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2402.12712)] [[Project](https://mvdiffusion-plusplus.github.io/)] [[Code](https://github.com/Tangshitao/MVDiffusion_plusplus)]

#### Diffusion-Generated Pseudo-Observations for High-Quality Sparse-View Reconstruction
Xinhang Liu*, Jiaben Chen, Shiu-Hong Kao, Yu-Wing Tai, Chi-Keung Tang

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2305.15171)] [[Project](https://xinhangliu.com/deceptive-nerf-3dgs)]

#### Memory-Efficient Fine-Tuning for Quantized Diffusion Model
Hyogon Ryu, Seohyun Lim, Hyunjung Shim*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2401.04339) [[Code](https://github.com/ugonfor/TuneQDM)] [[Poster](https://eccv.ecva.net/media/PosterPDFs/ECCV%202024/1781.png?t=1727550795.317845)]

#### COIN: Control-Inpainting Diffusion Prior for Human and Camera Motion Estimation
Jiefeng Li*, Ye Yuan, Davis Rempe, Haotian Zhang, Pavlo Molchanov, Cewu Lu, Jan Kautz, Umar Iqbal*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2408.16426)

#### FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior
Zhekai Chen, Wen Wang, Zhen Yang, Zeqing Yuan, Hao Chen*, Chunhua Shen*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2407.04947) [[Code](https://github.com/aim-uofa/FreeCompose)] [[Poster](https://eccv.ecva.net/media/PosterPDFs/ECCV%202024/297.png?t=1725802844.8353653)] [[Slides](https://eccv.ecva.net/media/eccv-2024/Slides/297.pdf)]

#### WildVidFit: Video Virtual Try-On in the Wild via Image-Based Controlled Diffusion Models
Zijian He, Peixin Chen, Guangrun Wang, Guanbin Li*, Philip Torr, Liang Lin

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2407.10625) [[Project](https://wildvidfit-project.github.io/)] [[Poster](https://eccv.ecva.net/media/PosterPDFs/ECCV%202024/2494.png?t=1726192744.396735)]

#### RegionDrag: Fast Region-Based Image Editing with Diffusion Models
Jingyi Lu, Xinghui Li, Kai Han*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2407.18247) [[Project](https://visual-ai.github.io/regiondrag/)] [[Demo](https://colab.research.google.com/drive/1pnq9t_1zZ8yL_Oba20eBLVZLp3glniBR?usp=sharing)] [[Code](https://github.com/Visual-AI/RegionDrag)] [[Slides](https://eccv.ecva.net/media/eccv-2024/Slides/1756_57T8SZT.pdf)] [[Poster](https://eccv.ecva.net/media/PosterPDFs/ECCV%202024/1756.png?t=1726153953.1186402)] [[Dataset](https://visual-ai.github.io/regiondrag/#dataset)]

#### MagDiff: Multi-Alignment Diffusion for High-Fidelity Video Generation and Editing
Haoyu Zhao, Tianyi Lu, Jiaxi Gu, Xing Zhang, Qingping Zheng, Zuxuan Wu*, Hang Xu, Yu-Gang Jiang

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://arxiv.org/abs/2311.17338)] [[Poster](https://eccv.ecva.net/media/PosterPDFs/ECCV%202024/665.png?t=1726062032.621387)]

#### Mutual Learning for Acoustic Matching and Dereverberation via Visual Scene-driven Diffusion
Jian Ma, Wenguan Wang*, Yi Yang, Feng Zheng

![Poster Badge](https://img.shields.io/badge/Poster-purple) [[arXiv](https://www.arxiv.org/abs/2407.10373)] [[Project](https://hechang25.github.io/MVSD/)] [[Poster](https://eccv.ecva.net/media/PosterPDFs/ECCV%202024/1096.png?t=1726061514.9066596)] [[Code](https://github.com/hechang25/MVSD)]

#### SEDiff: Structure Extraction for Domain Adaptive Depth Estimation via Denoising Diffusion Models
Dongseok Shim*, Hyoun Jin Kim*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/02829.pdf) [[Poster](https://eccv.ecva.net/media/PosterPDFs/ECCV%202024/973.png?t=1726087766.2618341)]

#### MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model
Muyao Niu, Xiaodong Cun*, Xintao Wang, Yong Zhang, Ying Shan, Yinqiang Zheng*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2405.20222) [[Project]()] [[Code]()]

#### RoofDiffusion: Constructing Roofs from Severely Corrupted Point Data via Diffusion
Kyle Shih-Huang Lo*, Jorg Peters, Eric Spellman

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2404.09290) [[Project]()] [[Code]()]

#### L-DiffER: Single Image Reflection Removal with Language-based Diffusion Model
Yuchen Hong*, Haofeng Zhong*, Shuchen Weng, Jinxiu S Liang, Boxin Shi

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://assets.ctfassets.net/yreyglvi5sud/4uhN2PF7UyMGgiWQgCMSgi/41f4f9f46fbfa370b3ccd8fbcadbc2b3/2024______Hong_ECCV.pdf) [[Project]()] [[Code]()]

#### BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion
Xuan Ju*, Xian Liu, Xintao Wang*, Yuxuan Bian, Ying Shan, Qiang Xu*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2403.06976) [[Project]()] [[Code]()]

#### Realistic Human Motion Generation with Cross-Diffusion Models
Zeping Ren, Shaoli Huang*, Xiu Li*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2312.10993) [[Project]()] [Code]()

#### ZigMa: A DiT-style Zigzag Mamba Diffusion Model
Vincent Tao Hu*, Stefan A Baumann, Ming Gui, Olga Grebenkova, Pingchuan Ma, Johannes S Fischer, Bjorn Ommer

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2403.13802) [[Project]()] [Code]()

#### EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion
Guangyao Zhai*, Evin Pınar Örnek, Dave Zhenyu Chen, Ruotong Liao, Yan Di, Nassir Navab, Federico Tombari, Benjamin Busam

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2405.00915) [[Project]()] [Code]()

#### Safe-Sim: Safety-Critical Closed-Loop Traffic Simulation with Diffusion-Controllable Adversaries
Wei-Jer Chang*, Francesco Pittaluga, Masayoshi Tomizuka, Wei Zhan, Manmohan Chandraker

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2401.00391) [[Project]()] [Code]()

#### Implicit Concept Removal of Diffusion Models
Zhili Liu*, Kai Chen, Yifan Zhang, Jianhua Han, Lanqing Hong, Hang Xu, Zhenguo Li, Dit-Yan Yeung, James Kwok

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2310.05873) [[Project]()] [Code]()

#### GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image
Xiao Fu*, Wei Yin, Mu Hu, Kaixuan Wang, Yuexin Ma, Ping Tan, Shaojie Shen, Dahua Lin, Xiaoxiao Long

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2403.12013) [[Project]()] [Code]()

#### Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions
Fabio Tosi, Pierluigi Zama Ramirez, Matteo Poggi*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2407.16698) [[Project]()] [Code]()

#### Lazy Diffusion Transformer for Interactive Image Editing
Yotam Nitzan*, Zongze Wu, Richard Zhang, Eli Shechtman, Danny Cohen-Or, Taesung Park, Michaël Gharbi

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2404.12382) [[Project]()] [Code]()

#### ComboVerse: Compositional 3D Assets Creation Using Spatially-Aware Diffusion Guidance
Yongwei Chen, Tengfei Wang, Tong Wu, Xingang Pan, Kui Jia*, Ziwei Liu

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2403.12409) [[Project]()] [Code]()

#### 4Diff: 3D-Aware Diffusion Model for Third-to-First Viewpoint Translation
Feng Cheng*, Mi Luo*, Huiyu Wang, Alex Dimakis, Lorenzo Torresani, Gedas Bertasius, Kristen Grauman

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://eccv.ecva.net/virtual/2024/poster/1665) [[Project]()] [Code]()

#### Enhancing Diffusion Models with Text-Encoder Reinforcement Learning
Chaofeng Chen*, Annan Wang, Haoning Wu, Liang Liao, Wenxiu Sun, Qiong Yan, Weisi Lin*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2311.15657) [[Project]()] [Code]()

#### Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation
Peng Jin*, Hao Li, Zesen Cheng, Kehan Li, Runyi Yu, Chang Liu*, Xiangyang Ji, Li Yuan*, Jie Chen

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2407.10528) [[Project]()] [Code]()

#### MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion
Lehong Wu*, Lilang Lin, Jiahang Zhang, Yiyang Ma, Jiaying Liu*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://www.arxiv.org/abs/2409.10473) [[Project]()] [Code]()

#### Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models
Ruibin Li*, Ruihuang Li, Song Guo, Lei Zhang

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2403.11105) [[Project]()] [Code]()

#### StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models
Wen Li*, Muyuan Fang, Cheng Zou, Biao Gong, Ruobing Zheng, Meng Wang, Jingdong Chen, Ming Yang

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2409.02543) [[Project]()] [Code]()

#### NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model
Zhongqun Zhang*, Hengfei Wang, Ziwei Yu, Yihua Cheng*, Angela Yao, Hyung Jin Chang

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2407.12727) [[Project]()] [Code]()

#### Diff-Tracker: Text-to-Image Diffusion Models are Unsupervised Trackers
Zhengbo Zhang*, Li Xu, Duo Peng, Hossein Rahmani, Jun Liu*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2407.08394) [[Project]()] [Code]()

#### Transferable 3D Adversarial Shape Completion using Diffusion Models
Xuelong Dai*, Bin Xiao

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](http://arxiv.org/abs/2407.10077) [[Project]()] [Code]()

#### Distilling Diffusion Models into Conditional GANs
MinGuk Kang*, Richard Zhang, Connelly Barnes, Sylvain Paris, Suha Kwak, Jaesik Park, Eli Shechtman, Jun-Yan Zhu, Taesung Park*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2405.05967) [[Project]()] [Code]()

#### You Only Need One Step: Fast Super-Resolution with Stable Diffusion via Scale Distillation
Mehdi Noroozi*, Isma Hadji*, Brais Martinez*, Adrian Bulat*, Georgios Tzimiropoulos*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2401.17258) [[Project]()] [Code]()

#### Optimizing Diffusion Models for Joint Trajectory Prediction and Controllable Generation
Yixiao Wang*, Chen Tang, Lingfeng Sun, Simone Rossi, Yichen Xie, Chensheng Peng, Thomas Hannagan, Stefano Sabatini, Nicola Poerio, Masayoshi TOMIZUKA, Wei Zhan

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2408.00766) [[Project]()] [Code]()

#### Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models
Xiaoyu Zhu*, Hao Zhou, Pengfei Xing, Long Zhao, Hao Xu, Junwei Liang, Alexander G. Hauptmann, Ting Liu, Andrew Gallagher

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2407.13642) [[Project]()] [Code]()

#### D-SCo: Dual-Stream Conditional Diffusion for Monocular Hand-Held Object Reconstruction
Bowen Fu*, Gu Wang*, Chenyangguang Zhang, Yan Di, Ziqin Huang, Zhiying Leng, Fabian Manhardt, Xiangyang Ji*, Federico Tombari*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2311.14189) [[Project]()] [Code]()

#### Probabilistic Weather Forecasting with Deterministic Guidance-based Diffusion Model
Donggeun Yoon, Minseok Seo, Doyi Kim, Yeji Choi, Donghyeon Cho*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/04326.pdf) [[Project]()] [Code]()

#### Diffusion-Driven Data Replay: A Novel Approach to Combat Forgetting in Federated Class Continual Learning
Jinglin Liang, Jin Zhong, Hanlin Gu, Zhongqi Lu, Xingxing Tang, Gang Dai, Shuangping Huang*, Lixin Fan, Qiang Yang

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2409.01128) [Code]()

#### View Selection for 3D Captioning via Diffusion Ranking
Tiange Luo*, Justin Johnson, Honglak Lee

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2404.07984) [Code]()

#### OmniSSR: Zero-shot Omnidirectional Image Super-Resolution using Stable Diffusion Model
Runyi Li*, Xuhan Sheng, Weiqi Li, Jian Zhang*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2404.10312) [Code]()

#### UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models
Yiming Zhao*, Zhouhui Lian*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2312.04884) [Code]()

#### OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models
Zhe Kong*, Yong Zhang*, Tianyu Yang, Tao Wang, Kaihao Zhang, Bizhu Wu, Guanying Chen, Wei Liu, Wenhan Luo*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2403.10983) [Code]()

#### CloudFixer: Test-Time Adaptation for 3D Point Clouds via Diffusion-Guided Geometric Transformation
Hajin Shim, Changhun Kim, Eunho Yang*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2407.16193) [Code]()

#### DreamDiffusion: High-Quality EEG-to-Image Generation with Temporal Masked Signal Modeling and CLIP Alignment
Yunpeng Bai*, Xintao Wang, Yan-Pei Cao, Yixiao Ge, Chun Yuan, Ying Shan

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2306.16934) [Code]()

#### SCP-Diff: Spatial-Categorical Joint Prior for Diffusion Based Semantic Image Synthesis
Huan-ang Gao, Mingju Gao, Jiaju Li, Wenyi Li, Rong Zhi, Hao Tang, Hao Zhao*


[arXiv](https://arxiv.org/abs/2403.09638) [Code]()

#### PixArt-Sigma: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Junsong Chen, Chongjian GE, Enze Xie*, Yue Wu, Lewei Yao, Xiaozhe Ren, Zhongdao Wang, Ping Luo, Huchuan Lu, Zhenguo Li


[arXiv](https://arxiv.org/abs/2403.04692) [Code]()

#### Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models
Yixuan Ren*, Yang Zhou, Jimei Yang, Jing Shi, Difan Liu, Feng Liu, Mingi Kwon, Abhinav Shrivastava


[arXiv](https://arxiv.org/abs/2402.14780) [Code]()

#### ∞-Brush: Controllable Large Image Synthesis with Diffusion Models in Infinite Dimensions
Minh-Quan Le*, Alexandros Graikos, Srikar Yellapragada, Rajarsi Gupta, Joel Saltz, Dimitris Samaras


[arXiv](https://arxiv.org/abs/2407.14709) [Code]()

#### ProTIP: Probabilistic Robustness Verification on Text-to-Image Diffusion Models against Stochastic Perturbation
Yi Zhang, Yun Tang, Wenjie Ruan, Xiaowei Huang, Siddartha Khastgir, Paul A Jennings, Xingyu Zhao*


[arXiv](https://arxiv.org/abs/2402.15429) [Code]()

#### Latent Diffusion Prior Enhanced Deep Unfolding for Snapshot Spectral Compressive Imaging
Zongliang Wu*, Ruiying Lu, Ying Fu, Xin Yuan


[arXiv](https://arxiv.org/abs/2311.14280) [Code]()

#### Learning Diffusion Models for Multi-View Anomaly Detection
Chieh Liu*, Yu-Min Chu*, Ting-I Hsieh*, Hwann-Tzong Chen*, Tyng-Luh Liu*


[arXiv](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/04907.pdf) [Code]()

#### Diff3DETR: Agent-based Diffusion Model for Semi-supervised 3D Object Detection
Jiacheng Deng*, Jiahao Lu, Tianzhu Zhang


[arXiv](https://arxiv.org/abs/2408.00286) [Code]()

#### Layout-Corrector: Alleviating Layout Sticking Phenomenon in Discrete Diffusion Model
Shoma Iwai*, Atsuki Osanai, Shunsuke Kitada, Shinichiro Omachi


[arXiv](https://arxiv.org/abs/2409.16689) [Code]()

#### Kinetic Typography Diffusion Model
Seonmi Park, Inhwan Bae, Seunghyun Shin, Hae-Gon Jeon*


[arXiv](https://arxiv.org/abs/2407.10476) [Code]()

#### GroupDiff: Diffusion-based Group Portrait Editing
Yuming Jiang, Nanxuan Zhao*, Qing Liu, Krishna Kumar Singh, Shuai Yang, Chen Change Loy, Ziwei Liu


[arXiv](https://arxiv.org/abs/2409.14379) [Code]()

#### TransFusion -- A Transparency-Based Diffusion Model for Anomaly Detection
Matic Fučka*, Vitjan Zavrtanik, Danijel Skočaj


[arXiv](https://arxiv.org/abs/2311.09999) [Code]()

#### Test-Time Stain Adaptation with Diffusion Models for Histopathology Image Classification
Cheng-Chang Tsai*, Yuan-Chih Chen, Chun-Shien Lu*


[arXiv](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/05175.pdf) [Code]()

#### Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation
Lanqing Guo, Yingqing HE, Haoxin Chen, Menghan Xia, Xiaodong Cun, Yufei Wang, Siyu Huang, Yong Zhang, Xintao Wang, Qifeng Chen, Ying Shan, Bihan Wen*


[arXiv](https://arxiv.org/abs/2402.10491) [Code]()

#### R3D-AD: Reconstruction via Diffusion for 3D Anomaly Detection
Zheyuan Zhou, Le Wang, Naiyu Fang, Zili Wang, Lemiao Qiu*, Shuyou Zhang


[arXiv](https://arxiv.org/abs/2407.10862) [Code]()

#### Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models
Qinyu Yang, Haoxin Chen, Yong Zhang*, Menghan Xia, Xiaodong Cun, Zhixun Su*, Ying Shan


[arXiv](https://www.arxiv.org/abs/2407.10285) [Code]()

#### Revisiting Feature Disentanglement Strategy in Diffusion Training and Breaking Conditional Independence Assumption in Sampling
Wonwoong Cho*, Hareesh Ravi*, Midhun Harikumar, Vinh Khuc, Krishna Kumar Singh, Jingwan Lu, David Iseri Inouye*, Ajinkya Kale*


[arXiv](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/05452.pdf) [Code]()

#### MaxFusion: Plug&Play Multi-Modal Generation in Text-to-Image Diffusion Models
Nithin Gopalakrishnan Nair*, Jeya Maria Jose Valanarasu, Vishal Patel


[arXiv](https://arxiv.org/abs/2404.09977) [Code]()

#### DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control
Yuru Jia, Lukas Hoyer, Shengyu Huang, Tianfu Wang, Luc Van Gool, Konrad Schindler, Anton Obukhov*


[arXiv](https://arxiv.org/abs/2312.03048) [Code]()

#### Surf-D: Generating High-Quality Surfaces of Arbitrary Topologies Using Diffusion Models
Zhengming Yu*, Zhiyang Dou, Xiaoxiao Long, Cheng Lin, Zekun Li, Yuan Liu, Norman Müller, Taku Komura, Marc Habermann, Christian Theobalt, Xin Li, Wenping Wang*


[arXiv](https://arxiv.org/abs/2311.17050) [Code]()

#### Diffusion-Refined VQA Annotations for Semi-Supervised Gaze Following
Qiaomu Miao*, Alexandros Graikos, Jingwei Zhang, Sounak Mondal, Minh Hoai, Dimitris Samaras


[arXiv](https://arxiv.org/abs/2406.02774) [Code]()

#### Concept Sliders: LoRA Adaptors for Precise Control in Diffusion Models
Rohit Gandikota*, Joanna Materzynska, Tingrui Zhou, Antonio Torralba, David Bau


[arXiv](https://arxiv.org/abs/2311.12092) [Code]()

#### AutoDIR: Automatic All-in-One Image Restoration with Latent Diffusion
Yitong Jiang*, Zhaoyang Zhang, Tianfan Xue, Jinwei Gu*


[arXiv](https://arxiv.org/abs/2310.10123) [Code]()

#### Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers
Chi-Pin Huang*, Kai-Po Chang, Chung-Ting Tsai, Yung-Hsuan Lai, Fu-En Yang, Yu-Chiang Frank Wang


[arXiv](https://arxiv.org/abs/2311.17717) [Code]()

#### Temporal Residual Guided Diffusion Framework for Event-Driven Video Reconstruction
Lin Zhu*, Yunlong Zheng, Yijun Zhang, Xiao Wang, Lizhi Wang, Hua Huang


[arXiv](https://arxiv.org/abs/2407.10636) [Code]()

#### Free-ATM: Harnessing Free Attention Masks for Representation Learning on Diffusion-Generated Images
David Junhao Zhang*, Mutian Xu, Jay Zhangjie Wu, Chuhui Xue, Wenqing Zhang, Xiaoguang Han, Song Bai, Mike Zheng Shou*


[arXiv](https://arxiv.org/abs/2308.06739) [Code]()

#### AlignDiff: Aligning Diffusion Models for General Few-Shot Segmentation
Ri-Zhao Qiu*, Yu-Xiong Wang, Kris Hauser


[arXiv](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/05794.pdf) [Code]()

#### Tree-D Fusion: Simulation-Ready Tree Dataset from Single Images with Diffusion Priors
Jae Joong Lee, Bosheng Li, Sara M Beery, Jonathan Huang, Songlin Fei, Raymond A. Yeh, Bedrich Benes*


[arXiv](https://www.arxiv.org/abs/2407.10330) [Code]()

#### DomainFusion: Generalizing To Unseen Domains with Latent Diffusion Models
Yuyang Huang, Yabo Chen, Yuchen Liu, xiaopeng zhang*, Wenrui Dai*, Hongkai Xiong, Qi Tian


[arXiv](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/05806.pdf) [Code]()

#### Object-Conditioned Energy-Based Attention Map Alignment in Text-to-Image Diffusion Models
Yasi Zhang*, Peiyu Yu, Ying Nian Wu


[arXiv](https://arxiv.org/abs/2404.07389) [Code]()

#### Three Things We Need to Know About Transferring Stable Diffusion to Visual Dense Prediciton Tasks
Manyuan Zhang*, Guanglu Song, Xiaoyu Shi, Yu Liu, Hongsheng Li


[arXiv](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/05837.pdf) [Code]()

#### SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models
Yuwei Guo, Ceyuan Yang*, Anyi Rao, Maneesh Agrawala, Dahua Lin*, Bo Dai*


[arXiv](https://arxiv.org/abs/2311.16933) [Code]()

#### Diffusion Reward: Learning Rewards via Conditional Video Diffusion
Tao Huang*, Guangqi Jiang, Yanjie Ze, Huazhe Xu*


[arXiv](https://arxiv.org/abs/2312.14134) [Code]()

#### SpeedUpNet: A Plug-and-Play Adapter Network for Accelerating Text-to-Image Diffusion Models
Weilong Chai*, Dandan Zheng, Jiajiong Cao, Zhiquan Chen, Changbao Wang, Chenguang Ma


[arXiv](https://arxiv.org/abs/2312.08887) [Code]()

#### DECap: Towards Generalized Explicit Caption Editing via Diffusion Mechanism
Zhen Wang, Xinyun Jiang, Jun Xiao, Tao Chen, Long Chen*


[arXiv](https://arxiv.org/abs/2311.14920) [Code]()

#### DiffuX2CT: Diffusion Learning to Reconstruct CT Images from Biplanar X-Rays
Xuhui Liu, Zhi Qiao, Runkun Liu, Hong Li, Xiantong Zhen*, Zhen Qian, Juan Zhang*, Baochang Zhang


[arXiv](https://arxiv.org/abs/2407.13545) [Code]()

#### MoVideo: Motion-Aware Video Generation with Diffusion Models
Jingyun Liang*, Yuchen Fan, Kai Zhang*, Radu Timofte, Luc Van Gool, Rakesh Ranjan


[arXiv](https://arxiv.org/abs/2311.11325) [Code]()

#### Learn to Optimize Denoising Scores: A Unified and Improved Diffusion Prior for 3D Generation
Xiaofeng Yang*, Yiwen Chen, Cheng Chen, Chi Zhang, Yi Xu, Xulei Yang, Fayao Liu, Guosheng Lin


[arXiv](https://arxiv.org/abs/2312.04820) [Code]()

#### Motion-Guided Latent Diffusion for Temporally Consistent Real-world Video Super-resolution
Xi Yang*, Chenhang He, Jianqi Ma, Lei Zhang


[arXiv](https://arxiv.org/abs/2312.00853) [Code]()

#### DeCo: Decoupled Human-Centered Diffusion Video Editing with Motion Consistency
Xiaojing Zhong, Xinyi Huang, Xiaofeng Yang, Guosheng Lin*, Qingyao Wu*


[arXiv](https://arxiv.org/abs/2408.07481) [Code]()

#### Unleashing the Potential of the Semantic Latent Space in Diffusion Models for Image Dehazing
Zizheng Yang, Hu Yu, Bing Li, Jinghao Zhang, Jie Huang, Feng Zhao*


[arXiv](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/06072.pdf) [Code]()

#### PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion
Guansong Lu*, Yuanfan Guo, Jianhua Han, Minzhe Niu, Yihan Zeng, Songcen Xu, Zeyi Huang, Zhao Zhong, Wei Zhang, Hang Xu


[arXiv](https://arxiv.org/abs/2312.16486) [Code]()

#### Closed-Loop Unsupervised Representation Disentanglement with $\\beta$-VAE Distillation and Diffusion Probabilistic Feedback
Xin Jin*, Bohan Li*, Baao Xie, Wenyao Zhang, Jinming Liu, Ziqiang Li, Tao Yang, Wenjun Zeng


[arXiv](https://arxiv.org/abs/2402.02346) [Code]()

#### Rethinking Video Deblurring with Wavelet-Aware Dynamic Transformer and Diffusion Model
Chen Rao, Guangyuan Li, Zehua Lan, Jiakai Sun, Junsheng Luan, Wei Xing*, Lei Zhao*, Huaizhong Lin*, Jianfeng Dong, Dalong Zhang


[arXiv](https://arxiv.org/abs/2408.13459) [Code]()

#### D4-VTON: Dynamic Semantics Disentangling for Differential Diffusion based Virtual Try-On
Zhaotong Yang, Zicheng Jiang, Xinzhe Li, Huiyu Zhou, Junyu Dong, Huaidong Zhang, Yong Du*


[arXiv](https://arxiv.org/abs/2407.15111) [Code]()

#### AdvDiff: Generating Unrestricted Adversarial Examples using Diffusion Models
Xuelong Dai*, Kaisheng Liang, Bin Xiao


[arXiv](https://arxiv.org/abs/2307.12499) [Code]()

#### DiffPMAE: Diffusion Masked Autoencoders for Point Cloud Reconstruction
Yanlong LI*, Chamara Madarasingha, Kanchana Thilakarathna


[arXiv](https://arxiv.org/abs/2312.03298) [Code]()

#### DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Jinbo Xing*, Menghan Xia, Yong Zhang, Haoxin Chen, Wangbo Yu, Hanyuan Liu, Gongye Liu, Xintao Wang, Ying Shan, Tien-Tsin Wong


[arXiv](https://arxiv.org/abs/2310.12190) [Code]()

#### Text-Anchored Score Composition: Tackling Condition Misalignment in Text-to-Image Diffusion Models
Luozhou Wang*, Guibao Shen, Wenhang Ge, Guangyong Chen, Yijun Li, Yingcong Chen*


[arXiv](https://arxiv.org/abs/2306.14408) [Code]()

#### LightenDiffusion: Unsupervised Low-Light Image Enhancement with Latent-Retinex Diffusion Models
Hai Jiang, Ao Luo, Xiaohong Liu, Songchen Han, Shuaicheng Liu*


[arXiv](https://arxiv.org/abs/2407.08939) [Code]()

#### DNI: Dilutional Noise Initialization for Diffusion Video Editing
Sunjae Yoon, Gwanhyeong Koo, Ji Woo Hong, Chang D. Yoo*


[arXiv](https://www.arxiv.org/abs/2409.13037) [Code]()

#### Diffusion-Guided Weakly Supervised Semantic Segmentation
Sung-Hoon Yoon, Hoyong Kwon, Jaeseok Jeong, Daehee Park, Kuk-Jin Yoon*


[arXiv](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/06482.pdf) [Code]()

#### Improving Virtual Try-On with Garment-focused Diffusion Models
Siqi Wan, Yehao Li, Jingwen Chen, Yingwei Pan*, Ting Yao, Yang Cao, Tao Mei


[arXiv](https://arxiv.org/abs/2409.08258) [Code]()

#### Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control
Yue Han*, Junwei Zhu, Keke He, Xu Chen, Yanhao Ge, Wei Li, Xiangtai Li, Jiangning Zhang, Chengjie Wang, Yong Liu


[arXiv](https://arxiv.org/abs/2405.12970) [Code]()

#### Diffusion Models as Optimizers for Efficient Planning in Offline RL
Renming Huang, Yunqiang Pei, Guoqing Wang*, Yangming Zhang, Yang Yang, Peng Wang, Heng Tao Shen


[arXiv](https://arxiv.org/abs/2407.16142) [Code]()

#### HiDiffusion: Unlocking Higher-Resolution Creativity and Efficiency in Pretrained Diffusion Models
Shen Zhang, Zhaowei CHEN, Zhenyu Zhao, Yuhao Chen, Yao Tang, Jiajun Liang*


[arXiv](https://arxiv.org/abs/2311.17528) [Code]()

#### Dolfin: Diffusion Layout Transformers without Autoencoder
Yilin Wang, Zeyuan Chen, Liangjun Zhong, Zheng Ding, Zhuowen Tu*


[arXiv](https://arxiv.org/abs/2310.16305) [Code]()

#### StructLDM: Structured Latent Diffusion for 3D Human Generation
Tao Hu, Fangzhou Hong, Ziwei Liu*


[arXiv](https://arxiv.org/abs/2404.01241) [Code]()

#### Beyond the Contact: Discovering Comprehensive Affordance for 3D Objects from Pre-trained 2D Diffusion Models
Hyeonwoo Kim, Sookwan Han, Patrick Kwon, Hanbyul Joo*


[arXiv](https://arxiv.org/abs/2401.12978) [Code]()

#### DIFFender: Diffusion-Based Adversarial Defense against Patch Attacks
Caixin Kang*, Yinpeng Dong, Zhengyi Wang, Shouwei Ruan, Yubo Chen, Hang Su*, Xingxing Wei*


[arXiv](https://arxiv.org/abs/2306.09124) [Code]()

#### Hybrid Video Diffusion Models with 2D Triplane and 3D Wavelet Representation
Kihong Kim, Haneol Lee, Jihye Park, Seyeon Kim, Kwang Hee Lee, Seungryong Kim*, Jaejun Yoo*


[arXiv](https://arxiv.org/abs/2402.13729) [Code]()

#### Efficient Diffusion-Driven Corruption Editor for Test-Time Adaptation
Yeongtak Oh, Jonghyun Lee, Jooyoung Choi, Dahuin Jung, Uiwon Hwang*, Sungroh Yoon*


[arXiv](https://arxiv.org/abs/2403.10911) [Code]()

#### Adaptive Multi-modal Fusion of Spatially Variant Kernel Refinement with Diffusion Model for Blind Image Super-Resolution
Junxiong Lin*, Yan Wang, Zeng Tao, Boyang Wang, Qing Zhao, Haoran Wang, Xuan Tong, Xinji Mai, Yuxuan Lin, Wei Song, Jiawen Yu, Shaoqi Yan, Wenqiang Zhang


[arXiv](https://arxiv.org/abs/2403.05808) [Code]()

#### Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models
Chao Gong*, Kai Chen, Zhipeng Wei, Jingjing Chen*, Yu-Gang Jiang


[arXiv](https://arxiv.org/abs/2407.12383) [Code]()

#### Length-Aware Motion Synthesis via Latent Diffusion
Alessio Sampieri*, Alessio Palma, Indro Spinelli, Fabio Galasso


[arXiv](https://arxiv.org/abs/2407.11532) [Code]()

#### Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model
Danni Yang, Ruohan Dong, Jiayi Ji, Yiwei Ma, Haowei Wang, Xiaoshuai Sun*, Rongrong Ji


[arXiv](https://arxiv.org/abs/2407.05352) [Code]()

#### Improving image synthesis with diffusion-negative sampling
Alakh Desai*, Nuno Vasconcelos


[arXiv]() [Code]()

#### SignGen: End-to-End Sign Language Video Generation with Latent Diffusion
Fan Qi*, Yu Duan, Changsheng Xu, Huaiwen Zhang*


[arXiv](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/06988.pdf) [Code]()

#### Diffusion Prior-Based Amortized Variational Inference for Noisy Inverse Problems
Sojin Lee, Dogyun Park, Inho Kong, Hyunwoo J. Kim*


[arXiv](https://arxiv.org/abs/2407.16125) [Code]()

#### TetraDiffusion: Tetrahedral Diffusion Models for 3D Shape Generation
Nikolai Kalischek*, Torben Peters, Jan Dirk Wegner, Konrad Schindler


[arXiv](https://arxiv.org/abs/2211.13220) [Code]()

#### Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts
Byeongjun Park, Hyojun Go, Jin-Young Kim, Sangmin Woo, Seokil Ham, Changick Kim*


[arXiv](https://arxiv.org/abs/2403.09176) [Code]()

#### DiffFAS: Face Anti-Spoofing via Generative Diffusion Models
Xinxu Ge, Xin Liu*, Zitong Yu*, Jingang Shi, Chun Qi, Jie Li, Heikki Kälviäinen


[arXiv](https://arxiv.org/abs/2409.08572) [Code]()

#### BK-SDM: A Lightweight, Fast, and Cheap Version of Stable Diffusion
Bo-Kyeong Kim*, Hyoung-Kyu Song, Thibault Castells, Shinkook Choi


[arXiv](https://arxiv.org/abs/2305.15798) [Code]()

#### CLIFF: Continual Latent Diffusion for Open-Vocabulary Object Detection
Wuyang Li, Xinyu Liu, Jiayi Ma, Yixuan Yuan*


[arXiv](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/07221.pdf) [Code]()

#### Gated Temporal Diffusion for Stochastic Long-term Dense Anticipation
Olga Zatsarynna*, Emad Bahrami*, Yazan Abu Farha, Gianpiero Francesca, Jürgen Gall*


[arXiv](https://arxiv.org/abs/2407.11954) [Code]()

#### MotionDirector: Motion Customization of Text-to-Video Diffusion Models
Rui Zhao, Yuchao Gu, Jay Zhangjie Wu, David Junhao Zhang, Jia-Wei Liu, weijia wu, Jussi Keppo, Mike Zheng Shou*


[arXiv](https://arxiv.org/abs/2310.08465) [Code]()

#### Post-training Quantization with Progressive Calibration and Activation Relaxing for Text-to-Image Diffusion Models
Siao Tang, Xin Wang*, Hong Chen, Chaoyu Guan, Zewen Wu, Yansong Tang, Wenwu Zhu*


[arXiv](https://arxiv.org/abs/2311.06322) [Code]()

#### Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors
Ruicheng Wang*, Jianfeng Xiang, Jiaolong Yang, Xin Tong


[arXiv](https://arxiv.org/abs/2403.11503) [Code]()

#### Exact Diffusion Inversion via Bidirectional Integration Approximation
Guoqiang Zhang*, j.p. lewis, W. Bastiaan Kleijn


[arXiv](https://arxiv.org/abs/2307.10829) [Code]()

#### Object-Centric Diffusion for Efficient Video Editing
Kumara Kahatapitiya*, Adil Karjauv, Davide Abati*, Fatih Porikli, Yuki M Asano, Amirhossein Habibian


[arXiv](https://arxiv.org/abs/2401.05735) [Code]()

#### Diffusion for Natural Image Matting
Yihan Hu*, Yiheng Lin, Wei Wang, Yao Zhao, Yunchao Wei*, Humphrey Shi


[arXiv](https://arxiv.org/abs/2312.05915) [Code]()

#### Unleashing Text-to-Image Diffusion Prior for Zero-Shot Image Captioning
Jianjie Luo, Jingwen Chen, Yehao Li, Yingwei Pan*, Jianlin Feng, Hongyang Chao, Ting Yao


[arXiv](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/07445.pdf) [Code]()

#### Factorized Diffusion: Perceptual Illusions by Noise Decomposition
Daniel Geng*, Inbum Park, Andrew Owens


[arXiv](https://arxiv.org/abs/2404.11615) [Code]()

#### To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Unsafe Images ... For Now
Yimeng Zhang*, jinghan jia, Xin Chen, Aochuan Chen, Yihua Zhang, Jiancheng Liu, Ke Ding, Sijia Liu


[arXiv](https://arxiv.org/abs/2310.11868) [Code]()

#### FontStudio: Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation
Xinzhi Mu*, Li Chen, Bohan CHEN, Shuyang Gu, Jianmin Bao, Dong Chen, Ji Li, Yuhui Yuan


[arXiv](https://arxiv.org/abs/2406.08392) [Code]()

#### One-Shot Diffusion Mimicker for Handwritten Text Generation
Gang Dai, Yifan Zhang, Quhui Ke, Qiangya Guo, Shuangping Huang*


[arXiv](https://www.arxiv.org/abs/2409.04004) [Code]()

#### Kernel Diffusion: An Alternate Approach to Blind Deconvolution
Yash Sanghvi*, Yiheng Chi, Stanley Chan


[arXiv](https://arxiv.org/abs/2312.02319) [Code]()

#### ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction
Shaozhe Hao*, Kai Han*, Zhengyao Lv, Shihao Zhao, Kwan-Yee K. Wong*


[arXiv](https://arxiv.org/abs/2407.07077) [Code]()

#### TCAN: Animating Human Images with Temporally Consistent Pose Guidance using Diffusion Models
Jeongho Kim*, Min-Jung Kim*, Junsoo Lee, Jaegul Choo*


[arXiv](http://arxiv.org/abs/2407.09012) [Code]()

#### DiffBIR: Toward Blind Image Restoration with Generative Diffusion Prior
Xinqi Lin*, Jingwen He, Ziyan Chen, Zhaoyang Lyu, Bo Dai, Fanghua Yu, Yu Qiao, Wanli Ouyang, Chao Dong*


[arXiv](https://arxiv.org/abs/2308.15070) [Code]()

#### Do text-free diffusion models learn discriminative visual representations?
Soumik Mukhopadhyay*, Matthew A Gwilliam*, Yosuke Yamaguchi, Vatsal Agarwal, Namitha Padmanabhan, Archana Swaminathan, Tianyi Zhou, Jun Ohya, Abhinav Shrivastava


[arXiv](https://arxiv.org/abs/2311.17921) [Code]()

#### LogoSticker: Inserting Logos into Diffusion Models for Customized Generation
Mingkang Zhu, Xi CHEN, Zhongdao Wang, Hengshuang Zhao*, Jiaya Jia*


[arXiv](https://arxiv.org/abs/2407.13752) [Code]()

#### ProCreate, Don't Reproduce! Propulsive Energy Diffusion for Creative Generation
Jack Lu*, Ryan Teehan*, Mengye Ren*


[arXiv](https://arxiv.org/abs/2408.02226) [Code]()

#### IntrinsicAnything: Learning Diffusion Priors for Inverse Rendering Under Unknown Illumination
Xi Chen*, Sida Peng, Dongchen Yang, Yuan Liu, Bowen Pan, Chengfei Lyu, Xiaowei Zhou*


[arXiv](https://arxiv.org/abs/2404.11593) [Code]()

#### Mixture of Efficient Diffusion Experts Through Automatic Interval and Sub-Network Selection
Alireza Ganjdanesh*, Yan Kang, Yuchen Liu, Richard Zhang, Zhe Lin, Heng Huang


[arXiv](https://arxiv.org/abs/2409.15557) [Code]()

#### Compensation Sampling for Improved Convergence in Diffusion Models
Hui Lu*, Albert Ali Salah, Ronald Poppe


[arXiv](https://arxiv.org/abs/2312.06285) [Code]()

#### Lossy Image Compression with Foundation Diffusion Models
Lucas Relic*, Roberto Azevedo, Markus Gross, Christopher Schroers*


[arXiv](https://arxiv.org/abs/2404.08580) [Code]()

#### FMBoost: Boosting Latent Diffusion with Flow Matching
Johannes S Fischer*, Ming Gui, Pingchuan Ma, Nick Stracke, Stefan Andreas Baumann, Vincent Tao Hu, Björn Ommer


[arXiv](https://arxiv.org/abs/2312.07360) [Code]()

#### Diffusion Models as Data Mining Tools
Ioannis Siglidis*, Aleksander Holynski, Alexei A. Efros, Mathieu Aubry, Shiry Ginosar


[arXiv](https://arxiv.org/abs/2408.02752) [Code]()

#### Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering
Ruofan Liang, Zan Gojcic, Merlin Nimier-David, David Acuna, Nandita Vijaykumar, Sanja Fidler, Zian Wang*


[arXiv](https://arxiv.org/abs/2408.09702) [Code]()

#### MobileDiffusion: Instant Text-to-Image Generation on Mobile Devices
Yang Zhao*, Zhisheng Xiao*, Yanwu Xu, Haolin Jia, Tingbo Hou


[arXiv](https://arxiv.org/abs/2311.16567) [Code]()

#### Osmosis: RGBD Diffusion Prior for Underwater Image Restoration
Opher Bar Nathan*, Deborah Levy, Tali Treibitz, Dan Rosenbaum


[arXiv](https://arxiv.org/abs/2403.14837) [Code]()

#### Large-scale Reinforcement Learning for Diffusion Models
Yinan Zhang*, Eric Tzeng, Yilun Du, Dmitry Kislyuk*


[arXiv](https://arxiv.org/abs/2401.12244) [Code]()

#### CoMusion: Towards Consistent Stochastic Human Motion Prediction via Motion Diffusion
Jiarui Sun*, Girish Chowdhary*


[arXiv](https://arxiv.org/abs/2305.12554) [Code]()

#### EditShield: Protecting Unauthorized Image Editing by Instruction-guided Diffusion Models
Ruoxi Chen, Haibo Jin, Yixin Liu, Jinyin Chen*, Haohan Wang, Lichao Sun


[arXiv](https://arxiv.org/abs/2311.12066) [Code]()

#### Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities
Lorenzo Baraldi*, Federico Cocchi, Marcella Cornia, Lorenzo Baraldi, Alessandro Nicolosi, Rita Cucchiara


[arXiv](https://arxiv.org/abs/2407.20337) [Code]()

#### Diffusion Soup: Model Merging for Text-to-Image Diffusion Models
Benjamin J Biggs*, Arjun Seshadri, Yang Zou, Achin Jain, Aditya Golatkar, Yusheng Xie, Alessandro Achille, Ashwin Swaminathan, Stefano Soatto


[arXiv](https://arxiv.org/abs/2406.08431) [Code]()

#### DEPICT: Diffusion-Enabled Permutation Importance for Image Classification Tasks
Sarah Jabbour*, Gregory Kondas, Ella Kazerooni, Michael Sjoding, David Fouhey, Jenna Wiens


[arXiv](https://arxiv.org/abs/2407.14509) [Code]()

#### BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion
Gwanghyun Kim, Hayeon Kim, Hoigi Seo, Dong Un Kang, Se Young Chun*


[arXiv](https://arxiv.org/abs/2404.04544) [Code]()

#### Viewpoint textual inversion: discovering scene representations and 3D view control in 2D diffusion models
James Burgess*, Kuan-Chieh Wang, Serena Yeung-Levy


[arXiv](https://arxiv.org/abs/2309.07986) [Code]()

#### Loc3Diff: Local Diffusion for 3D Human Head Synthesis and Editing
Yushi Lan*, Feitong Tan, Qiangeng Xu, Di Qiu, Kyle Genova, Zeng Huang, Rohit Pandey, Sean Fanello, Thomas Funkhouser, Chen Change Loy, Yinda Zhang*


[arXiv](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/08166.pdf) [Code]()

#### Diff-Reg: Diffusion Model in Doubly Stochastic Matrix Space for Registration Problem
Qianliang Wu*, Haobo Jiang*, Lei Luo, Jun Li, Yaqing Ding*, Jin Xie*, Jian Yang*


[arXiv](https://arxiv.org/pdf/2403.19919) [Code]()

#### Investigating Style Similarity in Diffusion Models
Gowthami Somepalli*, Anubhav Gupta, Kamal Gupta, Shramay Palta, Micah Goldblum, Jonas A. Geiping, Abhinav Shrivastava, Tom Goldstein


[arXiv](https://arxiv.org/abs/2404.01292) [Code]()

#### Timestep-Aware Correction for Quantized Diffusion Models
Yuzhe Yao, Feng Tian, Jun Chen*, Haonan Lin, Guang Dai, Yong Liu, Jingdong Wang


[arXiv](https://arxiv.org/abs/2407.03917) [Code]()

#### VQA-Diff: Exploiting VQA and Diffusion for Zero-Shot Image-to-3D Vehicle Asset Generation in Autonomous Driving
YIBO LIU*, Zheyuan Yang, Guile Wu, Yuan Ren, Kejian Lin, Liu Bingbing, Yang Liu, JINJUN SHAN


[arXiv](https://arxiv.org/abs/2407.06516) [Code]()

#### Unmasking Bias in Diffusion Model Training
Hu Yu, Li Shen, Jie Huang, Hongsheng Li, Feng Zhao*


[arXiv](https://arxiv.org/abs/2310.08442) [Code]()

#### Layered Rendering Diffusion Model for Controllable Zero-Shot Image Synthesis
Zipeng Qi, Guoxi Huang*, Chenyang Liu, Fei Ye


[arXiv](https://arxiv.org/abs/2311.18435) [Code]()

#### A Simple Background Augmentation Method for Object Detection with Diffusion Model
Yuhang Li, Xin Dong, Chen Chen, Weiming Zhuang, Lingjuan Lyu*


[arXiv](https://arxiv.org/abs/2408.00350) [Code]()

#### Safeguard Text-to-Image Diffusion Models with Human Feedback Inversion
Sanghyun Kim*, Seohyeon Jung, Balhae Kim, Moonseok Choi, Jinwoo Shin, Juho Lee*


[arXiv](https://arxiv.org/abs/2407.21032) [Code]()

#### An Explainable Vision Question Answer Model via Diffusion Chain-of-Thought
Chunhao LU, Qiang Lu*, Jake Luo


[arXiv](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/08395.pdf) [Code]()

#### FrePolad: Frequency-Rectified Point Latent Diffusion for Point Cloud Generation
Chenliang Zhou*, Fangcheng Zhong, Param Hanji, Zhilin Guo, Kyle Thomas Fogarty, Alejandro Sztrajman, Hongyun Gao, A. Cengiz Oztireli


[arXiv](https://arxiv.org/abs/2311.12090) [Code]()

#### GAMMA-FACE: GAussian Mixture Models Amend Diffusion Models for Bias Mitigation in Face Images
Basudha Pal*, Arunkumar Kannan*, Ram Prabhakar Kathirvel, Alice O'Toole, Rama Chellappa


[arXiv](https://bas-2k.github.io/gamma-face/) [Code]()

#### PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation
jian ma, Chen Chen*, Qingsong Xie, Haonan Lu*


[arXiv](https://arxiv.org/abs/2311.17086) [Code]()

#### Diffusion Model for Robust Multi-Sensor Fusion in 3D Object Detection and BEV Segmentation
Duy Tho Le*, Hengcan Shi*, Jianfei Cai, Hamid Rezatofighi


[arXiv](https://arxiv.org/html/2404.04629v1) [Code]()

#### Self-Guided Generation of Minority Samples Using Diffusion Models
Soobin Um, Jong Chul Ye*


[arXiv](https://arxiv.org/abs/2407.11555) [Code]()

#### Pyramid Diffusion for Fine 3D Large Scene Generation
Yuheng Liu*, Xinke Li, Xueting Li, Lu Qi*, Chongshou Li, Ming-Hsuan Yang


[arXiv](https://arxiv.org/abs/2311.12085) [Code]()

#### ShoeModel: Learning to Wear on the User-specified Shoes via Diffusion Model
Wenyu Li*, Binghui Chen, Yifeng Geng, Xuansong Xie, Wangmeng Zuo


[arXiv](https://arxiv.org/abs/2404.04833) [Code]()

#### A Watermark-Conditioned Diffusion Model for IP Protection
Rui Min*, Sen Li*, Hongyang Chen*, Minhao Cheng*


[arXiv](https://arxiv.org/abs/2403.10893) [Code]()

#### Lost in Translation: Latent Concept Misalignment in Text-to-Image Diffusion Models
Juntu Zhao, Junyu Deng, Yixin Ye, Chongxuan Li, Zhijie Deng*, Dequan Wang*


[arXiv](https://arxiv.org/abs/2408.00230) [Code]()

#### Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression
Animesh Sinha*, Bo Sun, Anmol Kalia, Arantxa Casanova, Elliot Blanchard, David Yan, Winnie Zhang, Tony Nelli, Jiahui Chen, Hardik Shah, Licheng Yu, Mitesh Kumar Singh, Ankit Ramchandani, Maziar Sanjabi, Sonal Gupta, Amy L Bearman, Dhruv Mahajan


[arXiv](https://arxiv.org/abs/2311.10794) [Code]()

#### GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection
Hang Yao, Ming Liu*, Zhicun Yin, Zifei Yan, Xiaopeng Hong, Wangmeng Zuo


[arXiv](https://arxiv.org/abs/2406.07487) [Code]()

#### CipherDM: Secure Three-Party Inference for Diffusion Model Sampling
Xin Zhao, Xiaojun Chen*, Xudong Chen, He Li, Tingyu Fan, Zhendong Zhao


[arXiv](https://www.arxiv.org/abs/2409.05414) [Code]()

#### Time-Efficient and Identity-Consistent Virtual Try-On Using A Variant of Altered Diffusion Models
Phuong Hoang Dam*, Jihoon Jeong*, Anh T Tran*, Daeyoung Kim*


[arXiv](https://arxiv.org/abs/2403.07371) [Code]()

#### Self-Rectifying Diffusion Sampling with Perturbed-Attention Guidance
Donghoon Ahn, Hyoungwon Cho, Jaewon Min, Jungwoo Kim, Wooseok Jang, SeonHwa Kim, Hyun Hee Park, Kyong Hwan Jin*, Seungryong Kim*


[arXiv](https://arxiv.org/abs/2403.17377) [Code]()

#### FRDiff : Feature Reuse for Universal Training-free Acceleration of Diffusion Models
Junhyuk So, Jungwon Lee, Eunhyeok Park*


[arXiv](https://arxiv.org/abs/2312.03517) [Code]()

#### Diffusion for Out-of-Distribution Detection on Road Scenes and Beyond
Silvio Galesso*, Philipp Schröppel*, Hssan Driss, Thomas Brox


[arXiv](https://arxiv.org/abs/2407.15739) [Code]()

#### MONTAGE: Monitoring Training for Attribution of Generative Diffusion Models
Jonathan Brokman*, Omer Hofman, Roman Vainshtein, Amit Giloni, Toshiya Shimizu, Inderjeet Singh, Oren Rachmil, Alon Zolfi, Asaf Shabtai, Yuki Unno, Hisashi Kojima


[arXiv](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/09513.pdf) [Code]()

#### Style-Extracting Diffusion Models for Semi-Supervised Histopathology Segmentation
Mathias Öttl*, Frauke Wilm, Jana Steenpass, Jingna Qiu, Matthias Rübner, Prof Arndt Hartmann, Matthias W. Beckmann, Peter Fasching, Andreas K Maier, Ramona Erber, Bernhard Kainz, Katharina Breininger


[arXiv](https://arxiv.org/abs/2403.14429) [Code]()

#### Deep Diffusion Image Prior for Efficient OOD Adaptation in 3D Inverse Problems
Hyungjin Chung, Jong Chul Ye*


[arXiv](https://arxiv.org/abs/2407.10641) [Code]()

#### LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model
Runhui Huang, Kaixin Cai, Jianhua Han, Xiaodan Liang*, Renjing Pei, Guansong Lu, Songcen Xu, Wei Zhang, Hang Xu


[arXiv](https://arxiv.org/abs/2403.11929) [Code]()

#### UpFusion: Novel View Diffusion from Unposed Sparse View Observations
Bharath Raj Nagoor Kani*, Hsin-Ying Lee, Sergey Tulyakov, Shubham Tulsiani


[arXiv](https://arxiv.org/abs/2312.06661) [Code]()

#### Video Editing via Factorized Diffusion Distillation
Uriel Singer*, Amit Zohar*, Yuval Kirstain, Shelly Sheynin, Adam Polyak, Devi Parikh, Yaniv Taigman


[arXiv](https://arxiv.org/abs/2403.09334) [Code]()

#### CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion
Wendi Zheng*, Jiayan Teng, Zhuoyi Yang, Weihan Wang, Jidong Chen, Xiaotao Gu, Yuxiao Dong*, Ming Ding*, Jie Tang*


[arXiv](https://arxiv.org/abs/2403.05121) [Code]()

#### SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers
Nanye Ma*, Mark Goldstein, Michael Albergo, Nicholas M Boffi, Eric Vanden-Eijnden*, Saining Xie*


[arXiv](https://arxiv.org/abs/2401.08740) [Code]()

#### Curved Diffusion: A Generative Model With Optical Geometry Control
Andrey Voynov*, Amir Hertz, Moab Arar, Shlomi Fruchter, Daniel Cohen-Or


[arXiv](https://arxiv.org/abs/2311.17609) [Code]()

#### AnimateMe: 4D Facial Expressions via Diffusion Models
Dimitrios Gerogiannis*, Foivos Paraperas Papantoniou, Rolandos Alexandros Potamias, Alexandros Lattas, Stylianos Moschoglou, Stylianos Ploumpis, Stefanos Zafeiriou


[arXiv](https://arxiv.org/abs/2403.17213) [Code]()

#### Unveiling and Mitigating Memorization in Text-to-image Diffusion Models through Cross Attention
Jie Ren*, Yaxin Li, Shenglai Zeng, Han Xu, Lingjuan Lyu, Yue Xing, Jiliang Tang


[arXiv](https://arxiv.org/abs/2403.11052) [Code]()

#### Context Diffusion: In-Context Aware Image Generation
Ivona Najdenkoska*, Animesh Sinha, Abhimanyu Dubey, Dhruv Mahajan, Vignesh Ramanathan, Filip Radenovic


[arXiv](https://arxiv.org/abs/2312.03584) [Code]()

#### Adaptive Compressed Sensing with Diffusion-Based Posterior Sampling
Noam Elata*, Tomer Michaeli, Michael Elad


[arXiv](https://arxiv.org/abs/2407.08256) [Code]()

#### Data Augmentation via Latent Diffusion for Saliency Prediction
Bahar Aydemir*, Deblina Bhattacharjee, Tong Zhang, Mathieu Salzmann, Sabine Süsstrunk


[arXiv](https://arxiv.org/abs/2409.07307) [Code]()

#### A Diffusion Model for Simulation Ready Coronary Anatomy with Morpho-skeletal Control
Karim Kadry*, Shreya Gupta, Jonas Sogbadji, Michiel Schaap, Kersten Petersen, Takuya Mizukami, Carlos Collet, Farhad R. Nezami, Elazer R Edelman


[arXiv](https://arxiv.org/abs/2407.15631) [Code]()

#### DrivingDiffusion: Layout-Guided Multi-View Driving Scenarios Video Generation with Latent Diffusion Model
Li Xiaofan*, Zhang Yifu*, Ye Xiaoqing*


[arXiv](https://arxiv.org/abs/2310.07771) [Code]()

#### GSD: View-Guided Gaussian Splatting Diffusion for 3D Reconstruction
Yuxuan Mu*, Xinxin Zuo, Chuan Guo, Yilin Wang, Juwei Lu, Xiaofei Wu, Songcen Xu, Peng Dai, Youliang Yan, Li Cheng


[arXiv](https://arxiv.org/abs/2407.04237) [Code]()

#### AdaDiff: Accelerating Diffusion Models through Step-Wise Adaptive Computation
Shengkun Tang*, Yaqing Wang, Caiwen Ding, Yi Liang, Yao Li, Dongkuan Xu


[arXiv](https://arxiv.org/abs/2309.17074) [Code]()

#### Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas
Fabio Quattrini*, Vittorio Pippi, Silvia Cascianelli*, Rita Cucchiara


[arXiv](https://arxiv.org/abs/2408.15660) [Code]()

#### Photorealistic Video Generation with Diffusion Models
Agrim Gupta*, Lijun Yu, Kihyuk Sohn, Xiuye Gu, Meera Hahn, Li Fei-Fei, Irfan Essa, Lu Jiang, Jose Lezama


[arXiv](https://arxiv.org/abs/2312.06662) [Code]()

#### WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation
Jiachen Lu, Ze Huang, Zeyu Yang, Zhang Jiahui, Li Zhang*


[arXiv](https://arxiv.org/abs/2312.02934) [Code]()

#### Soft Shadow Diffusion (SSD): Physics-inspired Learning for 3D Computational Periscopy
Fadlullah A Raji*, John Murray-Bruce*


[arXiv](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/10427.pdf) [Code]()

#### Tackling Structural Hallucination in Image Translation with Local Diffusion
Seunghoi Kim*, Chen Jin, Tom Diethe, Matteo Figini, Henry FJ Tregidgo, Asher Mullokandov, Philip A Teare, Daniel Alexander


[arXiv](https://arxiv.org/abs/2404.05980) [Code]()

#### Adversarial Robustification via Text-to-Image Diffusion Models
Daewon Choi, Jongheon Jeong, Huiwon Jang, Jinwoo Shin*


[arXiv](https://arxiv.org/abs/2407.18658) [Code]()

#### Learning Quantized Adaptive Conditions for Diffusion Models
Yuchen Liang*, Yuchuan Tian, Lei Yu, Huaao Tang, Jie Hu, Xiangzhong Fang, Hanting Chen*


[arXiv](https://arxiv.org/abs/2409.17487) [Code]()

#### SwiftBrush v2: Make Your One-step Diffusion Model Better Than Its Teacher
Trung Tuan Dao*, Thuan Hoang Nguyen, Thanh Van Le, Duc H Vu, Khoi Nguyen, Cuong Pham, Anh T Tran*


[arXiv](https://arxiv.org/abs/2408.14176) [Code]()

#### DiffSurf: A Transformer-based Diffusion Model for Generating and Reconstructing 3D Surfaces in Pose
Yusuke Yoshiyasu*, Leyuan Sun


[arXiv](https://www.arxiv.org/abs/2408.14860) [Code]()

#### SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow
Yuanzhi Zhu*, Xingchao Liu, Qiang Liu*


[arXiv](https://arxiv.org/abs/2407.12718) [Code]()

#### DreamSampler: Unifying Diffusion Sampling and Score Distillation for Image Manipulation
Jeongsol Kim, Geon Yeong Park, Jong Chul Ye*


[arXiv](https://arxiv.org/abs/2403.11415) [Code]()

#### PreciseControl: Enhancing Text-To-Image Diffusion Models with Fine-Grained Attribute Control
Rishubh Parihar*, Sachidanand VS, Sabariswaran Mani, Tejan Karmali, Venkatesh Babu RADHAKRISHNAN


[arXiv](https://www.arxiv.org/abs/2408.05083) [Code]()

#### Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models
Xiaoshi Wu, Yiming Hao, Manyuan Zhang*, Keqiang Sun, Zhaoyang Huang, Guanglu Song, Yu Liu, Hongsheng Li*


[arXiv](https://arxiv.org/abs/2405.00760) [Code]()

#### Inf-DiT: Upsampling any-resolution image with memory-efficient diffusion transformer
Zhuoyi Yang*, Heyang Jiang, Wenyi Hong, Jiayan Teng, Wendi Zheng, Yuxiao Dong, Ming Ding, Jie Tang


[arXiv](https://arxiv.org/abs/2405.04312) [Code]()

#### EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Linrui Tian*, Qi Wang*, Bang Zhang*, Liefeng Bo*


[arXiv](https://arxiv.org/abs/2402.17485) [Code]()

#### Zero-Shot Adaptation for Approximate Posterior Sampling of Diffusion Models in Inverse Problems
Yasar U Alcalar*, Mehmet Akcakaya


[arXiv](https://arxiv.org/abs/2407.11288) [Code]()

#### R.A.C.E.: Robust Adversarial Concept Erasure for Secure Text-to-Image Diffusion Model
Changhoon Kim*, Kyle Min*, Yezhou Yang


[arXiv](https://arxiv.org/abs/2405.16341) [Code]()

#### Few-Shot Image Generation by Conditional Relaxing Diffusion Inversion
Yu Cao*, Shaogang Gong


[arXiv](https://arxiv.org/abs/2407.07249) [Code]()

#### A high-quality robust diffusion framework for corrupted dataset
Quan Dao*, Binh Ta, Tung Pham, Anh Tran


[arXiv](https://arxiv.org/abs/2311.17101) [Code]()

#### Identity-Consistent Diffusion Network for Grading Knee Osteoarthritis Progression in Radiographic Imaging
Wenhua Wu, Kun Hu*, Wenxi Yue, Wei Li, Milena Simic, Changyang Li, Wei Xiang, Zhiyong Wang


[arXiv](https://arxiv.org/abs/2407.21381) [Code]()

#### Fast Training of Diffusion Transformer with Extreme Masking for 3D Point Clouds Generation
Shentong Mo, Enze Xie*, Yue Wu, Junsong Chen, Matthias Niessner, Zhenguo Li


[arXiv](https://arxiv.org/abs/2312.07231) [Code]()

#### Pix2Gif: Motion-Guided Diffusion for GIF Generation
Hitesh Kandala*, Jianfeng Gao, Jianwei Yang


[arXiv](https://arxiv.org/abs/2403.04634) [Code]()

#### T2IShield: Defending Against Backdoors on Text-to-Image Diffusion Models
Zhongqi Wang, Jie Zhang*, Shiguang Shan, Xilin Chen


[arXiv](https://arxiv.org/abs/2407.04215) [Code]()

#### DiffusionPen: Towards Controlling the Style of Handwritten Text Generation
Konstantina Nikolaidou*, George Retsinas, Giorgos Sfikas, Marcus Liwicki


[arXiv](https://arxiv.org/abs/2409.06065) [Code]()

#### Learning Pseudo 3D Guidance for View-consistent Texturing with 2D Diffusion
Kehan Li, Yanbo Fan*, Yang Wu, Zhongqian Sun, Wei Yang, Xiangyang Ji, Li Yuan, Jie Chen*


[arXiv](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/11528.pdf) [Code]()

#### Enhancing Semantic Fidelity in Text-to-Image Synthesis: Attention Regulation in Diffusion Models
Yang Zhang*, Tze Tzun Teoh, Wei Hern Lim, Kenji Kawaguchi


[arXiv](https://arxiv.org/abs/2403.06381) [Code]()

#### Adversarial Diffusion Distillation
Axel Sauer*, Dominik Lorenz, Andreas Blattmann, Robin Rombach


[arXiv](https://arxiv.org/abs/2311.17042) [Code]()

#### Improving Diffusion Models for Authentic Virtual Try-on in the Wild
Yisol Choi*, Sangkyung Kwak, Kyungmin Lee, Hyungwon Choi, Jinwoo Shin*


[arXiv](https://arxiv.org/abs/2403.05139) [Code]()

#### Fast Diffusion-Based Counterfactuals for Shortcut Removal and Generation
Nina Weng*, Paraskevas Pegios, Eike Petersen, Aasa Feragen, Siavash Arjomand Bigdeli


[arXiv](https://arxiv.org/abs/2312.14223) [Code]()

#### Iterative Ensemble Training with Anti-Gradient Control for Mitigating Memorization in Diffusion Models
Xiao Liu, Xiaoliu Guan, Yu Wu*, Jiaxu Miao*


[arXiv](https://arxiv.org/abs/2407.15328) [Code]()

#### DiffClass: Diffusion-Based Class Incremental Learning
Zichong Meng, Jie Zhang, Changdi Yang, Zheng Zhan, Pu Zhao*, Yanzhi Wang*


[arXiv](https://arxiv.org/abs/2403.05016) [Code]()

#### Instant 3D Human Avatar Generation using Image Diffusion Models
Nikos Kolotouros*, Thiemo Alldieck, Enric Corona, Eduard Gabriel Bazavan, Cristian Sminchisescu


[arXiv](https://arxiv.org/abs/2406.07516) [Code]()

#### Concept Arithmetics for Circumventing Concept Inhibition in Diffusion Models
Vitali Petsiuk*, Kate Saenko


[arXiv](https://arxiv.org/abs/2404.13706) [Code]()

#### ControlNet-XS: Rethinking the Control of Text-to-Image Diffusion Models as Feedback-Control Systems
Denis Zavadski*, Johann-Friedrich Feiden, Carsten Rother


[arXiv](https://arxiv.org/abs/2312.06573) [Code]()

#### Hierarchical Conditioning of Diffusion Models Using Tree-of-Life for Studying Species Evolution
Mridul Khurana*, Arka Daw, M. Maruf, Josef C. Uyeda, Wasila Dahdul, Caleb Charpentier, Yasin Bakış, Henry L. Bart Jr., Paula M. Mabee, Hilmar Lapp, James P. Balhoff, Wei-Lun Chao, Charles Stewart, Tanya Berger-Wolf, Anuj Karpatne*

![Poster Badge](https://img.shields.io/badge/Poster-purple) [arXiv](https://arxiv.org/abs/2408.00160) [Code]()