A Diffusion Model Translator for Efficient Image-to-Image Translation

被引:3
|
作者
Xia, Mengfei [1 ]
Zhou, Yu [1 ]
Yi, Ran [2 ]
Liu, Yong-Jin [1 ]
Wang, Wenping [3 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, MOE Key Lab Pervas Comp, Beijing 100084, Peoples R China
[2] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200240, Peoples R China
[3] Texas A&M Univ, Dept Comp Sci & Comp Engn, College Stn, TX 77840 USA
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Task analysis; Noise reduction; Diffusion models; Diffusion processes; Training; Computer science; Trajectory; image translation; deep learning; generative models;
D O I
10.1109/TPAMI.2024.3435448
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Applying diffusion models to image-to-image translation (I2I) has recently received increasing attention due to its practical applications. Previous attempts inject information from the source image into each denoising step for an iterative refinement, thus resulting in a time-consuming implementation. We propose an efficient method that equips a diffusion model with a lightweight translator, dubbed a Diffusion Model Translator (DMT), to accomplish I2I. Specifically, we first offer theoretical justification that in employing the pioneering DDPM work for the I2I task, it is both feasible and sufficient to transfer the distribution from one domain to another only at some intermediate step. We further observe that the translation performance highly depends on the chosen timestep for domain transfer, and therefore propose a practical strategy to automatically select an appropriate timestep for a given task. We evaluate our approach on a range of I2I applications, including image stylization, image colorization, segmentation to image, and sketch to image, to validate its efficacy and general utility. The comparisons show that our DMT surpasses existing methods in both quality and efficiency. Code is available at https://github.com/THU-LYJ-Lab/dmt.
引用
收藏
页码:10272 / 10283
页数:12
相关论文
共 50 条
  • [1] DiffI2I: Efficient Diffusion Model for Image-to-Image Translation
    Xia, Bin
    Zhang, Yulun
    Wang, Shiyin
    Wang, Yitong
    Wu, Xinglong
    Tian, Yapeng
    Yang, Wenming
    Timotfe, Radu
    Van Gool, Luc
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (03) : 1578 - 1593
  • [2] Unpaired Image-to-Image Translation with Diffusion Adversarial Network
    Tu, Hangyao
    Wang, Zheng
    Zhao, Yanwei
    MATHEMATICS, 2024, 12 (20)
  • [3] BBDM: Image-to-Image Translation with Brownian Bridge Diffusion Models
    Li, Bo
    Xue, Kaitao
    Liu, Bin
    Lai, Yu-Kun
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 1952 - 1961
  • [4] Reversible GANs for Memory-efficient Image-to-Image Translation
    van der Ouderaa, Tycho F. A.
    Worrall, Daniel E.
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 4715 - 4723
  • [5] Wavelet Knowledge Distillation: Towards Efficient Image-to-Image Translation
    Zhang, Linfeng
    Chen, Xin
    Tu, Xiaobing
    Wan, Pengfei
    Xu, Ning
    Ma, Kaisheng
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 12454 - 12464
  • [6] Conditional Image-to-Image translation
    Lin, Jianxin
    Xia, Yingce
    Qin, Tao
    Chen, Zhibo
    Liu, Tie-Yan
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5524 - 5532
  • [7] UGC: Unified GAN Compression for Efficient Image-to-Image Translation
    Ren, Yuxi
    Wu, Jie
    Zhang, Peng
    Zhang, Manlin
    Xiao, Xuefeng
    He, Qian
    Wang, Rui
    Zheng, Min
    Pan, Xin
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 17235 - 17245
  • [8] Hypercomplex Image-to-Image Translation
    Grassucci, Eleonora
    Sigillo, Luigi
    Uncini, Aurelio
    Comminiello, Danilo
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [9] IMAGE DATA AUGMENTATION WITH UNPAIRED IMAGE-TO-IMAGE CAMERA MODEL TRANSLATION
    Foo, Chi Fa
    Winkler, Stefan
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 3246 - 3250
  • [10] Dissecting and Mitigating Semantic Discrepancy in Stable Diffusion for Image-to-Image Translation
    Yuan, Yifan
    Yang, Guanqun
    Wang, James Z.
    Zhang, Hui
    Shan, Hongming
    Wang, Fei-Yue
    Zhang, Junping
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2025, 12 (04) : 705 - 718