Diffusion Models for Cross-Domain Image-to-Image Translation with Paired and Partially Paired Datasets

被引:0
|
作者
Bell, Trisk [1 ]
Li, Dan [1 ]
机构
[1] Eastern Washington Univ, Dept CSEE, Spokane, WA 99202 USA
关键词
image generation; GAN; diffusion model; conditional v-diffusion model;
D O I
10.1109/DSAA61799.2024.10722775
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The line-art colorization problem is a task in generative modeling with the goal of generating colored artworks from an artist's hand-drawn line-arts. Machine learning models such as generative adversarial networks have been applied to this task. At the time of this research, the application of diffusion models to this task has not been well studied, despite the impressive results in image generation that diffusion models have demonstrated. We propose to apply conditional diffusion models to the line-art colorization problem and expand the capability of such models by proposing conditional cross-domain diffusion models, capable of a two-way transformation between image domains. The main findings are 1) the conditional diffusion models are effective at the task of line-art colorization and they provide state-of-the-art results compared to previous methods, and 2) the proposed conditional cross-domain diffusion models are capable of two-way cross domain image-to-image translation with high quality results and they can be trained on both paired and partially paired images(1).
引用
收藏
页码:38 / 45
页数:8
相关论文
共 50 条
  • [1] Image-to-image translation for cross-domain disentanglement
    Gonzalez-Garcia, Abel
    van de Weijer, Joost
    Bengio, Yoshua
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [2] Cross-Domain Interpolation for Unpaired Image-to-Image Translation
    Lopez, Jorge
    Mauricio, Antoni
    Diaz, Jose
    Camara, Guillermo
    COMPUTER VISION SYSTEMS (ICVS 2019), 2019, 11754 : 542 - 551
  • [3] Cross-Domain Infrared Image Classification via Image-to-Image Translation and Deep Domain Generalization
    Guo, Zhao-Rui
    Niu, Jia-Wei
    Liu, Zhun-Ga
    2022 17TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2022, : 487 - 493
  • [4] Image-To-Image Translation Using a Cross-Domain Auto-Encoder and Decoder
    Yoo, Jaechang
    Eom, Heesong
    Choi, Yong Suk
    APPLIED SCIENCES-BASEL, 2019, 9 (22):
  • [5] Learning Unsupervised Cross-domain Image-to-Image Translation using a Shared Discriminator
    Kumar, Rajiv
    Dabral, Rishabh
    Sivakumar, G.
    VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 5: VISAPP, 2021, : 256 - 264
  • [6] Learning Image-to-Image Translation Using Paired and Unpaired Training Samples
    Tripathy, Soumya
    Kannala, Juho
    Rahtu, Esa
    COMPUTER VISION - ACCV 2018, PT II, 2019, 11362 : 51 - 66
  • [7] CACOLIT: Cross-domain Adaptive Co-learning for Imbalanced Image-to-Image Translation
    Wang, Yijun
    Liang, Tao
    Lin, Jianxin
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 1068 - 1076
  • [8] Domain-knowledge Inspired Pseudo Supervision (DIPS) for unsupervised image-to-image translation models to support cross-domain classification
    Al-Hindawi, Firas
    Siddiquee, Md Mahfuzur Rahman
    Wu, Teresa
    Hu, Han
    Sun, Ying
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 127
  • [9] CROSS-DOMAIN SAR SHIP DETECTION IN STRONG INTERFERENCE ENVIRONMENT BASED ON IMAGE-TO-IMAGE TRANSLATION
    Pu, Xinyang
    Jia, Hecheng
    Xu, Feng
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 1798 - 1801
  • [10] Cross-Domain Car Detection Using Unsupervised Image-to-Image Translation: From Day to Night
    Arruda, Vinicius F.
    Paixao, Thiago M.
    Berriel, Rodrigo F.
    De Souza, Alberto F.
    Badue, Claudine
    Sebe, Nicu
    Oliveira-Santos, Thiago
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,