Diffusion Models for Cross-Domain Image-to-Image Translation with Paired and Partially Paired Datasets

被引:0
|
作者
Bell, Trisk [1 ]
Li, Dan [1 ]
机构
[1] Eastern Washington Univ, Dept CSEE, Spokane, WA 99202 USA
来源
2024 IEEE 11TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS, DSAA 2024 | 2024年
关键词
image generation; GAN; diffusion model; conditional v-diffusion model;
D O I
10.1109/DSAA61799.2024.10722775
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The line-art colorization problem is a task in generative modeling with the goal of generating colored artworks from an artist's hand-drawn line-arts. Machine learning models such as generative adversarial networks have been applied to this task. At the time of this research, the application of diffusion models to this task has not been well studied, despite the impressive results in image generation that diffusion models have demonstrated. We propose to apply conditional diffusion models to the line-art colorization problem and expand the capability of such models by proposing conditional cross-domain diffusion models, capable of a two-way transformation between image domains. The main findings are 1) the conditional diffusion models are effective at the task of line-art colorization and they provide state-of-the-art results compared to previous methods, and 2) the proposed conditional cross-domain diffusion models are capable of two-way cross domain image-to-image translation with high quality results and they can be trained on both paired and partially paired images(1).
引用
收藏
页码:38 / 45
页数:8
相关论文
共 50 条
  • [31] DMDIT: Diverse multi-domain image-to-image translation
    Shao, Mingwen
    Zhang, Youcai
    Liu, Huan
    Wang, Chao
    Li, Le
    Shao, Xun
    KNOWLEDGE-BASED SYSTEMS, 2021, 229
  • [32] Comparison of Deep Learning Image-to-image Models for Medical Image Translation
    Yang, Zeyu
    Zoellner, Frank G.
    BILDVERARBEITUNG FUR DIE MEDIZIN 2024, 2024, : 344 - 349
  • [33] Cross-domain object detection using unsupervised image translation
    Arruda, Vinicius F.
    Berriel, Rodrigo F.
    Paixao, Thiago M.
    Badue, Claudine
    De Souza, Alberto F.
    Sebe, Nicu
    Oliveira-Santos, Thiago
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 192
  • [34] Paired Image to Image Translation for Strikethrough Removal from Handwritten Words
    Heil, Raphaela
    Vats, Ekta
    Hast, Anders
    DOCUMENT ANALYSIS SYSTEMS, DAS 2022, 2022, 13237 : 309 - 322
  • [35] Cross-domain object detection using minimized instance shift image–image translation
    Gen Liu
    Jin Han
    The Visual Computer, 2023, 39 : 5013 - 5026
  • [36] Literature Review of Generative models for Image-to-Image translation problems
    Kamil, Anwar
    Shaikh, Talal
    PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND KNOWLEDGE ECONOMY (ICCIKE' 2019), 2019, : 341 - 346
  • [37] TriGAN: image-to-image translation for multi-source domain adaptation
    Roy, Subhankar
    Siarohin, Aliaksandr
    Sangineto, Enver
    Sebe, Nicu
    Ricci, Elisa
    MACHINE VISION AND APPLICATIONS, 2021, 32 (01)
  • [38] TriGAN: image-to-image translation for multi-source domain adaptation
    Subhankar Roy
    Aliaksandr Siarohin
    Enver Sangineto
    Nicu Sebe
    Elisa Ricci
    Machine Vision and Applications, 2021, 32
  • [39] Retrieval Guided Unsupervised Multi-domain Image-to-Image Translation
    Gomez, Raul
    Liu, Yahui
    De Nadai, Marco
    Karatzas, Dimosthenis
    Lepri, Bruno
    Sebe, Nicu
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3164 - 3172
  • [40] IMPROVING OPEN SET DOMAIN ADAPTATION USING IMAGE-TO-IMAGE TRANSLATION
    Zhang, Hongjie
    Li, Ang
    Han, Xu
    Chen, Zhaoming
    Zhang, Yang
    Guo, Yanwen
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1258 - 1263