Diffusion Models for Cross-Domain Image-to-Image Translation with Paired and Partially Paired Datasets

被引：0

作者：

Bell, Trisk ^{[1
]}

Li, Dan ^{[1
]}

机构：

[1] Eastern Washington Univ, Dept CSEE, Spokane, WA 99202 USA

来源：

2024 IEEE 11TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS, DSAA 2024 | 2024年

关键词：

image generation; GAN; diffusion model; conditional v-diffusion model;

D O I：

10.1109/DSAA61799.2024.10722775

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The line-art colorization problem is a task in generative modeling with the goal of generating colored artworks from an artist's hand-drawn line-arts. Machine learning models such as generative adversarial networks have been applied to this task. At the time of this research, the application of diffusion models to this task has not been well studied, despite the impressive results in image generation that diffusion models have demonstrated. We propose to apply conditional diffusion models to the line-art colorization problem and expand the capability of such models by proposing conditional cross-domain diffusion models, capable of a two-way transformation between image domains. The main findings are 1) the conditional diffusion models are effective at the task of line-art colorization and they provide state-of-the-art results compared to previous methods, and 2) the proposed conditional cross-domain diffusion models are capable of two-way cross domain image-to-image translation with high quality results and they can be trained on both paired and partially paired images(1).

引用

页码：38 / 45

页数：8

共 50 条

[31] DMDIT: Diverse multi-domain image-to-image translation
Shao, Mingwen
Zhang, Youcai
Liu, Huan
Wang, Chao
Li, Le
Shao, Xun
KNOWLEDGE-BASED SYSTEMS, 2021, 229
[32] Comparison of Deep Learning Image-to-image Models for Medical Image Translation
Yang, Zeyu
Zoellner, Frank G.
BILDVERARBEITUNG FUR DIE MEDIZIN 2024, 2024, : 344 - 349
[33] Cross-domain object detection using unsupervised image translation
Arruda, Vinicius F.
Berriel, Rodrigo F.
Paixao, Thiago M.
Badue, Claudine
De Souza, Alberto F.
Sebe, Nicu
Oliveira-Santos, Thiago
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 192
[34] Paired Image to Image Translation for Strikethrough Removal from Handwritten Words
Heil, Raphaela
Vats, Ekta
Hast, Anders
DOCUMENT ANALYSIS SYSTEMS, DAS 2022, 2022, 13237 : 309 - 322
[35] Cross-domain object detection using minimized instance shift image–image translation
Gen Liu
Jin Han
The Visual Computer, 2023, 39 : 5013 - 5026
[36] Literature Review of Generative models for Image-to-Image translation problems
Kamil, Anwar
Shaikh, Talal
PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND KNOWLEDGE ECONOMY (ICCIKE' 2019), 2019, : 341 - 346
[37] TriGAN: image-to-image translation for multi-source domain adaptation
Roy, Subhankar
Siarohin, Aliaksandr
Sangineto, Enver
Sebe, Nicu
Ricci, Elisa
MACHINE VISION AND APPLICATIONS, 2021, 32 (01)
[38] TriGAN: image-to-image translation for multi-source domain adaptation
Subhankar Roy
Aliaksandr Siarohin
Enver Sangineto
Nicu Sebe
Elisa Ricci
Machine Vision and Applications, 2021, 32
[39] Retrieval Guided Unsupervised Multi-domain Image-to-Image Translation
Gomez, Raul
Liu, Yahui
De Nadai, Marco
Karatzas, Dimosthenis
Lepri, Bruno
Sebe, Nicu
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3164 - 3172
[40] IMPROVING OPEN SET DOMAIN ADAPTATION USING IMAGE-TO-IMAGE TRANSLATION
Zhang, Hongjie
Li, Ang
Han, Xu
Chen, Zhaoming
Zhang, Yang
Guo, Yanwen
2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1258 - 1263

← 1 2 3 4 5 →