Diffusion Models for Cross-Domain Image-to-Image Translation with Paired and Partially Paired Datasets

被引：0

作者：

Bell, Trisk ^{[1
]}

Li, Dan ^{[1
]}

机构：

[1] Eastern Washington Univ, Dept CSEE, Spokane, WA 99202 USA

来源：

2024 IEEE 11TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS, DSAA 2024 | 2024年

关键词：

image generation; GAN; diffusion model; conditional v-diffusion model;

D O I：

10.1109/DSAA61799.2024.10722775

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The line-art colorization problem is a task in generative modeling with the goal of generating colored artworks from an artist's hand-drawn line-arts. Machine learning models such as generative adversarial networks have been applied to this task. At the time of this research, the application of diffusion models to this task has not been well studied, despite the impressive results in image generation that diffusion models have demonstrated. We propose to apply conditional diffusion models to the line-art colorization problem and expand the capability of such models by proposing conditional cross-domain diffusion models, capable of a two-way transformation between image domains. The main findings are 1) the conditional diffusion models are effective at the task of line-art colorization and they provide state-of-the-art results compared to previous methods, and 2) the proposed conditional cross-domain diffusion models are capable of two-way cross domain image-to-image translation with high quality results and they can be trained on both paired and partially paired images(1).

引用

页码：38 / 45

页数：8

共 50 条

[1] Image-to-image translation for cross-domain disentanglement
Gonzalez-Garcia, Abel
van de Weijer, Joost
Bengio, Yoshua
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
[2] Cross-Domain Interpolation for Unpaired Image-to-Image Translation
Lopez, Jorge
Mauricio, Antoni
Diaz, Jose
Camara, Guillermo
COMPUTER VISION SYSTEMS (ICVS 2019), 2019, 11754 : 542 - 551
[3] Cross-Domain Infrared Image Classification via Image-to-Image Translation and Deep Domain Generalization
Guo, Zhao-Rui
Niu, Jia-Wei
Liu, Zhun-Ga
2022 17TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2022, : 487 - 493
[4] Image-To-Image Translation Using a Cross-Domain Auto-Encoder and Decoder
Yoo, Jaechang
Eom, Heesong
Choi, Yong Suk
APPLIED SCIENCES-BASEL, 2019, 9 (22):
[5] Learning Unsupervised Cross-domain Image-to-Image Translation using a Shared Discriminator
Kumar, Rajiv
Dabral, Rishabh
Sivakumar, G.
VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 5: VISAPP, 2021, : 256 - 264
[6] Learning Image-to-Image Translation Using Paired and Unpaired Training Samples
Tripathy, Soumya
Kannala, Juho
Rahtu, Esa
COMPUTER VISION - ACCV 2018, PT II, 2019, 11362 : 51 - 66
[7] CACOLIT: Cross-domain Adaptive Co-learning for Imbalanced Image-to-Image Translation
Wang, Yijun
Liang, Tao
Lin, Jianxin
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 1068 - 1076
[8] Domain-knowledge Inspired Pseudo Supervision (DIPS) for unsupervised image-to-image translation models to support cross-domain classification
Al-Hindawi, Firas
Siddiquee, Md Mahfuzur Rahman
Wu, Teresa
Hu, Han
Sun, Ying
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 127
[9] CROSS-DOMAIN SAR SHIP DETECTION IN STRONG INTERFERENCE ENVIRONMENT BASED ON IMAGE-TO-IMAGE TRANSLATION
Pu, Xinyang
Jia, Hecheng
Xu, Feng
IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 1798 - 1801
[10] Cross-Domain Car Detection Using Unsupervised Image-to-Image Translation: From Day to Night
Arruda, Vinicius F.
Paixao, Thiago M.
Berriel, Rodrigo F.
De Souza, Alberto F.
Badue, Claudine
Sebe, Nicu
Oliveira-Santos, Thiago
2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,

← 1 2 3 4 5 →