Multimodal image-to-image translation between domains with high internal variability

Cited by: 6

Authors:

Wang, Jian [1]

Lv, Jiancheng [1]

Yang, Xue [1]

Tang, Chenwei [1]

Peng, Xi [1]

Affiliations:

[1] Sichuan University, College of Computer Science, Chengdu 610065, People's Republic of China

Source:

Soft Computing | 2020 / Vol. 24 / Issue 23

Funding:

National Key Research and Development Program; U.S. National Science Foundation;

Keywords:

GANs; Image translation; High internal variability; Catastrophic forgetting; Generator regulating;

DOI:

10.1007/s00500-020-05073-6

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Multimodal image-to-image translation based on generative adversarial networks (GANs) shows suboptimal performance in visual domains with high internal variability, e.g., translation from multiple breeds of cats to multiple breeds of dogs. To alleviate this problem, we recast the training procedure as modeling distinct distributions that are observed sequentially, for example, when different classes are encountered over time. Under this view, the discriminator may forget the previous target distributions, a phenomenon known as catastrophic forgetting, which leads to slow convergence or non-convergence. Through experimental observation, we found that the discriminator does not always forget the previously learned distributions during training. Therefore, we propose a novel generator regulating GAN (GR-GAN). The proposed method encourages the discriminator to teach the generator more effectively when it remembers more of the previously learned distributions, while discouraging the discriminator from guiding the generator when the discriminator suffers catastrophic forgetting. Both qualitative and quantitative results show that the proposed method is significantly superior to state-of-the-art methods in handling image data with high variability.


Pages: 18173-18184

Page count: 12

