Multimodal image-to-image translation between domains with high internal variability

被引:6
|
作者
Wang, Jian [1 ]
Lv, Jiancheng [1 ]
Yang, Xue [1 ]
Tang, Chenwei [1 ]
Peng, Xi [1 ]
机构
[1] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China
基金
国家重点研发计划; 美国国家科学基金会;
关键词
GANs; Image translation; High internal variability; Catastrophic forgetting; Generator regulating;
D O I
10.1007/s00500-020-05073-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal image-to-image translation based on generative adversarial networks (GANs) shows suboptimal performance in the visual domains with high internal variability, e.g., translation from multiple breeds of cats to multiple breeds of dogs. To alleviate this problem, we recast the training procedure as modeling distinct distributions which are observed sequentially, for example, when different classes are encountered over time. As a result, the discriminator may forget about the previous target distributions, known as catastrophic forgetting, leading to non-/slow convergence. Through experimental observation, we found that the discriminator does not always forget the previously learned distributions during training. Therefore, we propose a novel generator regulating GAN (GR-GAN). The proposed method encourages the discriminator to teach the generator more effectively when it remembers more of the previously learned distributions, while discouraging the discriminator to guide the generator when catastrophic forgetting happens on the discriminator. Both qualitative and quantitative results show that the proposed method is significantly superior to the state-of-the-art methods in handling the image data that are with high variability.
引用
收藏
页码:18173 / 18184
页数:12
相关论文
共 50 条
  • [1] Multimodal image-to-image translation between domains with high internal variability
    Jian Wang
    Jiancheng Lv
    Xue Yang
    Chenwei Tang
    Xi Peng
    Soft Computing, 2020, 24 : 18173 - 18184
  • [2] Toward Multimodal Image-to-Image Translation
    Zhu, Jun-Yan
    Zhang, Richard
    Pathak, Deepak
    Darrell, Trevor
    Efros, Alexei A.
    Wang, Oliver
    Shechtman, Eli
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [3] Multimodal Unsupervised Image-to-Image Translation
    Huang, Xun
    Liu, Ming-Yu
    Belongie, Serge
    Kautz, Jan
    COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 : 179 - 196
  • [4] Leveraging Local Domains for Image-to-Image Translation
    Dell'Eva, Anthony
    Pizzati, Fabio
    Bertozzi, Massimo
    de Charette, Raoul
    PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2022, : 179 - 189
  • [5] MULTIMODAL IMAGE-TO-IMAGE TRANSLATION FOR GENERATION OF GASTRITIS IMAGES
    Togo, Ren
    Ogawa, Takahiro
    Haseyama, Miki
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2466 - 2470
  • [6] Multimodal Structure-Consistent Image-to-Image Translation
    Lin, Che-Tsung
    Wu, Yen-Yi
    Hsu, Po-Hao
    Lai, Shang-Hong
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11490 - 11498
  • [7] LOSSLESS CODING OF MULTIMODAL IMAGE PAIRS BASED ON IMAGE-TO-IMAGE TRANSLATION
    Parracho, Joao O.
    Thomaz, Lucas A.
    Tavora, Luis M. N.
    Assuncao, Pedro A. A.
    Faria, Sergio M. M.
    2022 10TH EUROPEAN WORKSHOP ON VISUAL INFORMATION PROCESSING (EUVIP), 2022,
  • [8] Is image-to-image translation the panacea for multimodal image registration? A comparative study
    Lu, Jiahao
    Ofverstedt, Johan
    Lindblad, Joakim
    Sladoje, Natasa
    PLOS ONE, 2022, 17 (11):
  • [9] SUNIT: multimodal unsupervised image-to-image translation with shared encoder
    Lin, Liyuan
    Ji, Shulin
    Zhou, Yuan
    Zhang, Shun
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (01)
  • [10] Latent Filter Scaling for Multimodal Unsupervised Image-to-Image Translation
    Alharbi, Yazeed
    Smith, Neil
    Wonka, Peter
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 1458 - 1466