Multimodal image-to-image translation between domains with high internal variability

被引:6
|
作者
Wang, Jian [1 ]
Lv, Jiancheng [1 ]
Yang, Xue [1 ]
Tang, Chenwei [1 ]
Peng, Xi [1 ]
机构
[1] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China
基金
国家重点研发计划; 美国国家科学基金会;
关键词
GANs; Image translation; High internal variability; Catastrophic forgetting; Generator regulating;
D O I
10.1007/s00500-020-05073-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal image-to-image translation based on generative adversarial networks (GANs) shows suboptimal performance in the visual domains with high internal variability, e.g., translation from multiple breeds of cats to multiple breeds of dogs. To alleviate this problem, we recast the training procedure as modeling distinct distributions which are observed sequentially, for example, when different classes are encountered over time. As a result, the discriminator may forget about the previous target distributions, known as catastrophic forgetting, leading to non-/slow convergence. Through experimental observation, we found that the discriminator does not always forget the previously learned distributions during training. Therefore, we propose a novel generator regulating GAN (GR-GAN). The proposed method encourages the discriminator to teach the generator more effectively when it remembers more of the previously learned distributions, while discouraging the discriminator to guide the generator when catastrophic forgetting happens on the discriminator. Both qualitative and quantitative results show that the proposed method is significantly superior to the state-of-the-art methods in handling the image data that are with high variability.
引用
收藏
页码:18173 / 18184
页数:12
相关论文
共 50 条
  • [21] Generative image completion with image-to-image translation
    Xu, Shuzhen
    Zhu, Qing
    Wang, Jin
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (11): : 7333 - 7345
  • [22] High-Resolution Semantically Consistent Image-to-Image Translation
    Sokolov, Mikhail
    Henry, Christopher
    Storie, Joni
    Storie, Christopher
    Alhassan, Victor
    Turgeon-Pelchat, Mathieu
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 482 - 492
  • [23] Vector Quantized Image-to-Image Translation
    Chen, Yu-Jie
    Cheng, Shin-I
    Chiu, Wei-Chen
    Tseng, Hung-Yu
    Lee, Hsin-Ying
    COMPUTER VISION - ECCV 2022, PT XVI, 2022, 13676 : 440 - 456
  • [24] Deliberation Learning for Image-to-Image Translation
    He, Tianyu
    Xia, Yingce
    Lin, Jianxin
    Tan, Xu
    He, Di
    Qin, Tao
    Chen, Zhibo
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2484 - 2490
  • [25] Image-to-Image Translation: Methods and Applications
    Pang, Yingxue
    Lin, Jianxin
    Qin, Tao
    Chen, Zhibo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 3859 - 3881
  • [26] Domain Adaptive Image-to-image Translation
    Chen, Ying-Cong
    Xu, Xiaogang
    Jia, Jiaya
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5273 - 5282
  • [27] Unsupervised Image-to-Image Translation: A Review
    Hoyez, Henri
    Schockaert, Cedric
    Rambach, Jason
    Mirbach, Bruno
    Stricker, Didier
    SENSORS, 2022, 22 (21)
  • [28] Unsupervised Image-to-Image Translation Networks
    Liu, Ming-Yu
    Breuel, Thomas
    Kautz, Jan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [29] SuperstarGAN: Generative adversarial networks for image-to-image translation in large-scale domains
    Ko, Kanghyeok
    Yeom, Taesun
    Lee, Minhyeok
    NEURAL NETWORKS, 2023, 162 : 330 - 339
  • [30] A novel framework for image-to-image translation and image compression
    Yang, Fei
    Wang, Yaxing
    Herranz, Luis
    Cheng, Yongmei
    Mozerov, Mikhail G.
    NEUROCOMPUTING, 2022, 508 : 58 - 70