Multimodal image-to-image translation between domains with high internal variability

被引:6
|
作者
Wang, Jian [1 ]
Lv, Jiancheng [1 ]
Yang, Xue [1 ]
Tang, Chenwei [1 ]
Peng, Xi [1 ]
机构
[1] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China
基金
国家重点研发计划; 美国国家科学基金会;
关键词
GANs; Image translation; High internal variability; Catastrophic forgetting; Generator regulating;
D O I
10.1007/s00500-020-05073-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal image-to-image translation based on generative adversarial networks (GANs) shows suboptimal performance in the visual domains with high internal variability, e.g., translation from multiple breeds of cats to multiple breeds of dogs. To alleviate this problem, we recast the training procedure as modeling distinct distributions which are observed sequentially, for example, when different classes are encountered over time. As a result, the discriminator may forget about the previous target distributions, known as catastrophic forgetting, leading to non-/slow convergence. Through experimental observation, we found that the discriminator does not always forget the previously learned distributions during training. Therefore, we propose a novel generator regulating GAN (GR-GAN). The proposed method encourages the discriminator to teach the generator more effectively when it remembers more of the previously learned distributions, while discouraging the discriminator to guide the generator when catastrophic forgetting happens on the discriminator. Both qualitative and quantitative results show that the proposed method is significantly superior to the state-of-the-art methods in handling the image data that are with high variability.
引用
收藏
页码:18173 / 18184
页数:12
相关论文
共 50 条
  • [41] Rethinking the Truly Unsupervised Image-to-Image Translation
    Baek, Kyungjune
    Choi, Yunjey
    Uh, Youngjung
    Yoo, Jaejun
    Shim, Hyunjung
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 14134 - 14143
  • [42] Panoptic-aware Image-to-Image Translation
    Zhang, Liyun
    Ratsamee, Photchara
    Wang, Bowen
    Luo, Zhaojie
    Uranishi, Yuki
    Higashida, Manabu
    Takemura, Haruo
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 259 - 268
  • [43] Unpaired image-to-image translation of structural damage
    Varghese, Subin
    Hoskere, Vedhus
    ADVANCED ENGINEERING INFORMATICS, 2023, 56
  • [44] Equivariant Adversarial Network for Image-to-image Translation
    Zareapoor, Masoumeh
    Yang, Jie
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2021, 17 (02)
  • [45] Avoiding Shortcuts in Unpaired Image-to-Image Translation
    Fontanini, Tomaso
    Botti, Filippo
    Bertozzi, Massimo
    Prati, Andrea
    IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT I, 2022, 13231 : 463 - 475
  • [46] Visualization Techniques applied to Image-to-Image Translation
    Protas, Eglen
    Bratti, Jose
    Ribeiro, Pedro O. C. S.
    Drews-, Paulo, Jr.
    Botelho, Silvia
    2018 7TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2018, : 242 - 247
  • [47] Image-to-image translation for wavefront and PSF estimation
    Smith, Jeffrey
    Cranney, Jesse
    Gretton, Charles
    Gratadour, Damien
    ADAPTIVE OPTICS SYSTEMS VIII, 2022, 12185
  • [48] Consistent Embedded GAN for Image-to-Image Translation
    Xiong, Feng
    Wang, Qianqian
    Gao, Quanxue
    IEEE ACCESS, 2019, 7 : 126651 - 126661
  • [49] Semantic Example Guided Image-to-Image Translation
    Huang, Jialu
    Liao, Jing
    Kwong, Sam
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 1654 - 1665
  • [50] Robotic Instrument Segmentation With Image-to-Image Translation
    Colleoni, Emanuele
    Stoyanov, Danail
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02) : 935 - 942