Zero-shot unsupervised image-to-image translation via exploiting semantic attributes

Cited by: 2
Authors
Chen, Yuanqi [1, 2]
Yu, Xiaoming [1, 2]
Liu, Shan [3]
Gao, Wei [1, 2]
Li, Ge [1]
Affiliations
[1] Peking Univ, Sch Elect & Comp Engn, Shenzhen Grad Sch, Shenzhen 518055, Peoples R China
[2] Peng Cheng Lab, Shenzhen 518055, Peoples R China
[3] Tencent Inc, Shenzhen 518000, Peoples R China
Funding
National Key Research and Development Program of China
Keywords
Image-to-image translation; Image synthesis; Zero-shot learning; Generative adversarial networks; GAN; Classification
DOI
10.1016/j.imavis.2022.104489
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Recent studies have shown remarkable success in unsupervised image-to-image translation. However, when too few images of the target classes are available, learning a mapping from source classes to target classes suffers from mode collapse, especially in the zero-shot case, which limits the applicability of existing methods. In this work, we propose a zero-shot unsupervised image-to-image translation framework that addresses this limitation by effectively associating categories with side information such as attributes. To generalize the translator to previously unseen classes, we introduce two strategies for exploiting the semantic attribute space. First, we propose carrying the semantic relations over to the visual space, which provides effective guidance on where to map the input image. Second, we expand the attribute space by utilizing the attribute vectors of unseen classes, which alleviates the mapping bias for unseen classes. Both strategies encourage the translator to explore the modes of unseen classes. Quantitative and qualitative results on different datasets validate the effectiveness of the proposed approach. Moreover, we demonstrate that our framework can be applied to fashion design tasks. (c) 2022 Elsevier B.V. All rights reserved.
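The two strategies named in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract gives no formulas, so the cosine-similarity relation loss, the uniform sampling over seen and unseen attribute vectors, and every function name below are assumptions chosen only to make the ideas concrete.

```python
import math
import random


def cosine(u, v):
    """Cosine similarity between two vectors given as lists of floats."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / max(norm_u * norm_v, 1e-12)


def similarity_matrix(rows):
    """Pairwise cosine similarities between all rows."""
    return [[cosine(u, v) for v in rows] for u in rows]


def relation_preserving_loss(attributes, visual_features):
    """Assumed loss: mean squared gap between the class-to-class similarity
    structure in attribute space and the one in visual feature space."""
    s_attr = similarity_matrix(attributes)
    s_vis = similarity_matrix(visual_features)
    n = len(attributes)
    total = sum(
        (s_attr[i][j] - s_vis[i][j]) ** 2 for i in range(n) for j in range(n)
    )
    return total / (n * n)


def sample_target_attributes(seen, unseen, n, rng):
    """Assumed form of attribute-space expansion: draw translation targets
    from seen AND unseen class attribute vectors, not from seen ones only."""
    pool = list(seen) + list(unseen)
    return [rng.choice(pool) for _ in range(n)]


# Hypothetical 2-D attribute vectors for two seen classes and one unseen class.
seen = [[1.0, 0.0], [0.0, 1.0]]
unseen = [[1.0, 1.0]]
classes = seen + unseen

# Visual features that keep the same relative geometry give a near-zero loss.
aligned = [[3.0 * x for x in row] for row in classes]
# Features collapsed onto one direction (a caricature of mode collapse) do not.
collapsed = [[1.0, 0.0] for _ in classes]

loss_aligned = relation_preserving_loss(classes, aligned)
loss_collapsed = relation_preserving_loss(classes, collapsed)
targets = sample_target_attributes(seen, unseen, 5, random.Random(0))
```

In this toy setting the collapsed features are penalized while the aligned ones are not, and the sampled targets can include the unseen class, which is the intuition behind both strategies as the abstract describes them.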
Pages: 10