Zero-shot unsupervised image-to-image translation via exploiting semantic attributes

被引:2
|
作者
Chen, Yuanqi [1 ,2 ]
Yu, Xiaoming [1 ,2 ]
Liu, Shan [3 ]
Gao, Wei [1 ,2 ]
Li, Ge [1 ]
机构
[1] Peking Univ, Sch Elect & Comp Engn, Shenzhen Grad Sch, Shenzhen 518055, Peoples R China
[2] Peng Cheng Lab, Shenzhen 518055, Peoples R China
[3] Tencent Inc, Shenzhen 518000, Peoples R China
基金
国家重点研发计划;
关键词
Image -to-image translation; Image synthesis; Zero-shot learning; Generative adversarial networks; GENERATIVE ADVERSARIAL NETWORKS; GAN; CLASSIFICATION;
D O I
10.1016/j.imavis.2022.104489
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent studies have shown remarkable success in unsupervised image-to-image translation. However, if there is no access to enough images in target classes, learning a mapping from source classes to the target classes always suffers from mode collapse, especially the zero shot case, which limits the application of the existing methods. In this work, we propose a zero-shot unsupervised image-to-image translation framework to address this limita-tion, by effectively associating categories with their side information like attributes. To generalize the translator to previously unseen classes, we introduce two strategies for exploiting the semantic attribute space. First, we propose to preserve semantic relations to the visual space for effective guidance on where to map the input image. Second, expanding attribute space is introduced by utilizing attribute vectors of unseen classes, which al-leviates the mapping bias for unseen classes. Both of these strategies encourage the translator to explore the modes of unseen classes. Quantitative and qualitative results on different datasets validate the effectiveness of our proposed approach. Moreover, we demonstrate that our framework can be applied to fashion design task. (c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] GAIT: GRADIENT ADJUSTED UNSUPERVISED IMAGE-TO-IMAGE TRANSLATION
    Akkaya, Ibrahim Batuhan
    Halici, Ugur
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1591 - 1595
  • [32] The Surprising Effectiveness of Linear Unsupervised Image-to-Image Translation
    Richardson, Eitan
    Weiss, Yair
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7855 - 7861
  • [33] Zero-Shot Medical Image Translation via Frequency-Guided Diffusion Models
    Li, Yunxiang
    Shao, Hua-Chieh
    Liang, Xiao
    Chen, Liyuan
    Li, Ruiqi
    Jiang, Steve
    Wang, Jing
    Zhang, You
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2024, 43 (03) : 980 - 993
  • [34] Improving Shape Deformation in Unsupervised Image-to-Image Translation
    Gokaslan, Aaron
    Ramanujan, Vivek
    Ritchie, Daniel
    Kim, Kwang In
    Tompkin, James
    COMPUTER VISION - ECCV 2018, PT XII, 2018, 11216 : 662 - 678
  • [35] Memory-guided Unsupervised Image-to-image Translation
    Jeong, Somi
    Kim, Youngjung
    Lee, Eungbean
    Sohn, Kwanghoon
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 6554 - 6563
  • [36] Unsupervised Attention-guided Image-to-Image Translation
    Mejjati, Youssef A.
    Richardt, Christian
    Tompkin, James
    Cosker, Darren
    Kim, Kwang In
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [37] Unsupervised Multi-Modal Image Registration via Geometry Preserving Image-to-Image Translation
    Arar, Moab
    Ginger, Yiftach
    Danon, Dov
    Bermano, Amit H.
    Cohen-Or, Daniel
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 13407 - 13416
  • [38] Illustrated character face super-deformation via unsupervised image-to-image translation
    Sawada, Tomoya
    Katsurai, Marie
    MULTIMEDIA SYSTEMS, 2024, 30 (02)
  • [39] Illustrated character face super-deformation via unsupervised image-to-image translation
    Tomoya Sawada
    Marie Katsurai
    Multimedia Systems, 2024, 30
  • [40] General Image-to-Image Translation with One-Shot Image Guidance
    Cheng, Bin
    Liu, Zuhao
    Peng, Yunbo
    Lin, Yue
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 22679 - 22689