Disentangling latent space better for few-shot image-to-image translation

被引:2
|
作者
Liu, Peng [1 ]
Wang, Yueyue [1 ]
Du, Angang [2 ]
Zhang, Liqiang [2 ]
Wei, Bin [4 ,5 ]
Gu, Zhaorui [2 ]
Wang, Xiaodong [3 ]
Zheng, Haiyong [2 ]
Li, Juan [6 ]
机构
[1] Ocean Univ China, Comp Ctr, Qingdao 266100, Peoples R China
[2] Ocean Univ China, Dept Elect Engn, Qingdao 266100, Peoples R China
[3] Ocean Univ China, Dept Comp Sci & Technol, Qingdao 266100, Peoples R China
[4] Qingdao Univ, Affiliated Hosp, Qingdao 266000, Peoples R China
[5] Shandong Key Lab Digital Med & Comp Assisted Surg, Qingdao 266000, Peoples R China
[6] Qingdao Agr Univ, Coll Mech & Elect Engn, Qingdao 266000, Peoples R China
基金
中国国家自然科学基金;
关键词
Image-to-image translation; Generative adversarial network; Latent space; Few-shot learning; Disentanglement;
D O I
10.1007/s13042-022-01552-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In an unpaired image-to-image translation, the main concept is to learn an underlying mapping between the source and target domains. Previous approaches required large numbers of data from both domains to learn this mapping. However, under a few-shot condition, that is, few-shot image-to-image translation, only one domain can meet the required number of data , and thus, the underlying mapping becomes ill-conditioned owing to the limited data as well as the imbalanced distribution of the two domains. We argue that a powerful model with a better disentangled representation of the latent space can better tackle the more challenging few-shot image-to-image translation . Motivated by this, under a partially-shared assumption, we propose a better disentanglement of the content and style latent space using a domain-specific style latent classifier and a domain-shared cross-content latent discriminator. Moreover, we design asymmetric weak/strong domain discriminators to achieve a better translation performance with limited data within the few-shot domain. Furthermore, our method can be easily embedded into any latent space disentangled model of an image-to-image translation for a few-shot setting. Subjective evaluation and objective evaluation results both show that compared with other state-of-the-art methods, the images synthesized by our method have higher fidelity while maintaining certain diversity.
引用
收藏
页码:419 / 427
页数:9
相关论文
共 50 条
  • [31] Few-Shot Image-to-Semantics Translation for Policy Transfer in Reinforcement Learning
    Sato, Rei
    Fukuchi, Kazuto
    Sakuma, Jun
    Akimoto, Youhei
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [32] Rejection Sampling IMLE: Designing Priors for Better Few-Shot Image Synthesis
    Vashist, Chirag
    Peng, Shichong
    Li, Ke
    COMPUTER VISION - ECCV 2024, PT XXI, 2025, 15079 : 441 - 456
  • [33] A Closer Look at Few-shot Image Generation
    Zhao, Yunqing
    Ding, Henghui
    Huang, Houjing
    Cheung, Ngai-Man
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 9130 - 9140
  • [34] Decision fusion for few-shot image classification
    Tianhao Yuan
    Weifeng Liu
    Fei Yan
    Baodi Liu
    International Journal of Multimedia Information Retrieval, 2023, 12
  • [35] Semantic Prompt for Few-Shot Image Recognition
    Chen, Wentao
    Si, Chenyang
    Zhang, Zhang
    Wang, Liang
    Wang, Zilei
    Tan, Tieniu
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 23581 - 23591
  • [36] An Image Enhancement Method for Few-shot Classification
    Wu, Benze
    Wu, Yirui
    Wan, Shaohua
    2021 IEEE 19TH INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (EUC 2021), 2021, : 159 - 165
  • [37] Decision fusion for few-shot image classification
    Yuan, Tianhao
    Liu, Weifeng
    Yan, Fei
    Liu, Baodi
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2023, 12 (02)
  • [38] A Image Enhancement Method for Few-shot Classification
    Wu, Benze
    Wu, Yirui
    Wan, Shaohua
    2021 IEEE 19TH INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (EUC 2021), 2021, : 201 - 207
  • [39] Multiscale attention for few-shot image classification
    Zhou, Tong
    Dong, Changyin
    Song, Junshu
    Zhang, Zhiqiang
    Wang, Zhen
    Chang, Bo
    Chen, Dechun
    COMPUTATIONAL INTELLIGENCE, 2024, 40 (02)
  • [40] Few-Shot Microscopy Image Cell Segmentation
    Dawoud, Youssef
    Hornauer, Julia
    Carneiro, Gustavo
    Belagiannis, Vasileios
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2020, PT V, 2021, 12461 : 139 - 154