Disentangling latent space better for few-shot image-to-image translation

被引:2
|
作者
Liu, Peng [1 ]
Wang, Yueyue [1 ]
Du, Angang [2 ]
Zhang, Liqiang [2 ]
Wei, Bin [4 ,5 ]
Gu, Zhaorui [2 ]
Wang, Xiaodong [3 ]
Zheng, Haiyong [2 ]
Li, Juan [6 ]
机构
[1] Ocean Univ China, Comp Ctr, Qingdao 266100, Peoples R China
[2] Ocean Univ China, Dept Elect Engn, Qingdao 266100, Peoples R China
[3] Ocean Univ China, Dept Comp Sci & Technol, Qingdao 266100, Peoples R China
[4] Qingdao Univ, Affiliated Hosp, Qingdao 266000, Peoples R China
[5] Shandong Key Lab Digital Med & Comp Assisted Surg, Qingdao 266000, Peoples R China
[6] Qingdao Agr Univ, Coll Mech & Elect Engn, Qingdao 266000, Peoples R China
基金
中国国家自然科学基金;
关键词
Image-to-image translation; Generative adversarial network; Latent space; Few-shot learning; Disentanglement;
D O I
10.1007/s13042-022-01552-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In an unpaired image-to-image translation, the main concept is to learn an underlying mapping between the source and target domains. Previous approaches required large numbers of data from both domains to learn this mapping. However, under a few-shot condition, that is, few-shot image-to-image translation, only one domain can meet the required number of data , and thus, the underlying mapping becomes ill-conditioned owing to the limited data as well as the imbalanced distribution of the two domains. We argue that a powerful model with a better disentangled representation of the latent space can better tackle the more challenging few-shot image-to-image translation . Motivated by this, under a partially-shared assumption, we propose a better disentanglement of the content and style latent space using a domain-specific style latent classifier and a domain-shared cross-content latent discriminator. Moreover, we design asymmetric weak/strong domain discriminators to achieve a better translation performance with limited data within the few-shot domain. Furthermore, our method can be easily embedded into any latent space disentangled model of an image-to-image translation for a few-shot setting. Subjective evaluation and objective evaluation results both show that compared with other state-of-the-art methods, the images synthesized by our method have higher fidelity while maintaining certain diversity.
引用
收藏
页码:419 / 427
页数:9
相关论文
共 50 条
  • [21] Few-Shot Learning for Image Denoising
    Jiang, Bo
    Lu, Yao
    Zhang, Bob
    Lu, Guangming
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (09) : 4741 - 4753
  • [22] Latent Filter Scaling for Multimodal Unsupervised Image-to-Image Translation
    Alharbi, Yazeed
    Smith, Neil
    Wonka, Peter
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 1458 - 1466
  • [23] Few-Shot Remote Sensing Aircraft Image Generation Algorithm Based on Feature Disentangling
    Liu, Muyun
    Bian, Chunjiang
    Chen, Hongzhen
    Computer Engineering and Applications, 2024, 60 (09) : 244 - 253
  • [24] Image-to-Image Translation With Disentangled Latent Vectors for Face Editing
    Dalva, Yusuf
    Pehlivan, Hamza
    Hatipoglu, Oyku Irmak
    Moran, Cansu
    Dundar, Aysegul
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (12) : 14777 - 14788
  • [25] Unpaired Image-to-Image Translation via Latent Energy Transport
    Zhao, Yang
    Chen, Changyou
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 16413 - 16422
  • [26] Towards Fine-Grained Control over Latent Space for Unpaired Image-to-Image Translation
    Luo, Lei
    Hsu, William
    Wang, Shangxian
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT III, 2021, 12893 : 408 - 420
  • [27] Unsupervised Few-Shot Image Classification by Learning Features into Clustering Space
    Li, Shuo
    Liu, Fang
    Hao, Zehua
    Zhao, Kaibo
    Jiao, Licheng
    COMPUTER VISION, ECCV 2022, PT XXXI, 2022, 13691 : 420 - 436
  • [28] Generating Adversarial Examples in One Shot With Image-to-Image Translation GAN
    Zhang, Weijia
    IEEE ACCESS, 2019, 7 : 151103 - 151119
  • [29] FeatEMD: Better Patch Sampling and Distance Metric for Few-Shot Image Classification
    Deng, Shisheng
    Liao, Dongping
    Gao, Xitong
    Zhao, Juanjuan
    Ye, Kejiang
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT I, 2023, 14254 : 183 - 194
  • [30] Interactive Few-Shot Learning: Limited Supervision, Better Medical Image Segmentation
    Feng, Ruiwei
    Zheng, Xiangshang
    Gao, Tianxiang
    Chen, Jintai
    Wang, Wenzhe
    Chen, Danny Z.
    Wu, Jian
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021, 40 (10) : 2575 - 2588