Few-Shot Face Sketch-to-Photo Synthesis via Global-Local Asymmetric Image-to-Image Translation

Cited by: 1
Authors
Li, Yongkang [1 ,2 ]
Liang, Qifan [1 ,2 ]
Han, Zhen [1 ,2 ]
Mai, Wenjun [1 ,2 ]
Wang, Zhongyuan [1 ,2 ]
Affiliations
[1] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Wuhan, Peoples R China
[2] Wuhan Univ, Sch Comp Sci, Hubei Key Lab Multimedia & Network Commun Engn, Wuhan, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Face sketch-to-photo synthesis; image-to-image translation; global-local face fusion; MODEL;
DOI
10.1145/3672400
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Face sketch-to-photo synthesis, which is widely used in law enforcement and digital entertainment, can be achieved by Image-to-Image (I2I) translation. Traditional I2I translation algorithms usually treat the bidirectional translation between two image domains as two symmetric processes, so the two translation networks adopt the same structure. However, because face sketches are scarce and face photos are abundant, the sketch-to-photo and photo-to-sketch processes are asymmetric. To address this, we propose a few-shot face sketch-to-photo synthesis model based on asymmetric I2I translation, in which the sketch-to-photo process uses a feature-embedded generating network and the photo-to-sketch process uses a style transfer network. On this basis, we propose a three-stage asymmetric training strategy, triggered by style transfer, that optimizes the model by exploiting the fact that the style transfer network needs only a few face sketches for training. Additionally, we find that stylistic differences between the global and local sketch faces lead to inconsistencies between the global and local sketch-to-photo processes. Thus, the sketch-to-photo synthesis model adopts a dual branch for the global face and the local face to learn the specific transformation processes for global structure and local details. Finally, a global-local face fusion sub-network generates the high-quality synthetic face photo. Extensive experimental results demonstrate that, compared to SOTA methods, the proposed Global-Local Asymmetric (GLAS) I2I translation algorithm improves FSIM by at least 0.0126 and reduces LPIPS (alex), LPIPS (squeeze), and LPIPS (vgg) by at least 0.0610, 0.0883, and 0.0719, respectively.
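As a rough illustration of the global-local fusion idea mentioned in the abstract, the sketch below blends a local face crop into a global face image. It is not the authors' method: the paper uses a learned fusion sub-network, whereas this example substitutes a fixed feathered alpha mask. The function name `fuse_global_local` and the parameters `top`, `left`, and `feather` are hypothetical names chosen for this example.

```python
import numpy as np

def fuse_global_local(global_img, local_img, top, left, feather=4):
    """Blend a local crop into a global image with a soft-edged mask.

    Assumption: a fixed alpha mask stands in for the learned fusion
    sub-network described in the abstract. Images are H x W x C arrays.
    """
    h, w = local_img.shape[:2]
    # 1-based distance of each pixel to the nearest border of the crop.
    ys = np.minimum(np.arange(h), np.arange(h)[::-1]) + 1
    xs = np.minimum(np.arange(w), np.arange(w)[::-1]) + 1
    dist = np.minimum(ys[:, None], xs[None, :])
    # Weight ramps from 1/feather at the crop border up to 1 in the interior.
    alpha = np.clip(dist / float(feather), 0.0, 1.0)[..., None]
    fused = global_img.astype(np.float64).copy()
    region = fused[top:top + h, left:left + w]
    # Convex combination of the local crop and the underlying global pixels.
    fused[top:top + h, left:left + w] = alpha * local_img + (1.0 - alpha) * region
    return fused

# Toy usage: an all-zero "global" image and an all-one "local" crop.
global_face = np.zeros((16, 16, 3))
local_patch = np.ones((8, 8, 3))
fused = fuse_global_local(global_face, local_patch, top=4, left=4)
```

The soft mask avoids a visible seam where the local result is pasted back. A learned fusion network, as in the paper, can instead adapt the blending to image content.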
Pages: 24
Related papers
50 in total
  • [1] Few-Shot Unsupervised Image-to-Image Translation
    Liu, Ming-Yu
    Huang, Xun
    Mallya, Arun
    Karras, Tero
    Aila, Timo
    Lehtinen, Jaakko
    Kautz, Jan
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 10550 - 10559
  • [2] Disentangling latent space better for few-shot image-to-image translation
    Liu, Peng
    Wang, Yueyue
    Du, Angang
    Zhang, Liqiang
    Wei, Bin
    Gu, Zhaorui
    Wang, Xiaodong
    Zheng, Haiyong
    Li, Juan
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (02) : 419 - 427
  • [3] Semi-supervised Learning for Few-shot Image-to-Image Translation
    Wang, Yaxing
    Khan, Salman
    Gonzalez-Garcia, Abel
    van de Weijer, Joost
    Khan, Fahad Shahbaz
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4452 - 4461
  • [5] Quality Guided Sketch-to-Photo Image Synthesis
    Osahor, Uche
    Kazemi, Hadi
    Dabouei, Ali
    Nasrabadi, Nasser
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 3575 - 3584
  • [6] Text-Guided Sketch-to-Photo Image Synthesis
    Osahor, Uche
    Nasrabadi, Nasser M.
    IEEE ACCESS, 2022, 10 : 98278 - 98289
  • [7] OA-FSUI2IT: A Novel Few-Shot Cross Domain Object Detection Framework with Object-Aware Few-Shot Unsupervised Image-to-Image Translation
    Zhao, Lifan
    Meng, Yunlong
    Xu, Lin
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 3426 - 3435
  • [8] Global-Local Interplay in Semantic Alignment for Few-Shot Learning
    Hao, Fusheng
    He, Fengxiang
    Cheng, Jun
    Tao, Dacheng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (07) : 4351 - 4363
  • [9] Global-local prototype-based few-shot learning for cross-domain hyperspectral image classification
    Tang, Haojin
    Wu, Yuelin
    Li, Hongyi
    Tang, Dong
    Yang, Xiaofei
    Xie, Weixin
    KNOWLEDGE-BASED SYSTEMS, 2025, 314
  • [10] Few-Shot Text Classification with Global-Local Feature Information
    Wang, Depei
    Wang, Zhuowei
    Cheng, Lianglun
    Zhang, Weiwen
    SENSORS, 2022, 22 (12)