Image-to-Image Translation With Disentangled Latent Vectors for Face Editing

被引:14
|
作者
Dalva, Yusuf [1 ]
Pehlivan, Hamza [2 ]
Hatipoglu, Oyku Irmak [3 ]
Moran, Cansu [4 ]
Dundar, Aysegul [5 ]
机构
[1] Virginia Tech, Dept Comp Sci, Blacksburg, VA 24061 USA
[2] Max Planck Inst, D-80539 Munich, Germany
[3] Swiss Fed Inst Technol, CH-8092 Zurich, Switzerland
[4] Tech Univ Munich, D-80333 Munich, Germany
[5] Bilkent Univ, Dept Comp Sci, TR-06800 Bilkent, Turkiye
关键词
Image translation; generative adversarial net works; latent space manipulation; face attribute editing;
D O I
10.1109/TPAMI.2023.3308102
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose an image-to-image translation framework for facial attribute editing with disentangled interpretable latent directions. Facial attribute editing task faces the challenges of targeted attribute editing with controllable strength and disentanglement in the representations of attributes to preserve the other attributes during edits. For this goal, inspired by the latent space factorization works of fixed pretrained GANs, we design the attribute editing by latent space factorization, and for each attribute, we learn a linear direction that is orthogonal to the others. We train these directions with orthogonality constraints and disentanglement losses. To project images to semantically organized latent spaces, we set an encoder-decoder architecture with attention-based skip connections. We extensively compare with previous image translation algorithms and editing with pretrained GAN works. Our extensive experiments show that our method significantly improves over the state-of-the-arts.
引用
收藏
页码:14777 / 14788
页数:12
相关论文
共 50 条
  • [41] Unsupervised Image-to-Image Translation with Generative Prior
    Yang, Shuai
    Jiang, Liming
    Liu, Ziwei
    Loy, Chen Change
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18311 - 18320
  • [42] Leveraging Local Domains for Image-to-Image Translation
    Dell'Eva, Anthony
    Pizzati, Fabio
    Bertozzi, Massimo
    de Charette, Raoul
    PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2022, : 179 - 189
  • [43] A Latent Transformer for Disentangled Face Editing in Images and Videos
    Yao, Xu
    Newson, Alasdair
    Gousseau, Yann
    Hellier, Pierre
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 13769 - 13778
  • [44] Unsupervised Image-to-Image Translation with Style Consistency
    Lai, Binxin
    Wang, Yuan-Gen
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VI, 2024, 14430 : 322 - 334
  • [45] Image-to-Image Translation with Conditional Adversarial Networks
    Isola, Phillip
    Zhu, Jun-Yan
    Zhou, Tinghui
    Efros, Alexei A.
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5967 - 5976
  • [46] Breaking the Dilemma of Medical Image-to-image Translation
    Kong, Lingke
    Lian, Chenyu
    Huang, Detian
    Li, Zhenjiang
    Hu, Yanle
    Zhou, Qichao
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [47] Random Reconstructed Unpaired Image-to-Image Translation
    Zhang, Xiaoqin
    Fan, Chenxiang
    Xiao, Zhiheng
    Zhao, Li
    Chen, Huiling
    Chang, Xiaojun
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (03) : 3144 - 3154
  • [48] Research on Image-to-Image Translation with Capsule Network
    Ye, Jian
    Chang, Qing
    Jia, Xiaotian
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: THEORETICAL NEURAL COMPUTATION, PT I, 2019, 11727 : 141 - 151
  • [49] Edge Sensitive Unsupervised Image-to-Image Translation
    Akkaya, Ibrahim Batuhan
    Halici, Ugur
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [50] Zero-shot Image-to-Image Translation
    Parmar, Gaurav
    Singh, Krishna Kumar
    Zhang, Richard
    Li, Yijun
    Lu, Jingwan
    Zhu, Jun-Yan
    PROCEEDINGS OF SIGGRAPH 2023 CONFERENCE PAPERS, SIGGRAPH 2023, 2023,