Image-to-Image Translation With Disentangled Latent Vectors for Face Editing

被引：14

作者：

Dalva, Yusuf ^{[1
]}

Pehlivan, Hamza ^{[2
]}

Hatipoglu, Oyku Irmak ^{[3
]}

Moran, Cansu ^{[4
]}

Dundar, Aysegul ^{[5
]}

机构：

[1] Virginia Tech, Dept Comp Sci, Blacksburg, VA 24061 USA

[2] Max Planck Inst, D-80539 Munich, Germany

[3] Swiss Fed Inst Technol, CH-8092 Zurich, Switzerland

[4] Tech Univ Munich, D-80333 Munich, Germany

[5] Bilkent Univ, Dept Comp Sci, TR-06800 Bilkent, Turkiye

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2023年 / 45卷 / 12期

关键词：

Image translation; generative adversarial net works; latent space manipulation; face attribute editing;

D O I：

10.1109/TPAMI.2023.3308102

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose an image-to-image translation framework for facial attribute editing with disentangled interpretable latent directions. Facial attribute editing task faces the challenges of targeted attribute editing with controllable strength and disentanglement in the representations of attributes to preserve the other attributes during edits. For this goal, inspired by the latent space factorization works of fixed pretrained GANs, we design the attribute editing by latent space factorization, and for each attribute, we learn a linear direction that is orthogonal to the others. We train these directions with orthogonality constraints and disentanglement losses. To project images to semantically organized latent spaces, we set an encoder-decoder architecture with attention-based skip connections. We extensively compare with previous image translation algorithms and editing with pretrained GAN works. Our extensive experiments show that our method significantly improves over the state-of-the-arts.

引用

页码：14777 / 14788

页数：12

共 50 条

[41] Unsupervised Image-to-Image Translation with Generative Prior
Yang, Shuai
Jiang, Liming
Liu, Ziwei
Loy, Chen Change
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18311 - 18320
[42] Leveraging Local Domains for Image-to-Image Translation
Dell'Eva, Anthony
Pizzati, Fabio
Bertozzi, Massimo
de Charette, Raoul
PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2022, : 179 - 189
[43] A Latent Transformer for Disentangled Face Editing in Images and Videos
Yao, Xu
Newson, Alasdair
Gousseau, Yann
Hellier, Pierre
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 13769 - 13778
[44] Unsupervised Image-to-Image Translation with Style Consistency
Lai, Binxin
Wang, Yuan-Gen
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VI, 2024, 14430 : 322 - 334
[45] Image-to-Image Translation with Conditional Adversarial Networks
Isola, Phillip
Zhu, Jun-Yan
Zhou, Tinghui
Efros, Alexei A.
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5967 - 5976
[46] Breaking the Dilemma of Medical Image-to-image Translation
Kong, Lingke
Lian, Chenyu
Huang, Detian
Li, Zhenjiang
Hu, Yanle
Zhou, Qichao
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[47] Random Reconstructed Unpaired Image-to-Image Translation
Zhang, Xiaoqin
Fan, Chenxiang
Xiao, Zhiheng
Zhao, Li
Chen, Huiling
Chang, Xiaojun
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (03) : 3144 - 3154
[48] Research on Image-to-Image Translation with Capsule Network
Ye, Jian
Chang, Qing
Jia, Xiaotian
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: THEORETICAL NEURAL COMPUTATION, PT I, 2019, 11727 : 141 - 151
[49] Edge Sensitive Unsupervised Image-to-Image Translation
Akkaya, Ibrahim Batuhan
Halici, Ugur
2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
[50] Zero-shot Image-to-Image Translation
Parmar, Gaurav
Singh, Krishna Kumar
Zhang, Richard
Li, Yijun
Lu, Jingwan
Zhu, Jun-Yan
PROCEEDINGS OF SIGGRAPH 2023 CONFERENCE PAPERS, SIGGRAPH 2023, 2023,

← 1 2 3 4 5 →