Everything is There in Latent Space: Attribute Editing and Attribute Style Manipulation by StyleGAN Latent Space Exploration

被引:8
|
作者
Parihar, Rishubh [1 ]
Dhiman, Ankit [1 ,2 ]
Karmali, Tejan [1 ]
Babu, R. Venkatesh [1 ]
机构
[1] Indian Inst Sci, Bangalore, Karnataka, India
[2] Samsung Res, Delhi, India
关键词
GANs; Image-Editing; Latent space; Image Manipulation;
D O I
10.1145/3503161.3547972
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Unconstrained Image generation with high realism is now possible using recent Generative Adversarial Networks (GANs). However, it is quite challenging to generate images with a given set of attributes. Recent methods use style-based GAN models to perform image editing by leveraging the semantic hierarchy present in the layers of the generator. We present Few-shot Latent-based Attribute Manipulation and Editing (FLAME), a simple yet effective framework to perform highly controlled image editing by latent space manipulation. Specifically, we estimate linear directions in the latent space (of a pre-trained StyleGAN) that controls semantic attributes in the generated image. In contrast to previous methods that either rely on large-scale attribute labeled datasets or attribute classifiers, FLAME uses minimal supervision of a few curated image pairs to estimate disentangled edit directions. FLAME can perform both individual and sequential edits with high precision on a diverse set of images while preserving identity. Further, we propose a novel task of Attribute Style Manipulation to generate diverse styles for attributes such as eyeglass and hair. We first encode a set of synthetic images of the same identity but having different attribute styles in the latent space to estimate an attribute style manifold. Sampling a new latent from this manifold will result in a new attribute style in the generated image. We propose a novel sampling method to sample latent from the manifold, enabling us to generate a diverse set of attribute styles beyond the styles present in the training set. FLAME can generate diverse attribute styles in a disentangled manner. We illustrate the superior performance of FLAME against previous image editing methods by extensive qualitative and quantitative comparisons. FLAME generalizes well on out-of-distribution images from art domain as well as on other datasets such as cars and churches. Project page.
引用
收藏
页码:1828 / 1836
页数:9
相关论文
共 50 条
  • [1] Improved Attribute Manipulation in the Latent Space of StyleGAN for Semantic Face Editing
    Rai, Aashish
    Ducher, Clara
    Cooperstock, Jeremy R.
    20TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2021), 2021, : 38 - 43
  • [2] Facial Attribute Editing by Latent Space Adversarial Variational Autoencoders
    Li, Defang
    Zhang, Min
    Chen, Weifu
    Feng, Guocan
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1337 - 1342
  • [3] Attribute-Based Facial Image Manipulation on Latent Space
    Lin, Chien-Hung
    Pan, Yi-Lun
    Wu, Ja-Ling
    2021 17TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS 2021), 2021,
  • [4] Latent attribute space tree classifiers
    He, Ping
    Xu, Xiao-Hua
    Chen, Ling
    Ruan Jian Xue Bao/Journal of Software, 2009, 20 (07): : 1735 - 1745
  • [5] Transforming the latent space of StyleGAN for real face editing
    Heyi Li
    Jinlong Liu
    Xinyu Zhang
    Yunzhi Bai
    Huayan Wang
    Klaus Mueller
    The Visual Computer, 2024, 40 : 3553 - 3568
  • [6] Transforming the latent space of StyleGAN for real face editing
    Li, Heyi
    Liu, Jinlong
    Zhang, Xinyu
    Bai, Yunzhi
    Wang, Huayan
    Mueller, Klaus
    VISUAL COMPUTER, 2024, 40 (05): : 3553 - 3568
  • [7] Disentangled Face Attribute Editing via Instance-Aware Latent Space Search
    Han, Yuxuan
    Yang, Jiaolong
    Fu, Ying
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 715 - 721
  • [8] CONTROLLING FACIAL ATTRIBUTE SYNTHESIS BY DISENTANGLING ATTRIBUTE FEATURE AXES IN LATENT SPACE
    Wei, Qiyu
    Zheng, Weihua
    Li, Yuxin
    Cheng, Zhongyao
    Zeng, Zeng
    Yang, Xulei
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 346 - 350
  • [9] Delving StyleGAN Inversion for Image Editing: A Foundation Latent Space Viewpoint
    Liu, Hongyu
    Song, Yibing
    Chen, Qifeng
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 10072 - 10082
  • [10] Language Detoxification with Attribute-Discriminative Latent Space
    Kwak, Jin Myung
    Kim, Minseon
    Hwang, Sung Ju
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 10149 - 10171