Everything is There in Latent Space: Attribute Editing and Attribute Style Manipulation by StyleGAN Latent Space Exploration

被引:8
|
作者
Parihar, Rishubh [1 ]
Dhiman, Ankit [1 ,2 ]
Karmali, Tejan [1 ]
Babu, R. Venkatesh [1 ]
机构
[1] Indian Inst Sci, Bangalore, Karnataka, India
[2] Samsung Res, Delhi, India
关键词
GANs; Image-Editing; Latent space; Image Manipulation;
D O I
10.1145/3503161.3547972
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Unconstrained Image generation with high realism is now possible using recent Generative Adversarial Networks (GANs). However, it is quite challenging to generate images with a given set of attributes. Recent methods use style-based GAN models to perform image editing by leveraging the semantic hierarchy present in the layers of the generator. We present Few-shot Latent-based Attribute Manipulation and Editing (FLAME), a simple yet effective framework to perform highly controlled image editing by latent space manipulation. Specifically, we estimate linear directions in the latent space (of a pre-trained StyleGAN) that controls semantic attributes in the generated image. In contrast to previous methods that either rely on large-scale attribute labeled datasets or attribute classifiers, FLAME uses minimal supervision of a few curated image pairs to estimate disentangled edit directions. FLAME can perform both individual and sequential edits with high precision on a diverse set of images while preserving identity. Further, we propose a novel task of Attribute Style Manipulation to generate diverse styles for attributes such as eyeglass and hair. We first encode a set of synthetic images of the same identity but having different attribute styles in the latent space to estimate an attribute style manifold. Sampling a new latent from this manifold will result in a new attribute style in the generated image. We propose a novel sampling method to sample latent from the manifold, enabling us to generate a diverse set of attribute styles beyond the styles present in the training set. FLAME can generate diverse attribute styles in a disentangled manner. We illustrate the superior performance of FLAME against previous image editing methods by extensive qualitative and quantitative comparisons. FLAME generalizes well on out-of-distribution images from art domain as well as on other datasets such as cars and churches. Project page.
引用
收藏
页码:1828 / 1836
页数:9
相关论文
共 50 条
  • [21] Image2StyleGAN: How to Embed Images Into the StyleGAN Latent Space?
    Abdal, Rameen
    Qin, Yipeng
    Wonka, Peter
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 4431 - 4440
  • [22] Attribute Hierarchy Models in Cognitive Diagnosis: Identifiability of the Latent Attribute Space and Conditions for Completeness of the Q-Matrix
    Hans-Friedrich Köhn
    Chia-Yi Chiu
    Journal of Classification, 2019, 36 : 541 - 565
  • [23] Attribute Hierarchy Models in Cognitive Diagnosis: Identifiability of the Latent Attribute Space and Conditions for Completeness of the Q-Matrix
    Kohn, Hans-Friedrich
    Chiu, Chia-Yi
    JOURNAL OF CLASSIFICATION, 2019, 36 (03) : 541 - 565
  • [24] VideoMap: Supporting Video Editing Exploration, Brainstorming, and Prototyping in the Latent Space
    Lin, David Chuan-En
    Heilbron, Fabian Caba
    Lee, Joon-Young
    Wang, Oliver
    Martelaro, Nikolas
    PROCEEDINGS OF THE 16TH CONFERENCE ON CREATIVITY AND COGNITION, C&C 2024, 2024, : 311 - 327
  • [25] Facial Expression Video Synthesis from the StyleGAN Latent Space
    Zhang, Lei
    Pollett, Chris
    THIRTEENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2021), 2021, 11878
  • [26] Emotion Editing in Head Reenactment Videos using Latent Space Manipulation.
    Strizhkova, Valeriya
    Wang, Yaohui
    Anghelone, David
    Yang, Di
    Dantcheva, Antitza
    Bremond, Francois
    2021 16TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2021), 2021,
  • [27] GuidedStyle: Attribute knowledge guided style manipulation for semantic face editing
    Hou, Xianxu
    Zhang, Xiaokang
    Liang, Hanbang
    Shen, Linlin
    Lai, Zhihui
    Wan, Jun
    Neural Networks, 2022, 145 : 209 - 220
  • [28] Assessing the Dimensionality of the Latent Attribute Space in Cognitive Diagnosis Through Testing for Conditional Independence
    Lim, Youn Seon
    Drasgow, Fritz
    QUANTITATIVE PSYCHOLOGY, 2019, 265 : 183 - 194
  • [29] GuidedStyle: Attribute knowledge guided style manipulation for semantic face editing
    Hou, Xianxu
    Zhang, Xiaokang
    Liang, Hanbang
    Shen, Linlin
    Lai, Zhihui
    Wan, Jun
    NEURAL NETWORKS, 2022, 145 : 209 - 220
  • [30] Prostate Cancer Histology Synthesis Using StyleGAN Latent Space Annotation
    Daroach, Gagandeep B.
    Duenweg, Savannah R.
    Brehler, Michael
    Lowman, Allison K.
    Iczkowski, Kenneth A.
    Jacobsohn, Kenneth M.
    Yoder, Josiah A.
    LaViolette, Peter S.
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT II, 2022, 13432 : 398 - 408