Disentangled Image Attribute Editing in Latent Space via Mask-based Retention Loss

被引:0
|
作者
Ohaga, Shunya [1 ]
Togo, Ren [1 ]
Ogawa, Takahiro [1 ]
Haseyama, Miki [1 ]
机构
[1] Hokkaido Univ, Sapporo, Hokkaido, Japan
关键词
computer vision; image editing; image manipulation;
D O I
10.1145/3551626.3564949
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We propose an image attribute editing method with the mask-based retention loss. Although conventional image attribute editing methods can edit a particular attribute, they cannot retain non-editing attributes including unknown attributes before and after editing, which causes unexpected changes in the edited images. We solve this problem by dividing the pre- and post-edited images into the editing and non-editing regions and increasing the image similarity in the non-editing regions. In this paper, we introduce the novel mask-based retention loss to retain the non-editing regions. To compute the mask-based retention loss, we divide the images into the editing and non-editing regions by using a binary mask generated from the difference between the pre- and post-edited images. Experimental results show that our proposed method is qualitatively and quantitatively superior to state-of-the-art methods.
引用
收藏
页数:7
相关论文
共 50 条
  • [21] Wasserstein loss for Semantic Editing in the Latent Space of GANs
    Doubinsky, Perla
    Audebert, Nicolas
    Crucianu, Michel
    Le Borgne, Herve
    20TH INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING, CBMI 2023, 2023, : 55 - 60
  • [22] Mask-based Style-Controlled Image Synthesis Using a Mask Style Encoder
    Cho, Jaehyeong
    Shimoda, Wataru
    Yanai, Keiji
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 5176 - 5183
  • [23] OPTIMIZING LATENT SPACE DIRECTIONS FOR GAN-BASED LOCAL IMAGE EDITING
    Pajouheshgar, Ehsan
    Zhang, Tong
    Susstrunk, Sabine
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1740 - 1744
  • [24] DeepLIR: Attention-based approach for Mask-Based Lensless Image Reconstruction
    Poudel, Arpan
    Nakarmi, Ukash
    2024 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS, WACVW 2024, 2024, : 431 - 439
  • [25] Using Mask-Based Enhancement and Feature Aggregation for Single Image Deraining
    Qin, Shengdi
    Zhang, Shunli
    Zhang, Yu
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 828 - 832
  • [26] Controllable Anime Image Editing via Probability of Attribute Tags
    Song, Zhenghao
    Mo, Haoran
    Gao, Chengying
    COMPUTER GRAPHICS FORUM, 2024, 43 (07)
  • [27] Multichannel Loss Function for Supervised Speech Source Separation by Mask-based Beamforming
    Masuyama, Yoshiki
    Togami, Masahito
    Komatsu, Tatsuya
    INTERSPEECH 2019, 2019, : 2708 - 2712
  • [28] Controllable Unsupervised Text Attribute Transfer via Editing Entangled Latent Representation
    Wang, Ke
    Hua, Hang
    Wan, Xiaojun
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [29] Detecting Silicone Mask-Based Presentation Attack via Deep Dictionary Learning
    Manjani, Ishan
    Tariyal, Snigdha
    Vatsa, Mayank
    Singh, Richa
    Majumdar, Angshul
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2017, 12 (07) : 1713 - 1723
  • [30] Latent shape image learning via disentangled representation for cross-sequence image registration and segmentation
    Wu, Jiong
    Yang, Qi
    Zhou, Shuang
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2023, 18 (04) : 621 - 628