Disentangled Image Attribute Editing in Latent Space via Mask-based Retention Loss

被引:0
|
作者
Ohaga, Shunya [1 ]
Togo, Ren [1 ]
Ogawa, Takahiro [1 ]
Haseyama, Miki [1 ]
机构
[1] Hokkaido Univ, Sapporo, Hokkaido, Japan
关键词
computer vision; image editing; image manipulation;
D O I
10.1145/3551626.3564949
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We propose an image attribute editing method with the mask-based retention loss. Although conventional image attribute editing methods can edit a particular attribute, they cannot retain non-editing attributes including unknown attributes before and after editing, which causes unexpected changes in the edited images. We solve this problem by dividing the pre- and post-edited images into the editing and non-editing regions and increasing the image similarity in the non-editing regions. In this paper, we introduce the novel mask-based retention loss to retain the non-editing regions. To compute the mask-based retention loss, we divide the images into the editing and non-editing regions by using a binary mask generated from the difference between the pre- and post-edited images. Experimental results show that our proposed method is qualitatively and quantitatively superior to state-of-the-art methods.
引用
收藏
页数:7
相关论文
共 50 条
  • [31] Latent shape image learning via disentangled representation for cross-sequence image registration and segmentation
    Jiong Wu
    Qi Yang
    Shuang Zhou
    International Journal of Computer Assisted Radiology and Surgery, 2023, 18 : 621 - 628
  • [32] Coherent diffraction lithography: Periodic patterns via mask-based interference lithography
    Fucetola, Corey P.
    Patel, Amil A.
    Moon, Euclid E.
    O'Reilly, Thomas B.
    Smith, Henry I.
    JOURNAL OF VACUUM SCIENCE & TECHNOLOGY B, 2009, 27 (06): : 2947 - 2950
  • [33] Delving StyleGAN Inversion for Image Editing: A Foundation Latent Space Viewpoint
    Liu, Hongyu
    Song, Yibing
    Chen, Qifeng
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 10072 - 10082
  • [34] Seamless Registration of Dual Camera Images Using Optimal Mask-Based Image Fusion
    Kim, Hyeonji
    Jo, Jieun
    Jang, Jinbeum
    Park, Sangwoo
    Paik, Joonki
    2016 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-ASIA (ICCE-ASIA), 2016,
  • [35] A study on combination of loss functions for effective mask-based speech enhancement in noisy environments
    Jung, Jaehee
    Kim, Wooil
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2021, 40 (03): : 234 - 240
  • [36] Unsupervised Underwater Image Enhancement Based on Disentangled Representations via Double-Order Contrastive Loss
    Yin, Jiankai
    Wang, Yan
    Guan, Bowen
    Zeng, Xianchao
    Guo, Lei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 15
  • [37] Robust unsupervised image categorization based on variational autoencoder with disentangled latent representations
    Yang, Lin
    Fan, Wentao
    Bouguila, Nizar
    KNOWLEDGE-BASED SYSTEMS, 2022, 246
  • [38] A Generative Image Steganography Based on Disentangled Attribute Feature Transformation and Invertible Mapping Rule
    Zhang, Xiang
    Han, Shenyan
    Huang, Wenbin
    Fu, Daoyong
    Computers, Materials and Continua, 2025, 83 (01): : 1149 - 1171
  • [39] Face image inpainting via latent features reconstruction and mask awareness
    Chen, Feng
    Zhang, Tongtong
    Liu, Heng
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 103
  • [40] The Euclidean Space is Evil: Hyperbolic Attribute Editing for Few-shot Image Generation
    Li, Lingxiao
    Zhang, Yi
    Wang, Shuhui
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 22657 - 22667