Disentangled Image Attribute Editing in Latent Space via Mask-based Retention Loss

被引：0

作者：

Ohaga, Shunya ^{[1
]}

Togo, Ren ^{[1
]}

Ogawa, Takahiro ^{[1
]}

Haseyama, Miki ^{[1
]}

机构：

[1] Hokkaido Univ, Sapporo, Hokkaido, Japan

来源：

PROCEEDINGS OF THE 4TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA IN ASIA, MMASIA 2022 | 2022年

关键词：

computer vision; image editing; image manipulation;

D O I：

10.1145/3551626.3564949

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

We propose an image attribute editing method with the mask-based retention loss. Although conventional image attribute editing methods can edit a particular attribute, they cannot retain non-editing attributes including unknown attributes before and after editing, which causes unexpected changes in the edited images. We solve this problem by dividing the pre- and post-edited images into the editing and non-editing regions and increasing the image similarity in the non-editing regions. In this paper, we introduce the novel mask-based retention loss to retain the non-editing regions. To compute the mask-based retention loss, we divide the images into the editing and non-editing regions by using a binary mask generated from the difference between the pre- and post-edited images. Experimental results show that our proposed method is qualitatively and quantitatively superior to state-of-the-art methods.

引用

页数：7

共 50 条

[31] Latent shape image learning via disentangled representation for cross-sequence image registration and segmentation
Jiong Wu
Qi Yang
Shuang Zhou
International Journal of Computer Assisted Radiology and Surgery, 2023, 18 : 621 - 628
[32] Coherent diffraction lithography: Periodic patterns via mask-based interference lithography
Fucetola, Corey P.
Patel, Amil A.
Moon, Euclid E.
O'Reilly, Thomas B.
Smith, Henry I.
JOURNAL OF VACUUM SCIENCE & TECHNOLOGY B, 2009, 27 (06): : 2947 - 2950
[33] Delving StyleGAN Inversion for Image Editing: A Foundation Latent Space Viewpoint
Liu, Hongyu
Song, Yibing
Chen, Qifeng
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 10072 - 10082
[34] Seamless Registration of Dual Camera Images Using Optimal Mask-Based Image Fusion
Kim, Hyeonji
Jo, Jieun
Jang, Jinbeum
Park, Sangwoo
Paik, Joonki
2016 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-ASIA (ICCE-ASIA), 2016,
[35] A study on combination of loss functions for effective mask-based speech enhancement in noisy environments
Jung, Jaehee
Kim, Wooil
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2021, 40 (03): : 234 - 240
[36] Unsupervised Underwater Image Enhancement Based on Disentangled Representations via Double-Order Contrastive Loss
Yin, Jiankai
Wang, Yan
Guan, Bowen
Zeng, Xianchao
Guo, Lei
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 15
[37] Robust unsupervised image categorization based on variational autoencoder with disentangled latent representations
Yang, Lin
Fan, Wentao
Bouguila, Nizar
KNOWLEDGE-BASED SYSTEMS, 2022, 246
[38] A Generative Image Steganography Based on Disentangled Attribute Feature Transformation and Invertible Mapping Rule
Zhang, Xiang
Han, Shenyan
Huang, Wenbin
Fu, Daoyong
Computers, Materials and Continua, 2025, 83 (01): : 1149 - 1171
[39] Face image inpainting via latent features reconstruction and mask awareness
Chen, Feng
Zhang, Tongtong
Liu, Heng
COMPUTERS & ELECTRICAL ENGINEERING, 2022, 103
[40] The Euclidean Space is Evil: Hyperbolic Attribute Editing for Few-shot Image Generation
Li, Lingxiao
Zhang, Yi
Wang, Shuhui
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 22657 - 22667

← 1 2 3 4 5 →