Disentangled Image Attribute Editing in Latent Space via Mask-based Retention Loss

被引：0

作者：

Ohaga, Shunya ^{[1
]}

Togo, Ren ^{[1
]}

Ogawa, Takahiro ^{[1
]}

Haseyama, Miki ^{[1
]}

机构：

[1] Hokkaido Univ, Sapporo, Hokkaido, Japan

来源：

PROCEEDINGS OF THE 4TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA IN ASIA, MMASIA 2022 | 2022年

关键词：

computer vision; image editing; image manipulation;

D O I：

10.1145/3551626.3564949

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

We propose an image attribute editing method with the mask-based retention loss. Although conventional image attribute editing methods can edit a particular attribute, they cannot retain non-editing attributes including unknown attributes before and after editing, which causes unexpected changes in the edited images. We solve this problem by dividing the pre- and post-edited images into the editing and non-editing regions and increasing the image similarity in the non-editing regions. In this paper, we introduce the novel mask-based retention loss to retain the non-editing regions. To compute the mask-based retention loss, we divide the images into the editing and non-editing regions by using a binary mask generated from the difference between the pre- and post-edited images. Experimental results show that our proposed method is qualitatively and quantitatively superior to state-of-the-art methods.

引用

页数：7

共 50 条

[21] Wasserstein loss for Semantic Editing in the Latent Space of GANs
Doubinsky, Perla
Audebert, Nicolas
Crucianu, Michel
Le Borgne, Herve
20TH INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING, CBMI 2023, 2023, : 55 - 60
[22] Mask-based Style-Controlled Image Synthesis Using a Mask Style Encoder
Cho, Jaehyeong
Shimoda, Wataru
Yanai, Keiji
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 5176 - 5183
[23] OPTIMIZING LATENT SPACE DIRECTIONS FOR GAN-BASED LOCAL IMAGE EDITING
Pajouheshgar, Ehsan
Zhang, Tong
Susstrunk, Sabine
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1740 - 1744
[24] DeepLIR: Attention-based approach for Mask-Based Lensless Image Reconstruction
Poudel, Arpan
Nakarmi, Ukash
2024 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS, WACVW 2024, 2024, : 431 - 439
[25] Using Mask-Based Enhancement and Feature Aggregation for Single Image Deraining
Qin, Shengdi
Zhang, Shunli
Zhang, Yu
IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 828 - 832
[26] Controllable Anime Image Editing via Probability of Attribute Tags
Song, Zhenghao
Mo, Haoran
Gao, Chengying
COMPUTER GRAPHICS FORUM, 2024, 43 (07)
[27] Multichannel Loss Function for Supervised Speech Source Separation by Mask-based Beamforming
Masuyama, Yoshiki
Togami, Masahito
Komatsu, Tatsuya
INTERSPEECH 2019, 2019, : 2708 - 2712
[28] Controllable Unsupervised Text Attribute Transfer via Editing Entangled Latent Representation
Wang, Ke
Hua, Hang
Wan, Xiaojun
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[29] Detecting Silicone Mask-Based Presentation Attack via Deep Dictionary Learning
Manjani, Ishan
Tariyal, Snigdha
Vatsa, Mayank
Singh, Richa
Majumdar, Angshul
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2017, 12 (07) : 1713 - 1723
[30] Latent shape image learning via disentangled representation for cross-sequence image registration and segmentation
Wu, Jiong
Yang, Qi
Zhou, Shuang
INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2023, 18 (04) : 621 - 628

← 1 2 3 4 5 →