Hierarchical Vector-Quantized Variational Autoencoder and Vector Credibility Mechanism for High-Quality Image Inpainting

被引：0

作者：

Li, Cheng ^{[1
]}

Xu, Dan ^{[1
]}

Chen, Kuai ^{[2
]}

机构：

[1] Yunnan Univ, Sch Informat Sci & Engn, Kunming 650106, Peoples R China

[2] Yunnan Univ, Sch Govt, Kunming 650106, Peoples R China

来源：

ELECTRONICS | 2024年 / 13卷 / 10期

基金：

中国国家自然科学基金;

关键词：

image inpainting; VQ-VAE; vector credibility; codebook;

D O I：

10.3390/electronics13101852

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Image inpainting infers the missing areas of a corrupted image according to the information of the undamaged part. Many existing image inpainting methods can generate plausible inpainted results from damaged images with the fast-developed deep-learning technology. However, they still suffer from over-smoothed textures or textural distortion in the cases of complex textural details or large damaged areas. To restore textures at a fine-grained level, we propose an image inpainting method based on a hierarchical VQ-VAE with a vector credibility mechanism. It first trains the hierarchical VQ-VAE with ground truth images to update two codebooks and to obtain two corresponding vector collections containing information on ground truth images. The two vector collections are fed to a decoder to generate the corresponding high-fidelity outputs. An encoder then is trained with the corresponding damaged image. It generates vector collections approximating the ground truth by the help of the prior knowledge provided by the codebooks. After that, the two vector collections pass through the decoder from the hierarchical VQ-VAE to produce the inpainted results. In addition, we apply a vector credibility mechanism to promote vector collections from damaged images and approximate vector collections from ground truth images. To further improve the inpainting result, we apply a refinement network, which uses residual blocks with different dilation rates to acquire both global information and local textural details. Extensive experiments conducted on several datasets demonstrate that our method outperforms the state-of-the-art ones.

引用

页数：17

共 50 条

[21] VECTOR-QUANTIZED LATENT FLOWS FOR MEDICAL IMAGE SYNTHESIS AND OUT-OF-DISTRIBUTION DETECTION
Khader, Firas
Mueller-Franzes, Gustav
Arasteh, Soroosh Tayebi
Han, Tianyu
Kather, Jakob Nikolas
Stegmaier, Johannes
Nebelung, Sven
Truhn, Daniel
2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
[22] TVQVC: Transformer based Vector Quantized Variational Autoencoder with CTC loss for Voice Conversion
Chen, Ziyi
Zhang, Pengyuan
INTERSPEECH 2021, 2021, : 826 - 830
[23] Data augmentation for Gram-stain images based on Vector Quantized Variational AutoEncoder
Shwetha, V
Prasad, Keerthana
Mukhopadhyay, Chiranjay
Banerjee, Barnini
NEUROCOMPUTING, 2024, 600
[24] Vector Quantized Convolutional Autoencoder Network for LDCT Image Reconstruction with Hybrid Loss
Ramanathan S.
Ramasundaram M.
SN Computer Science, 5 (1)
[25] A study on the application of residual vector quantization for vector quantized-variational autoencoder-based foley sound generation model
Lee, Seokjin
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2024, 43 (02): : 243 - 252
[26] Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion
Ding, Shaojin
Gutierrez-Osuna, Ricardo
INTERSPEECH 2019, 2019, : 724 - 728
[27] Digital pathology whole slide image compression with Vector Quantized Variational Autoencoders
Keighley, Jason
de Kamps, Marc
Wright, Alexander
Treanor, Darren
MEDICAL IMAGING 2023, 2023, 12471
[28] Connectionist temporal classification loss for vector quantized variational autoencoder in zero-shot voice conversion
Kang, Xiao
Huang, Hao
Hu, Ying
Huang, Zhihua
DIGITAL SIGNAL PROCESSING, 2021, 116
[29] Enhancing Hierarchical Vector Quantized Autoencoders for Image Synthesis Through Multiple Decoders
Serez, Dario
Cristani, Marco
Murino, Vittorio
Del Bue, Alessio
Morerio, Pietro
IMAGE ANALYSIS AND PROCESSING, ICIAP 2023, PT II, 2023, 14234 : 393 - 405
[30] SSL-VQ: vector-quantized variational autoencoders for semi-supervised prediction of therapeutic targets across diverse diseases
Namba, Satoko
Li, Chen
Yuyama Otani, Noriko
Yamanishi, Yoshihiro
BIOINFORMATICS, 2025, 41 (02)

← 1 2 3 4 5 →