Hierarchical Vector-Quantized Variational Autoencoder and Vector Credibility Mechanism for High-Quality Image Inpainting

被引:0
|
作者
Li, Cheng [1 ]
Xu, Dan [1 ]
Chen, Kuai [2 ]
机构
[1] Yunnan Univ, Sch Informat Sci & Engn, Kunming 650106, Peoples R China
[2] Yunnan Univ, Sch Govt, Kunming 650106, Peoples R China
基金
中国国家自然科学基金;
关键词
image inpainting; VQ-VAE; vector credibility; codebook;
D O I
10.3390/electronics13101852
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Image inpainting infers the missing areas of a corrupted image according to the information of the undamaged part. Many existing image inpainting methods can generate plausible inpainted results from damaged images with the fast-developed deep-learning technology. However, they still suffer from over-smoothed textures or textural distortion in the cases of complex textural details or large damaged areas. To restore textures at a fine-grained level, we propose an image inpainting method based on a hierarchical VQ-VAE with a vector credibility mechanism. It first trains the hierarchical VQ-VAE with ground truth images to update two codebooks and to obtain two corresponding vector collections containing information on ground truth images. The two vector collections are fed to a decoder to generate the corresponding high-fidelity outputs. An encoder then is trained with the corresponding damaged image. It generates vector collections approximating the ground truth by the help of the prior knowledge provided by the codebooks. After that, the two vector collections pass through the decoder from the hierarchical VQ-VAE to produce the inpainted results. In addition, we apply a vector credibility mechanism to promote vector collections from damaged images and approximate vector collections from ground truth images. To further improve the inpainting result, we apply a refinement network, which uses residual blocks with different dilation rates to acquire both global information and local textural details. Extensive experiments conducted on several datasets demonstrate that our method outperforms the state-of-the-art ones.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] VECTOR-QUANTIZED LATENT FLOWS FOR MEDICAL IMAGE SYNTHESIS AND OUT-OF-DISTRIBUTION DETECTION
    Khader, Firas
    Mueller-Franzes, Gustav
    Arasteh, Soroosh Tayebi
    Han, Tianyu
    Kather, Jakob Nikolas
    Stegmaier, Johannes
    Nebelung, Sven
    Truhn, Daniel
    2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
  • [22] TVQVC: Transformer based Vector Quantized Variational Autoencoder with CTC loss for Voice Conversion
    Chen, Ziyi
    Zhang, Pengyuan
    INTERSPEECH 2021, 2021, : 826 - 830
  • [23] Data augmentation for Gram-stain images based on Vector Quantized Variational AutoEncoder
    Shwetha, V
    Prasad, Keerthana
    Mukhopadhyay, Chiranjay
    Banerjee, Barnini
    NEUROCOMPUTING, 2024, 600
  • [24] Vector Quantized Convolutional Autoencoder Network for LDCT Image Reconstruction with Hybrid Loss
    Ramanathan S.
    Ramasundaram M.
    SN Computer Science, 5 (1)
  • [25] A study on the application of residual vector quantization for vector quantized-variational autoencoder-based foley sound generation model
    Lee, Seokjin
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2024, 43 (02): : 243 - 252
  • [26] Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion
    Ding, Shaojin
    Gutierrez-Osuna, Ricardo
    INTERSPEECH 2019, 2019, : 724 - 728
  • [27] Digital pathology whole slide image compression with Vector Quantized Variational Autoencoders
    Keighley, Jason
    de Kamps, Marc
    Wright, Alexander
    Treanor, Darren
    MEDICAL IMAGING 2023, 2023, 12471
  • [28] Connectionist temporal classification loss for vector quantized variational autoencoder in zero-shot voice conversion
    Kang, Xiao
    Huang, Hao
    Hu, Ying
    Huang, Zhihua
    DIGITAL SIGNAL PROCESSING, 2021, 116
  • [29] Enhancing Hierarchical Vector Quantized Autoencoders for Image Synthesis Through Multiple Decoders
    Serez, Dario
    Cristani, Marco
    Murino, Vittorio
    Del Bue, Alessio
    Morerio, Pietro
    IMAGE ANALYSIS AND PROCESSING, ICIAP 2023, PT II, 2023, 14234 : 393 - 405
  • [30] SSL-VQ: vector-quantized variational autoencoders for semi-supervised prediction of therapeutic targets across diverse diseases
    Namba, Satoko
    Li, Chen
    Yuyama Otani, Noriko
    Yamanishi, Yoshihiro
    BIOINFORMATICS, 2025, 41 (02)