Video Colorization Based on Variational Autoencoder

被引:1
|
作者
Zhang, Guangzi [1 ]
Hong, Xiaolin [1 ]
Liu, Yan [1 ]
Qian, Yulin [1 ]
Cai, Xingquan [1 ]
机构
[1] North China Univ Technol, Sch Informat Sci & Technol, Beijing 100144, Peoples R China
关键词
video colorization; temporal consistency; variational autoencoder;
D O I
10.3390/electronics13122412
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper introduces a variational autoencoder network designed for video colorization using reference images, addressing the challenge of colorizing black-and-white videos. Although recent techniques perform well in some scenarios, they often struggle with color inconsistencies and artifacts in videos that feature complex scenes and long durations. To tackle this, we propose a variational autoencoder framework that incorporates spatio-temporal information for efficient video colorization. To improve temporal consistency, we unify semantic correspondence with color propagation, allowing for simultaneous guidance in colorizing grayscale video frames. Additionally, the variational autoencoder learns spatio-temporal feature representations by mapping video frames into a latent space through an encoder network. The decoder network then transforms these latent features back into color images. Compared to traditional coloring methods, our approach accurately captures temporal relationships between video frames, providing precise colorization while ensuring video consistency. To further enhance video quality, we apply a specialized loss function that constrains the generated output, ensuring that the colorized video remains spatio-temporally consistent and natural. Experimental results demonstrate that our method significantly improves the video colorization process.
引用
收藏
页数:19
相关论文
共 50 条
  • [31] Longitudinal Variational Autoencoder
    Ramchandran, Siddharth
    Tikhonov, Gleb
    Kujanpaa, Kalle
    Koskinen, Miika
    Lahdesmaki, Harri
    24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
  • [32] The Variational InfoMax AutoEncoder
    Crescimanna, Vincenzo
    Graham, Bruce
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [33] LatentColorization: Latent Diffusion-Based Speaker Video Colorization
    Ward, Rory
    Bigioi, Dan
    Basak, Shubhajit
    Breslin, John G.
    Corcoran, Peter
    IEEE ACCESS, 2024, 12 : 81105 - 81121
  • [34] Generative autoencoder to prevent overregularization of variational autoencoder
    Ko, YoungMin
    Ko, SunWoo
    Kim, YoungSoo
    ETRI JOURNAL, 2025, 47 (01) : 80 - 89
  • [35] Adversarial Attention-Based Variational Graph Autoencoder
    Weng, Ziqiang
    Zhang, Weiyu
    Dou, Wei
    IEEE ACCESS, 2020, 8 : 152637 - 152645
  • [36] Variational autoencoder for prosody-based speaker recognition
    Ben Alex, Starlet
    Mary, Leena
    ETRI JOURNAL, 2023, 45 (04) : 678 - 689
  • [37] A Transformer-Based Variational Autoencoder for Sentence Generation
    Liu, Danyang
    Liu, Gongshen
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [38] Port Container Throughput Prediction Based on Variational AutoEncoder
    Li, Jingze
    Shi, Shengmin
    Chen, Tongbing
    Tian, Yu
    Ding, Yihua
    Xiao, Yiyong
    Sun, Weiwei
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2022, 13246 LNCS : 590 - 605
  • [39] Port Container Throughput Prediction Based on Variational AutoEncoder
    Li, Jingze
    Shi, Shengmin
    Chen, Tongbing
    Tian, Yu
    Ding, Yihua
    Xiao, Yiyong
    Sun, Weiwei
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2022, PT II, 2022, : 590 - 605
  • [40] Text feature extraction based on stacked variational autoencoder
    Che, Lei
    Yang, Xiaoping
    Wang, Liang
    MICROPROCESSORS AND MICROSYSTEMS, 2020, 76 (76)