Video Colorization Based on Variational Autoencoder

Cited by: 1
Authors
Zhang, Guangzi [1]
Hong, Xiaolin [1]
Liu, Yan [1]
Qian, Yulin [1]
Cai, Xingquan [1]
Affiliation
[1] North China University of Technology, School of Information Science and Technology, Beijing 100144, People's Republic of China
Keywords
video colorization; temporal consistency; variational autoencoder
DOI
10.3390/electronics13122412
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
This paper introduces a variational autoencoder network designed for video colorization using reference images, addressing the challenge of colorizing black-and-white videos. Although recent techniques perform well in some scenarios, they often struggle with color inconsistencies and artifacts in videos that feature complex scenes and long durations. To tackle this, we propose a variational autoencoder framework that incorporates spatio-temporal information for efficient video colorization. To improve temporal consistency, we unify semantic correspondence with color propagation, allowing for simultaneous guidance in colorizing grayscale video frames. Additionally, the variational autoencoder learns spatio-temporal feature representations by mapping video frames into a latent space through an encoder network. The decoder network then transforms these latent features back into color images. Compared to traditional coloring methods, our approach accurately captures temporal relationships between video frames, providing precise colorization while ensuring video consistency. To further enhance video quality, we apply a specialized loss function that constrains the generated output, ensuring that the colorized video remains spatio-temporally consistent and natural. Experimental results demonstrate that our method significantly improves the video colorization process.
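The encoder-to-latent-space-to-decoder mapping described in the abstract can be illustrated with the two ingredients of a standard variational autoencoder: the reparameterized latent sample and the KL regularizer. The sketch below is a generic textbook formulation in plain Python, not the authors' network or their specialized loss. The function names, the squared-error reconstruction term, the `beta` weight, and the example encoder outputs are all assumptions made for illustration.

```python
import math
import random

def reparameterize(mu, log_var, rng):
    """Draw z = mu + sigma * eps with eps ~ N(0, 1), element-wise."""
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0)
            for m, lv in zip(mu, log_var)]

def kl_to_standard_normal(mu, log_var):
    """KL(N(mu, diag(exp(log_var))) || N(0, I)), summed over latent dimensions."""
    return 0.5 * sum(m * m + math.exp(lv) - 1.0 - lv
                     for m, lv in zip(mu, log_var))

def vae_loss(target, reconstruction, mu, log_var, beta=1.0):
    """Squared-error reconstruction term plus a beta-weighted KL term (assumed form)."""
    recon = sum((t - r) ** 2 for t, r in zip(target, reconstruction))
    return recon + beta * kl_to_standard_normal(mu, log_var)

# Hypothetical encoder outputs for one frame with a 2-dimensional latent code.
rng = random.Random(0)
mu, log_var = [0.0, 0.5], [0.0, -1.0]
z = reparameterize(mu, log_var, rng)  # latent sample that a decoder would map back to color
```

In a full model, `mu` and `log_var` would come from an encoder network applied to the grayscale frames, and a decoder would turn `z` into a color frame. Any temporal-consistency term, such as the one the abstract alludes to, would be added on top of this loss.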
Pages: 19