SwinTExCo: Exemplar-based video colorization using Swin Transformer

被引:0
|
作者
Tran, Duong Thanh [1 ]
Nguyen, Nguyen Doan Hieu [1 ]
Pham, Trung Thanh [1 ]
Tran, Phuong-Nam [2 ]
Vu, Thuy-Duong Thi [1 ]
Nguyen, Cuong Tuan [3 ]
Dang-Ngoc, Hanh [4 ]
Dang, Duc Ngoc Minh [1 ]
机构
[1] FPT Univ, Long Thanh My Ward, Dept Comp Fundamental, AiTA Lab, D1 St,Saigon Hi Tech Pk, Ho Chi Minh City 71216, Vietnam
[2] Kyung Hee Univ, Dept Comp Sci & Engn, Yongin 446701, South Korea
[3] Vietnamese German Univ, Thoi Hoa Ward, Fac Engn, Ring Rd 4,Quarter 4, Ben Cat 75000, Binh Duong, Vietnam
[4] Ho Chi Minh City Univ Technol HCMUT, Fac Elect & Elect Engn, VNU HCM, 268 Ly Thuong Kiet,Dist 10, Ho Chi Minh City 72506, Vietnam
关键词
Computer vision; Image colorization; Video colorization; Exemplar-based; Vision transformer; Swin transformer; IMAGE;
D O I
10.1016/j.eswa.2024.125437
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video colorization represents a compelling domain within the field of Computer Vision. The traditional approach in this field relies on Convolutional Neural Networks (CNNs) to extract features from each video frame and employs a recurrent network to learn information between video frames. While demonstrating considerable success in colorization, most traditional CNNs suffer from a limited receptive field size, capturing local information within a fixed-sized window. Consequently, they struggle to directly grasp long-range dependencies or pixel relationships that span large image or video frame areas. To address this limitation, recent advancements in the field have leveraged Vision Transformer (ViT) and their variants to enhance performance. This article introduces Swin Transformer Exemplar-based Video Colorization (SwinTExCo), an end-to-end model for the video colorization process that incorporates the Swin Transformer architecture as the backbone. The experimental results demonstrate that our proposed method outperforms many other state-ofthe-art methods in both quantitative and qualitative metrics. The achievements of this research have significant implications for the domain of documentary and history video restoration, contributing to the broader goal of preserving cultural heritage and facilitating a deeper understanding of historical events through enhanced audiovisual materials.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Exemplar-Based Video Inpainting Approach Using Temporal Relationship of Consecutive Frames
    Hung, Kuo-Lung
    Lai, Shih-che
    2017 IEEE 8TH INTERNATIONAL CONFERENCE ON AWARENESS SCIENCE AND TECHNOLOGY (ICAST), 2017, : 373 - 378
  • [22] ON EXEMPLAR-BASED EXEMPLAR REPRESENTATIONS - REPLY
    NOSOFSKY, RM
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-GENERAL, 1988, 117 (04) : 412 - 414
  • [23] Summary Transfer: Exemplar-based Subset Selection for Video Summarization
    Zhang, Ke
    Chao, Wei-Lun
    Sha, Fei
    Grauman, Kristen
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1059 - 1067
  • [24] Action recognition using exemplar-based embedding
    Weinland, Daniel
    Boyer, Edmond
    2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 3033 - 3039
  • [25] Towards exemplar-based polysemy
    Rais-Ghasem, M
    Corriveau, JP
    PROCEEDINGS OF THE TWENTY FIRST ANNUAL CONFERENCE OF THE COGNITIVE SCIENCE SOCIETY, 1999, : 566 - 571
  • [26] Swin-VEC: Video Swin Transformer-based GAN for video error concealment of VVC
    Zhang, Bing
    Ma, Ran
    Cao, Yu
    An, Ping
    VISUAL COMPUTER, 2024, 40 (10): : 7335 - 7347
  • [27] Exemplar-based inpainting using local binary patterns
    Voronin, V. V.
    Marchuk, V. I.
    Gapon, N. V.
    Sizyakin, R. A.
    Sherstobitov, A. I.
    Egiazarian, K. O.
    IMAGE PROCESSING: ALGORITHMS AND SYSTEMS XII, 2014, 9019
  • [28] Exemplar-based Image Inpainting using Structure Tesnor
    Liu Kui
    Tan Jieqing
    Su Benyue
    PROCEEDINGS OF THE 2013 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND ELECTRONICS INFORMATION (ICACSEI 2013), 2013, 41 : 619 - 623
  • [29] Exemplar-based image completion using global optimization
    Chen, Zhonggui
    Liu, Ligang
    Wang, Guojin
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2009, 46 (01): : 144 - 150
  • [30] Exemplar-Based Colour Constancy
    Joze, Hamid Reza Vaezi
    Drew, Mark S.
    PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,