FLOW-GUIDED TRANSFORMER FOR VIDEO COLORIZATION

Cited by: 0
Authors
Zhai, Yan [1]
Tao, Zhulin [1]
Dai, Longquan [2]
Wang, He [2]
Huang, Xianglin [1]
Yang, Lifang [1]
Affiliations
[1] Communication University of China, Beijing, People's Republic of China
[2] Nanjing University of Science and Technology, Nanjing, People's Republic of China
Funding
National Natural Science Foundation of China;
Keywords
Video Colorization; Flow-Guided Attention; Transformer;
DOI
10.1109/ICIP49359.2023.10223177
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Video colorization aims to add color to black-and-white films. However, accurately propagating color information across an entire video clip is challenging. In this paper, we propose the Flow-Guided Transformer for Video Colorization (FGTVC), which consists of a Global Motion Aggregation (GMA) module, residual modules, and Flow-Guided Attention blocks (FGAB) built on an encoder and a decoder. For each video patch, the model exploits information from highly similar neighboring patches during colorization. Specifically, we employ a Transformer to capture long-distance dependencies between frames and to learn non-local self-similarity within a frame. To overcome the shortcomings of previous optical flow-based methods, FGAB uses optical flow to guide the sampling of elements from spatio-temporally adjacent frames when computing self-attention. Experiments show that the proposed FGTVC outperforms state-of-the-art methods. In addition, comprehensive results demonstrate the superiority of our framework in real-world video colorization tasks.
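The abstract describes attention whose sampling locations in adjacent frames are guided by optical flow. The sketch below is one possible reading of that idea and is not the authors' implementation: the function name `flow_guided_attention`, the `radius` parameter, the use of keys as values, the single head without learned projections, and the nearest-pixel rounding of the flow offset are all assumptions made for illustration.

```python
import math


def flow_guided_attention(query, neighbor, pos, flow, radius=1):
    """Attend from one query vector to a flow-shifted window in a neighbor frame.

    query:    list of C floats (feature at `pos` in the current frame)
    neighbor: H x W x C nested lists (features of an adjacent frame)
    pos:      (row, col) of the query in the current frame
    flow:     (d_row, d_col) hypothetical optical-flow offset toward the neighbor
    radius:   half-size of the square sampling window (assumed parameter)
    """
    height, width = len(neighbor), len(neighbor[0])

    # The flow offset moves the window centre to where the content is
    # expected to appear in the neighbor frame (nearest-pixel rounding).
    centre_r = round(pos[0] + flow[0])
    centre_c = round(pos[1] + flow[1])

    # Collect key vectors from the window, clamping indices to the frame.
    keys = []
    for dr in range(-radius, radius + 1):
        for dc in range(-radius, radius + 1):
            r = min(max(centre_r + dr, 0), height - 1)
            c = min(max(centre_c + dc, 0), width - 1)
            keys.append(neighbor[r][c])

    # Scaled dot-product scores followed by a numerically stable softmax.
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    peak = max(scores)
    weights = [math.exp(s - peak) for s in scores]
    total = sum(weights)
    weights = [w / total for w in weights]

    # In this sketch the sampled keys also serve as the values.
    channels = len(query)
    return [
        sum(w * key[ch] for w, key in zip(weights, keys))
        for ch in range(channels)
    ]
```

With `radius=0` the function simply returns the neighbor-frame feature at the flow-shifted position; larger windows blend nearby features according to their similarity to the query. A practical model would typically use sub-pixel (for example bilinear) sampling and learned query, key, and value projections.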
Pages: 2485-2489
Number of pages: 5
Source
Proceedings - International Conference on Image Processing, ICIP, 2023