FLOW-GUIDED TRANSFORMER FOR VIDEO COLORIZATION

被引：0

作者：

Zhai, Yan ^{[1
]}

Tao, Zhulin ^{[1
]}

Dai, Longquan ^{[2
]}

Wang, He ^{[2
]}

Huang, Xianglin ^{[1
]}

Yang, Lifang ^{[1
]}

机构：

[1] Commun Univ China, Beijing, Peoples R China

[2] Nanjing Univ Sci & Technol, Nanjing, Peoples R China

来源：

2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP | 2023年

基金：

中国国家自然科学基金;

关键词：

Video Colorization; Flow-Guided Attention; Transformer;

D O I：

10.1109/ICIP49359.2023.10223177

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Video colorization aims to add color to black-and-white films. However, propagating color information to the whole video clip accurately is a challenging task. In this paper, we propose Flow-Guided Transformer for Video Colorization (FGTVC), consisting of a Global Motion Aggregation (GMA) module, Residual modules, Flow-Guided Attention blocks (FGAB) based on encoder and decoder, to exploit the information from the neighbor patch with high similarity for each video patch colorization. Specifically, we employ Transformer to capture the long-distance dependencies between frames and learn non-local self-similarity in the frame. To overcome the shortcomings of previous optical flow-based methods, FGAB enjoys the guidance of optical flow to sample elements from spatio-temporal adjacent frames when calculating self-attention. Experiments show that the proposed FGTVC has an outstanding performance than the state-of-the-art methods. In addition, comprehensive findings demonstrate the superiority of our framework in real-world video colorization tasks.

引用

页码：2485 / 2489

页数：5

共 50 条

[1] Flow-Guided Transformer for Video Colorization
Zhai, Yan
Tao, Zhulin
Dai, Longquan
Wang, He
Huang, Xianglin
Yang, Lifang
Proceedings - International Conference on Image Processing, ICIP, 2023, : 2485 - 2489
[2] Flow-Guided Transformer for Video Inpainting
Zhang, Kaidong
Fu, Jingjing
Liu, Dong
COMPUTER VISION - ECCV 2022, PT XVIII, 2022, 13678 : 74 - 90
[3] Flow-Guided Sparse Transformer for Video Deblurring
Lin, Jing
Cai, Yuanhao
Hu, Xiaowan
Wang, Haoqian
Yan, Youliang
Zou, Xueyi
Ding, Henghui
Zhang, Yulun
Timofte, Radu
Van Gool, Luc
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
[4] FSTT: Flow-Guided Spatial Temporal Transformer for Deep Video Inpainting
Liu, Ruixin
Zhu, Yuesheng
ELECTRONICS, 2023, 12 (21)
[5] Deep Flow-Guided Video Inpainting
Xu, Rui
Li, Xiaoxiao
Zhou, Bolei
Loy, Chen Change
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3718 - 3727
[6] FVIFormer: Flow-Guided Global-Local Aggregation Transformer Network for Video Inpainting
Yan, Weiqing
Sun, Yiqiu
Yue, Guanghui
Zhou, Wei
Liu, Hantao
IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2024, 14 (02) : 235 - 244
[7] Local and nonlocal flow-guided video inpainting
Jing Wang
Zongju Yang
Zhanqiang Huo
Wei Chen
Multimedia Tools and Applications, 2024, 83 : 10321 - 10340
[8] Local and nonlocal flow-guided video inpainting
Wang, Jing
Yang, Zongju
Huo, Zhanqiang
Chen, Wei
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (04) : 10321 - 10340
[9] Flow-Guided Video Inpainting with Scene Templates
Lao, Dong
Zhu, Peihao
Wonka, Peter
Sundaramoorthi, Ganesh
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 14579 - 14588
[10] Code Comments Generation with Data Flow-Guided Transformer
Zhou, Wen
Wu, Junhua
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2022, 13579 LNCS : 168 - 180

← 1 2 3 4 5 →