Multi-Stage Spatio-Temporal Fusion Network for Fast and Accurate Video Bit-Depth Enhancement

Cited by: 0

Authors:

Liu, Jing [1]

Fan, Zhiwei [1]

Yang, Ziwen [1]

Su, Yuting [1]

Yang, Xiaokang [2]

Affiliations:

[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China

[2] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai 200240, Peoples R China

Source:

IEEE TRANSACTIONS ON MULTIMEDIA | 2024 / Vol. 26

Keywords:

Feature extraction; Image reconstruction; Task analysis; Fuses; Motion compensation; Distortion; Image color analysis; Video bit-depth enhancement; multiple stages; spatio-temporal fusion; EXPANSION;

DOI:

10.1109/TMM.2023.3296225

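Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract: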

For video bit-depth enhancement (VBDE) tasks, inter-frame information is critical for removing false contours and recovering details in low bit-depth (LBD) videos. However, because neighboring frames exhibit different structural distortions and complex motions, it is difficult to utilize inter-frame information effectively. Most algorithms rely on alignment operations to provide information from neighboring frames and suffer from slow inference speed due to their complex alignment module designs. Meanwhile, most existing methods perform intra-frame feature extraction and inter-frame information fusion sequentially, and therefore fail to fuse spatio-temporal information efficiently. In this paper, we therefore propose a two-stage progressive group (TSPG) network to find complementary information related to the target frame without adopting an alignment operation. To achieve intra-frame feature extraction and inter-frame feature fusion simultaneously, we propose a parallel spatio-temporal fusion (PSTF) module with a dual-branch spatial-temporal residual (DSTR) block that focuses on more useful temporal information while ensuring faster inference speed. Extensive experiments on public datasets demonstrate that our proposed multi-stage spatio-temporal fusion network (named MSTFN) can quickly and effectively eliminate false contours and recover high-quality target frames. Furthermore, our method outperforms state-of-the-art methods in terms of both PSNR and SSIM while reaching faster inference speeds.
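As background for the terms used in the abstract, the following is a minimal sketch of how a low bit-depth frame can be simulated and how PSNR is computed against the original. It is not the authors' method: the function names (`quantize`, `zero_pad`, `psnr`), the 8-bit to 4-bit setting, and the synthetic gradient frame are all assumptions chosen for illustration, and zero padding is only the naive baseline that leaves false contours in smooth regions.

```python
import math

def quantize(frame, high_bits=8, low_bits=4):
    """Simulate an LBD frame by dropping the least significant bits (assumed setup)."""
    shift = high_bits - low_bits
    return [[p >> shift for p in row] for row in frame]

def zero_pad(frame, high_bits=8, low_bits=4):
    """Naive baseline: shift back up and fill the missing bits with zeros."""
    shift = high_bits - low_bits
    return [[p << shift for p in row] for row in frame]

def psnr(ref, test, high_bits=8):
    """Peak signal-to-noise ratio in dB between two equally sized frames."""
    peak = (1 << high_bits) - 1
    count = sum(len(row) for row in ref)
    mse = sum((a - b) ** 2
              for ref_row, test_row in zip(ref, test)
              for a, b in zip(ref_row, test_row)) / count
    return math.inf if mse == 0 else 10 * math.log10(peak ** 2 / mse)

# Hypothetical test frame: a smooth horizontal gradient, 4 rows of 64 pixels.
frame = [[x for x in range(0, 256, 4)] for _ in range(4)]
lbd = quantize(frame)        # 4-bit values in 0..15
restored = zero_pad(lbd)     # back to the 8-bit range, but with visible steps

print("distinct levels in original row:", len(set(frame[0])))
print("distinct levels in restored row:", len(set(restored[0])))
print("PSNR of zero-padded frame (dB):", round(psnr(frame, restored), 2))
```

The drop in the number of distinct levels along the gradient is what appears visually as false contours; a VBDE method aims to recover the smooth ramp rather than the staircase.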


Pages: 2444-2455

Page count: 12
