Multi-Stage Spatio-Temporal Fusion Network for Fast and Accurate Video Bit-Depth Enhancement

被引：0

作者：

Liu, Jing ^{[1
]}

Fan, Zhiwei ^{[1
]}

Yang, Ziwen ^{[1
]}

Su, Yuting ^{[1
]}

Yang, Xiaokang ^{[2
]}

机构：

[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China

[2] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai 200240, Peoples R China

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2024年 / 26卷

关键词：

Feature extraction; Image reconstruction; Task analysis; Fuses; Motion compensation; Distortion; Image color analysis; Video bit-depth enhancement; multiple stages; spatio-temporal fusion; EXPANSION;

D O I：

10.1109/TMM.2023.3296225

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

For video bit-depth enhancement (VBDE) tasks, inter-frame information is critical for removing false contours and recovering the details in low bit-depth (LBD) videos. However, due to different structural distortions and complex motions in the neighboring frames, it is difficult to effectively utilized inter-frame information. Most algorithms rely on alignment operations to provide information of neighboring frames, suffering from slow inference speed due to the complex alignment module design. Meanwhile, most existing methods sequentially perform the intra-frame feature extractions and inter-frame information fusions, but fail to efficiently fuse spatio-temporal information. Therefore, in this paper, we propose a two-stage progressive group (TSPG) network to find complementary information related to the target frame without adopting an alignment operation. To simultaneously achieve intra-frame feature extractions and inter-frame feature fusions, we propose a parallel spatio-temporal fusion (PSTF) module with a dual-branch spatial-temporal residual (DSTR) block to focus on more useful temporal information while ensuring a faster inference speeds. Extensive experiments on public datasets demonstrate that our proposed multi-stage spatio-temporal fusion network (named MSTFN) can quickly and effectively eliminate false contours and recover high quality target frames. Furthermore, our method outperforms the state-of-the-art methods in terms of both PSNR and SSIM, and can reach faster inference speeds.

引用

页码：2444 / 2455

页数：12

共 50 条

[1] Spatio-temporal progressive optimization network for video bit depth enhancement
Li, Qingying
Lin, Xin
Liu, Jing
Su, Yuting
Ma, Rui
MULTIMEDIA SYSTEMS, 2024, 30 (05)
[2] Efficient Multi-Stage Video Denoising with Recurrent Spatio-Temporal Fusion
Maggioni, Matteo
Huang, Yibin
Li, Cheng
Xiao, Shuai
Fu, Zhongqian
Song, Fenglong
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 3465 - 3474
[3] A multi-stage spatio-temporal adaptive network for video super-resolution
Zhang, Yuhang
Chen, Zhenzhong
Liu, Shan
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 87
[4] TANet: Target Attention Network for Video Bit-Depth Enhancement
Liu, Jing
Yang, Ziwen
Su, Yuting
Yang, Xiaokang
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 4212 - 4223
[5] Spatio-Temporal Consistency in Depth Video Enhancement
Li, Li
Zhang, Caiming
JOURNAL OF ADVANCED MECHANICAL DESIGN SYSTEMS AND MANUFACTURING, 2013, 7 (05): : 808 - 817
[6] MFDGCN: Multi-Stage Spatio-Temporal Fusion Diffusion Graph Convolutional Network for Traffic Prediction
Cui, Zhengyan
Zhang, Junjun
Noh, Giseop
Park, Hyun Jun
APPLIED SCIENCES-BASEL, 2022, 12 (05):
[7] Spatio-Temporal Information Fusion Network for Compressed Video Quality Enhancement
Huang, Weiwei
Jia, Kebin
Liu, Pengyu
Yu, Yuan
2023 DATA COMPRESSION CONFERENCE, DCC, 2023, : 343 - 343
[8] Spatiotemporal Symmetric Convolutional Neural Network for Video Bit-Depth Enhancement
Liu, Jing
Liu, Pingping
Su, Yuting
Jing, Peiguang
Yang, Xiaokang
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (09) : 2397 - 2406
[9] Residual-Guided Multiscale Fusion Network for Bit-Depth Enhancement
Liu, Jing
Wen, Xin
Nie, Weizhi
Su, Yuting
Jing, Peiguang
Yang, Xiaokang
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (05) : 2773 - 2786
[10] User-Ranking Video Summarization With Multi-Stage Spatio-Temporal Representation
Huang, Siyu
Li, Xi
Zhang, Zhongfei
Wu, Fei
Han, Junwei
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (06) : 2654 - 2664

← 1 2 3 4 5 →