Multi-Stage Spatio-Temporal Fusion Network for Fast and Accurate Video Bit-Depth Enhancement

被引:0
|
作者
Liu, Jing [1 ]
Fan, Zhiwei [1 ]
Yang, Ziwen [1 ]
Su, Yuting [1 ]
Yang, Xiaokang [2 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai 200240, Peoples R China
关键词
Feature extraction; Image reconstruction; Task analysis; Fuses; Motion compensation; Distortion; Image color analysis; Video bit-depth enhancement; multiple stages; spatio-temporal fusion; EXPANSION;
D O I
10.1109/TMM.2023.3296225
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
For video bit-depth enhancement (VBDE) tasks, inter-frame information is critical for removing false contours and recovering the details in low bit-depth (LBD) videos. However, due to different structural distortions and complex motions in the neighboring frames, it is difficult to effectively utilized inter-frame information. Most algorithms rely on alignment operations to provide information of neighboring frames, suffering from slow inference speed due to the complex alignment module design. Meanwhile, most existing methods sequentially perform the intra-frame feature extractions and inter-frame information fusions, but fail to efficiently fuse spatio-temporal information. Therefore, in this paper, we propose a two-stage progressive group (TSPG) network to find complementary information related to the target frame without adopting an alignment operation. To simultaneously achieve intra-frame feature extractions and inter-frame feature fusions, we propose a parallel spatio-temporal fusion (PSTF) module with a dual-branch spatial-temporal residual (DSTR) block to focus on more useful temporal information while ensuring a faster inference speeds. Extensive experiments on public datasets demonstrate that our proposed multi-stage spatio-temporal fusion network (named MSTFN) can quickly and effectively eliminate false contours and recover high quality target frames. Furthermore, our method outperforms the state-of-the-art methods in terms of both PSNR and SSIM, and can reach faster inference speeds.
引用
收藏
页码:2444 / 2455
页数:12
相关论文
共 50 条
  • [21] SPATIO-TEMPORAL CONSISTENT DEPTH MAPS FROM MULTI-VIEW VIDEO
    Mueller, Marcus
    Zilly, Frederik
    Riechert, Christian
    Kauff, Peter
    2011 3DTV CONFERENCE: THE TRUE VISION - CAPTURE, TRANSMISSION AND DISPLAY OF 3D VIDEO (3DTV-CON), 2011,
  • [22] Efficient Spatio-Temporal Network with Gated Fusion for Video Super-Resolution
    Li, Changyu
    Zhang, Dongyang
    Xie, Ning
    Shao, Jie
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2021, PT V, 2021, 12895 : 640 - 651
  • [23] Spatio-temporal co-attention fusion network for video splicing localization
    Lin, Man
    Cao, Gang
    Lou, Zijie
    Zhang, Chi
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (03) : 33027
  • [24] DeepVideoMVS: Multi-View Stereo on Video with Recurrent Spatio-Temporal Fusion
    Duzceker, Arda
    Galliani, Silvano
    Vogel, Christoph
    Speciale, Pablo
    Dusmanu, Mihai
    Pollefeys, Marc
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 15319 - 15328
  • [25] Coarse-to-Fine Spatio-Temporal Information Fusion for Compressed Video Quality Enhancement
    Luo, Dengyan
    Ye, Mao
    Li, Shuai
    Li, Xue
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 543 - 547
  • [26] Unified multi-stage fusion network for affective video content analysis
    Yi, Yun
    Wang, Hanli
    Tang, Pengjie
    ELECTRONICS LETTERS, 2022, 58 (21) : 795 - 797
  • [27] Multi-Stage Feature Fusion Network for Video Super-Resolution
    Song, Huihui
    Xu, Wenjie
    Liu, Dong
    Liu, Bo
    Liu, Qingshan
    Metaxas, Dimitris N.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 2923 - 2934
  • [28] Spatio-Temporal Multi-stage OpenFlow Switch Model for Software Defined Cellular Networks
    Ozcevik, Yusuf
    Erel, Muge
    Canberk, Berk
    2015 IEEE 82ND VEHICULAR TECHNOLOGY CONFERENCE (VTC FALL), 2015,
  • [29] Dast-Net: Depth-Aware Spatio-Temporal Network for Video Deblurring
    Zhu, Qi
    Xiao, Zeyu
    Huang, Jie
    Zhao, Feng
    Proceedings - IEEE International Conference on Multimedia and Expo, 2022, 2022-July
  • [30] SPATIO-TEMPORAL CONVOLUTIONAL NEURAL NETWORK FOR ELDERLY FALL DETECTION IN DEPTH VIDEO CAMERAS
    Rahnemoonfar, Maryam
    Alkittawi, Hend
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 2868 - 2873