Multi-Stage Spatio-Temporal Fusion Network for Fast and Accurate Video Bit-Depth Enhancement

被引:0
|
作者
Liu, Jing [1 ]
Fan, Zhiwei [1 ]
Yang, Ziwen [1 ]
Su, Yuting [1 ]
Yang, Xiaokang [2 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai 200240, Peoples R China
关键词
Feature extraction; Image reconstruction; Task analysis; Fuses; Motion compensation; Distortion; Image color analysis; Video bit-depth enhancement; multiple stages; spatio-temporal fusion; EXPANSION;
D O I
10.1109/TMM.2023.3296225
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
For video bit-depth enhancement (VBDE) tasks, inter-frame information is critical for removing false contours and recovering the details in low bit-depth (LBD) videos. However, due to different structural distortions and complex motions in the neighboring frames, it is difficult to effectively utilized inter-frame information. Most algorithms rely on alignment operations to provide information of neighboring frames, suffering from slow inference speed due to the complex alignment module design. Meanwhile, most existing methods sequentially perform the intra-frame feature extractions and inter-frame information fusions, but fail to efficiently fuse spatio-temporal information. Therefore, in this paper, we propose a two-stage progressive group (TSPG) network to find complementary information related to the target frame without adopting an alignment operation. To simultaneously achieve intra-frame feature extractions and inter-frame feature fusions, we propose a parallel spatio-temporal fusion (PSTF) module with a dual-branch spatial-temporal residual (DSTR) block to focus on more useful temporal information while ensuring a faster inference speeds. Extensive experiments on public datasets demonstrate that our proposed multi-stage spatio-temporal fusion network (named MSTFN) can quickly and effectively eliminate false contours and recover high quality target frames. Furthermore, our method outperforms the state-of-the-art methods in terms of both PSNR and SSIM, and can reach faster inference speeds.
引用
收藏
页码:2444 / 2455
页数:12
相关论文
共 50 条
  • [1] Spatio-temporal progressive optimization network for video bit depth enhancement
    Li, Qingying
    Lin, Xin
    Liu, Jing
    Su, Yuting
    Ma, Rui
    MULTIMEDIA SYSTEMS, 2024, 30 (05)
  • [2] Efficient Multi-Stage Video Denoising with Recurrent Spatio-Temporal Fusion
    Maggioni, Matteo
    Huang, Yibin
    Li, Cheng
    Xiao, Shuai
    Fu, Zhongqian
    Song, Fenglong
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 3465 - 3474
  • [3] A multi-stage spatio-temporal adaptive network for video super-resolution
    Zhang, Yuhang
    Chen, Zhenzhong
    Liu, Shan
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 87
  • [4] TANet: Target Attention Network for Video Bit-Depth Enhancement
    Liu, Jing
    Yang, Ziwen
    Su, Yuting
    Yang, Xiaokang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 4212 - 4223
  • [5] Spatio-Temporal Consistency in Depth Video Enhancement
    Li, Li
    Zhang, Caiming
    JOURNAL OF ADVANCED MECHANICAL DESIGN SYSTEMS AND MANUFACTURING, 2013, 7 (05): : 808 - 817
  • [6] MFDGCN: Multi-Stage Spatio-Temporal Fusion Diffusion Graph Convolutional Network for Traffic Prediction
    Cui, Zhengyan
    Zhang, Junjun
    Noh, Giseop
    Park, Hyun Jun
    APPLIED SCIENCES-BASEL, 2022, 12 (05):
  • [7] Spatio-Temporal Information Fusion Network for Compressed Video Quality Enhancement
    Huang, Weiwei
    Jia, Kebin
    Liu, Pengyu
    Yu, Yuan
    2023 DATA COMPRESSION CONFERENCE, DCC, 2023, : 343 - 343
  • [8] Spatiotemporal Symmetric Convolutional Neural Network for Video Bit-Depth Enhancement
    Liu, Jing
    Liu, Pingping
    Su, Yuting
    Jing, Peiguang
    Yang, Xiaokang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (09) : 2397 - 2406
  • [9] Residual-Guided Multiscale Fusion Network for Bit-Depth Enhancement
    Liu, Jing
    Wen, Xin
    Nie, Weizhi
    Su, Yuting
    Jing, Peiguang
    Yang, Xiaokang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (05) : 2773 - 2786
  • [10] User-Ranking Video Summarization With Multi-Stage Spatio-Temporal Representation
    Huang, Siyu
    Li, Xi
    Zhang, Zhongfei
    Wu, Fei
    Han, Junwei
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (06) : 2654 - 2664