STFE-VC: Spatio-temporal feature enhancement for learned video compression

被引:0
|
作者
Wang, Yiming [1 ]
Huang, Qian [1 ,3 ]
Tang, Bin [1 ]
Li, Xin [1 ]
Li, Xing [2 ]
机构
[1] Hohai Univ, Coll Comp Sci & Software Engn, Nanjing, Jiangsu, Peoples R China
[2] Nanjing Forestry Univ, Coll Informat Sci & Technol, Nanjing, Peoples R China
[3] Changzhou Univ, Jiangsu Engn Res Ctr Digital Twinning Technol, Key Equipment Petrochem Proc, Changzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Spatio-temporal feature enhancement; Learned video compression; Spatio-temporal motion enhancement; In-loop filtering enhancement;
D O I
10.1016/j.eswa.2025.126682
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the increasing growth of video data, limited bandwidth and hardware resource constraints demand more efficient video compression. Current learned video compression methods have shown promising performance. However, these methods mainly rely on the optical flow networks to perform temporal prediction, which may suffer from inaccurate motion estimation and introduce extra artifacts to reconstructed frames. In this paper, we propose a spatio-temporal feature enhancement method for learned video compression to better model the inter-frame motion patterns and reduce compression artifacts. Specifically, we introduce a spatio-temporal motion enhancement module that further extracts the feature representation of original motion vector to enhance corresponding spatial and temporal components. Then, we introduce an in-loop filtering enhancement module that employs cascaded residual blocks to progressively enhance feature textures and provide higher- quality temporal domain reference signals for subsequent reconstruction. More importantly, our proposed method can be integrated into the widely-used residual coding and contextual coding schemes. Comprehensive experiments demonstrate that our integrated methods are superior to the previous learned methods on JCTVC, UVG and MCL-JCV benchmark datasets. In addition, our integrated methods also outperform the latest generalized video coding standard (H.266/VVC) by a larger margin in terms of MS-SSIM metric.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] High performance holographic video compression using spatio-temporal phase unwrapping
    Gonzalez, Sorayda Trejos
    Velez-Zea, Alejandro
    Barrera-Ramirez, John Fredy
    OPTICS AND LASERS IN ENGINEERING, 2024, 181
  • [42] End-to-End Learning of Video Compression Using Spatio-Temporal Autoencoders
    Pessoa, Jorge
    Aidos, Helena
    Tomas, Pedro
    Figueiredo, Mario A. T.
    2020 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS), 2020, : 276 - 281
  • [43] Video Segmentation with Spatio-Temporal Tubes
    Trichet, Remi
    Nevatia, Ramakant
    2013 10TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS 2013), 2013, : 330 - 335
  • [44] Spatio-temporal segmentation for video surveillance
    Sun, HZ
    Tan, TN
    ELECTRONICS LETTERS, 2001, 37 (01) : 20 - 21
  • [45] SPATIO-TEMPORAL SALIENT FEATURE EXTRACTION FOR PERCEPTUAL CONTENT BASED VIDEO RETRIEVAL
    Megrhi, Sameh
    Souidene, Wided
    Beghdadi, Azeddine
    2013 COLOUR AND VISUAL COMPUTING SYMPOSIUM (CVCS), 2013,
  • [46] Efficient Spatio-Temporal Feature Extraction Recurrent Neural Network for Video Deblurring
    Pu Z.
    Ma W.
    Mi Q.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (11): : 1720 - 1730
  • [47] SPATIO-TEMPORAL INTERACTIVE LAWS FEATURE CORRELATION METHOD TO VIDEO QUALITY ASSESSMENT
    Liu, Kuan-Hsien
    Liu, Tsung-Jung
    Liu, Hsin-Hua
    Pei, Soo-Chang
    2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW 2018), 2018,
  • [48] Spatio-temporal segmentation for video surveillance
    Sun, HZ
    Feng, T
    Tan, TN
    15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS: COMPUTER VISION AND IMAGE ANALYSIS, 2000, : 843 - 846
  • [49] Bidirectional Spatio-Temporal Feature Learning With Multiscale Evaluation for Video Anomaly Detection
    Zhong, Yuanhong
    Chen, Xia
    Hu, Yongting
    Tang, Panliang
    Ren, Fan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (12) : 8285 - 8296
  • [50] VideoZoom Spatio-Temporal Video Browser
    Smith, John R.
    IEEE TRANSACTIONS ON MULTIMEDIA, 1999, 1 (02) : 157 - 171