STFE-VC: Spatio-temporal feature enhancement for learned video compression

被引:0
|
作者
Wang, Yiming [1 ]
Huang, Qian [1 ,3 ]
Tang, Bin [1 ]
Li, Xin [1 ]
Li, Xing [2 ]
机构
[1] Hohai Univ, Coll Comp Sci & Software Engn, Nanjing, Jiangsu, Peoples R China
[2] Nanjing Forestry Univ, Coll Informat Sci & Technol, Nanjing, Peoples R China
[3] Changzhou Univ, Jiangsu Engn Res Ctr Digital Twinning Technol, Key Equipment Petrochem Proc, Changzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Spatio-temporal feature enhancement; Learned video compression; Spatio-temporal motion enhancement; In-loop filtering enhancement;
D O I
10.1016/j.eswa.2025.126682
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the increasing growth of video data, limited bandwidth and hardware resource constraints demand more efficient video compression. Current learned video compression methods have shown promising performance. However, these methods mainly rely on the optical flow networks to perform temporal prediction, which may suffer from inaccurate motion estimation and introduce extra artifacts to reconstructed frames. In this paper, we propose a spatio-temporal feature enhancement method for learned video compression to better model the inter-frame motion patterns and reduce compression artifacts. Specifically, we introduce a spatio-temporal motion enhancement module that further extracts the feature representation of original motion vector to enhance corresponding spatial and temporal components. Then, we introduce an in-loop filtering enhancement module that employs cascaded residual blocks to progressively enhance feature textures and provide higher- quality temporal domain reference signals for subsequent reconstruction. More importantly, our proposed method can be integrated into the widely-used residual coding and contextual coding schemes. Comprehensive experiments demonstrate that our integrated methods are superior to the previous learned methods on JCTVC, UVG and MCL-JCV benchmark datasets. In addition, our integrated methods also outperform the latest generalized video coding standard (H.266/VVC) by a larger margin in terms of MS-SSIM metric.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Blind video quality assessment based on Spatio-Temporal Feature Resolver
    Bi, Xiaodong
    He, Xiaohai
    Xiong, Shuhua
    Zhao, Zeming
    Chen, Honggang
    Sheriff, Raymond Edward
    NEUROCOMPUTING, 2024, 574
  • [32] Interactive spatio-temporal feature learning network for video foreground detection
    Zhang, Hongrui
    Li, Huan
    COMPLEX & INTELLIGENT SYSTEMS, 2022, 8 (05) : 4251 - 4263
  • [33] Guest Editorial: Spatio-temporal Feature Learning for Unconstrained Video Analysis
    Yahong Han
    Liqiang Nie
    Fei Wu
    Multimedia Tools and Applications, 2018, 77 : 29209 - 29211
  • [34] Spatio-Temporal Information Fusion Network for Compressed Video Quality Enhancement
    Huang, Weiwei
    Jia, Kebin
    Liu, Pengyu
    Yu, Yuan
    2023 DATA COMPRESSION CONFERENCE, DCC, 2023, : 343 - 343
  • [35] Spatio-temporal propagation and reconstruction for low-light video enhancement
    Ye, Jing
    Qiu, Changzhen
    Zhang, Zhiyong
    DIGITAL SIGNAL PROCESSING, 2023, 139
  • [36] Cross-scale hierarchical spatio-temporal transformer for video enhancement
    Jiang, Qin
    Wang, Qinglin
    Chi, Lihua
    Liu, Jie
    KNOWLEDGE-BASED SYSTEMS, 2025, 309
  • [37] Anomaly Detection Using Spatio-Temporal Context Learned by Video Clip Sorting
    Shao, Wen
    Kawakami, Rei
    Naemura, Takeshi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2022, E105D (05) : 1094 - 1102
  • [38] Spatio-Temporal Detail Information Retrieval for Compressed Video Quality Enhancement
    Luo, Dengyan
    Ye, Mao
    Li, Shuai
    Zhu, Ce
    Li, Xue
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 6808 - 6820
  • [39] Spatio-temporal progressive optimization network for video bit depth enhancement
    Li, Qingying
    Lin, Xin
    Liu, Jing
    Su, Yuting
    Ma, Rui
    MULTIMEDIA SYSTEMS, 2024, 30 (05)
  • [40] A Spatio-temporal Data Compression Algorithm
    Wang, Lei
    Guo, Yiming
    Chen, Chen
    Yan, Yaowei
    2012 FOURTH INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION NETWORKING AND SECURITY (MINES 2012), 2012, : 421 - 424