STFE-VC: Spatio-temporal feature enhancement for learned video compression

被引：0

作者：

Wang, Yiming ^{[1
]}

Huang, Qian ^{[1
,3
]}

Tang, Bin ^{[1
]}

Li, Xin ^{[1
]}

Li, Xing ^{[2
]}

机构：

[1] Hohai Univ, Coll Comp Sci & Software Engn, Nanjing, Jiangsu, Peoples R China

[2] Nanjing Forestry Univ, Coll Informat Sci & Technol, Nanjing, Peoples R China

[3] Changzhou Univ, Jiangsu Engn Res Ctr Digital Twinning Technol, Key Equipment Petrochem Proc, Changzhou, Peoples R China

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2025年 / 272卷

基金：

中国国家自然科学基金;

关键词：

Spatio-temporal feature enhancement; Learned video compression; Spatio-temporal motion enhancement; In-loop filtering enhancement;

D O I：

10.1016/j.eswa.2025.126682

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

With the increasing growth of video data, limited bandwidth and hardware resource constraints demand more efficient video compression. Current learned video compression methods have shown promising performance. However, these methods mainly rely on the optical flow networks to perform temporal prediction, which may suffer from inaccurate motion estimation and introduce extra artifacts to reconstructed frames. In this paper, we propose a spatio-temporal feature enhancement method for learned video compression to better model the inter-frame motion patterns and reduce compression artifacts. Specifically, we introduce a spatio-temporal motion enhancement module that further extracts the feature representation of original motion vector to enhance corresponding spatial and temporal components. Then, we introduce an in-loop filtering enhancement module that employs cascaded residual blocks to progressively enhance feature textures and provide higher- quality temporal domain reference signals for subsequent reconstruction. More importantly, our proposed method can be integrated into the widely-used residual coding and contextual coding schemes. Comprehensive experiments demonstrate that our integrated methods are superior to the previous learned methods on JCTVC, UVG and MCL-JCV benchmark datasets. In addition, our integrated methods also outperform the latest generalized video coding standard (H.266/VVC) by a larger margin in terms of MS-SSIM metric.

引用

页数：13

共 50 条

[31] Blind video quality assessment based on Spatio-Temporal Feature Resolver
Bi, Xiaodong
He, Xiaohai
Xiong, Shuhua
Zhao, Zeming
Chen, Honggang
Sheriff, Raymond Edward
NEUROCOMPUTING, 2024, 574
[32] Interactive spatio-temporal feature learning network for video foreground detection
Zhang, Hongrui
Li, Huan
COMPLEX & INTELLIGENT SYSTEMS, 2022, 8 (05) : 4251 - 4263
[33] Guest Editorial: Spatio-temporal Feature Learning for Unconstrained Video Analysis
Yahong Han
Liqiang Nie
Fei Wu
Multimedia Tools and Applications, 2018, 77 : 29209 - 29211
[34] Spatio-Temporal Information Fusion Network for Compressed Video Quality Enhancement
Huang, Weiwei
Jia, Kebin
Liu, Pengyu
Yu, Yuan
2023 DATA COMPRESSION CONFERENCE, DCC, 2023, : 343 - 343
[35] Spatio-temporal propagation and reconstruction for low-light video enhancement
Ye, Jing
Qiu, Changzhen
Zhang, Zhiyong
DIGITAL SIGNAL PROCESSING, 2023, 139
[36] Cross-scale hierarchical spatio-temporal transformer for video enhancement
Jiang, Qin
Wang, Qinglin
Chi, Lihua
Liu, Jie
KNOWLEDGE-BASED SYSTEMS, 2025, 309
[37] Anomaly Detection Using Spatio-Temporal Context Learned by Video Clip Sorting
Shao, Wen
Kawakami, Rei
Naemura, Takeshi
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2022, E105D (05) : 1094 - 1102
[38] Spatio-Temporal Detail Information Retrieval for Compressed Video Quality Enhancement
Luo, Dengyan
Ye, Mao
Li, Shuai
Zhu, Ce
Li, Xue
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 6808 - 6820
[39] Spatio-temporal progressive optimization network for video bit depth enhancement
Li, Qingying
Lin, Xin
Liu, Jing
Su, Yuting
Ma, Rui
MULTIMEDIA SYSTEMS, 2024, 30 (05)
[40] A Spatio-temporal Data Compression Algorithm
Wang, Lei
Guo, Yiming
Chen, Chen
Yan, Yaowei
2012 FOURTH INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION NETWORKING AND SECURITY (MINES 2012), 2012, : 421 - 424

← 1 2 3 4 5 →