Resampling video super-resolution based on multi-scale guided optical flow

Citations: 0

Authors:

Li, Puying [1]

Zhu, Fuzhen [1]

Liu, Yong [1]

Zhang, Qi [1]

Affiliations:

[1] Heilongjiang Univ, Sch Elect Engn, Harbin 150080, Peoples R China

Source:

COMPUTERS & ELECTRICAL ENGINEERING | 2025 / Vol. 123

Keywords:

Video super-resolution; Transformer; Multi-scale adaptive flow estimation; Resampling; NETWORKS;

DOI:

10.1016/j.compeleceng.2025.110176

Chinese Library Classification (CLC) number:

TP3 [Computing Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Existing video super-resolution (VSR) methods handle inter-frame motion and spatial distortion poorly, especially in high-motion scenes, where they tend to lose detail and degrade reconstruction quality. To address these challenges, this paper proposes a resampling video super-resolution algorithm based on multi-scale guided optical flow. The method combines multi-scale guided optical flow estimation, which addresses inter-frame motion, with a resampling deformable convolution module, which addresses spatial distortion. Specifically, features are first extracted from low-quality video frames using a convolutional layer, followed by further feature extraction with Residual Swin Transformer Blocks (RSTBs). In the feature alignment module, a multi-scale guided optical flow estimation approach handles inter-frame motion across different video segments and performs video frame interpolation and super-resolution reconstruction simultaneously. Furthermore, spatial alignment is achieved by integrating resampling into the deformable convolution module, which mitigates spatial distortion. Finally, multiple RSTBs extract and fuse features, and pixel rearrangement layers reconstruct high-quality video frames. Experimental results on the REDS, Vid4, and UDM10 datasets show that the method outperforms current state-of-the-art (SOTA) techniques, with improvements of 0.61 dB in Peak Signal-to-Noise Ratio (PSNR) and 0.0121 in Structural Similarity (SSIM), validating its effectiveness.
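
To give the reported PSNR figure some intuition, the following is a minimal sketch of the standard PSNR definition, 10·log10(peak² / MSE), together with the change in mean squared error that a given dB gain implies for a single image. It is not the authors' evaluation code: the function names, the 8-bit peak value of 255, and the toy pixel values are assumptions made for illustration. VSR benchmarks usually average PSNR over frames (often on the luminance channel), so the per-image MSE ratio is only a rough reading of an averaged gain.

```python
import math


def psnr(reference, estimate, peak=255.0):
    """PSNR in dB between two equal-length sequences of pixel values.

    peak=255.0 assumes 8-bit images (an illustrative assumption).
    """
    if len(reference) != len(estimate) or len(reference) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    mse = sum((r - e) ** 2 for r, e in zip(reference, estimate)) / len(reference)
    if mse == 0:
        return math.inf  # identical signals
    return 10.0 * math.log10(peak ** 2 / mse)


def mse_ratio_for_gain(gain_db):
    """Ratio MSE_new / MSE_old implied by a PSNR gain of gain_db on one image."""
    return 10.0 ** (-gain_db / 10.0)


# Toy data (hypothetical pixel values, not taken from any dataset).
ref = [52, 55, 61, 59, 79, 61, 76, 61]
est = [54, 55, 60, 59, 77, 63, 76, 60]

print(f"PSNR: {psnr(ref, est):.2f} dB")
print(f"MSE ratio for a 0.61 dB gain: {mse_ratio_for_gain(0.61):.3f}")
```

Under these assumptions, a 0.61 dB gain on a single image corresponds to roughly a 13% reduction in mean squared error.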

Pages: 14
