Resampling video super-resolution based on multi-scale guided optical flow

被引:0
|
作者
Li, Puying [1 ]
Zhu, Fuzhen [1 ]
Liu, Yong [1 ]
Zhang, Qi [1 ]
机构
[1] Heilongjiang Univ, Sch Elect Engn, Harbin 150080, Peoples R China
关键词
Video super-resolution; Transformer; Multi-scale adaptive flow estimation; Resampling; NETWORKS;
D O I
10.1016/j.compeleceng.2025.110176
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Existing video super-resolution (VSR) methods are inadequate for dealing with inter-frame motion and spatial distortion problems, especially in high-motion scenes, which tend to lead to loss of details and degradation of reconstruction quality. To address these challenges, this paper puts forward a resampling video super-resolution algorithm based on multiscale guided optical flow. The method combines multi-scale guided optical flow estimation to address the issue of interframe motion and a resampling deformable convolution module to address the issue of spatial distortion. Specifically, features are first extracted from low-quality video frames using a convolutional layer, followed by feature extraction with Residual Swin Transformer Blocks (RSTBs). In the feature alignment module, a multiscale-guided optical flow estimation approach is employed, which addresses the inter-frame motion problem across different video segments and performs video frame interpolation and super-resolution reconstruction simultaneously. Furthermore, spatial alignment is achieved by integrating resampling into the deformable convolution module, mitigating spatial distortion. Finally, multiple Residual Swin Transformer Blocks (RSTBs) are used to extract and fuse features, and pixel rearrangement layers are employed to reconstruct high-quality video frames. The experimental results on the REDS, Vid4, and UDM10 datasets show that our method significantly outperforms current state-of-the-art (SOTA) techniques, with improvements of 0.61 dB in Peak Signal-to-Noise Ratio (PSNR) and 0.0121 in Structural Similarity (SSIM), validating the effectiveness and superiority of the method.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] CT image super-resolution reconstruction based on multi-scale residual network
    Wu Lei
    Lyu Guo-qiang
    Zhao Chen
    Sheng Jie-chao
    Feng Qi-bin
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2019, 34 (10) : 1006 - 1012
  • [42] Image super-resolution reconstruction based on multi-scale dual-attention
    Li, Hong-an
    Wang, Diao
    Zhang, Jing
    Li, Zhanli
    Ma, Tian
    CONNECTION SCIENCE, 2023, 35 (01)
  • [43] HAMSA: Hybrid attention transformer and multi-scale alignment aggregation network for video super-resolution
    Xiao, Hanguang
    Wen, Hao
    Wang, Xin
    Zuo, Kun
    Liu, Tianqi
    Wang, Wei
    Xu, Yong
    DIGITAL SIGNAL PROCESSING, 2025, 161
  • [44] Image super-resolution reconstruction based on multi-scale feature mapping network
    Duan R.
    Zhou D.-W.
    Zhao L.-J.
    Chai X.-L.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2019, 53 (07): : 1331 - 1339
  • [45] Dual prior guided depth image super-resolution with multi-scale transformer fusion network
    Zhao, Pengfei
    Ji, Jianhua
    Wen, Yang
    Shi, Wuzhen
    Cao, Wenming
    VISUAL COMPUTER, 2025,
  • [46] Video Super-Resolution using Multi-scale Pyramid 3D Convolutional Networks
    Luo, Jianping
    Huang, Shaofei
    Yuan, Yuan
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 1882 - 1890
  • [47] Multi-scale gated network for efficient image super-resolution
    Miao, Xuan
    Li, Shijie
    Li, Zheng
    Xu, Wenzheng
    Yang, Ning
    VISUAL COMPUTER, 2025, 41 (02): : 1227 - 1239
  • [48] Multi-scale Fractal Coding for Single Image Super-Resolution
    Xie, Wei
    Liu, Jiwei
    Shao, Lizhen
    Jing, Fengwei
    INTELLIGENT COMPUTING THEORY, 2014, 8588 : 425 - 434
  • [49] A Comparative Study of Multi-Scale Image Super-Resolution Techniques
    Giansiracusa, Michael
    Ezekiel, Soundararajan
    Raquepas, Joseph
    Blasch, Erik
    Thomas, Millicent
    2016 IEEE APPLIED IMAGERY PATTERN RECOGNITION WORKSHOP (AIPR), 2016,
  • [50] NONLINEAR MULTI-SCALE SUPER-RESOLUTION USING DEEP LEARNING
    Tran, Kenneth
    Panahi, Ashkan
    Adiga, Aniruddha
    Sakla, Wesam
    Krim, Hamid
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 3182 - 3186