Resampling video super-resolution based on multi-scale guided optical flow

Cited by: 0
Authors
Li, Puying [1]
Zhu, Fuzhen [1]
Liu, Yong [1]
Zhang, Qi [1]
Affiliations
[1] Heilongjiang Univ, Sch Elect Engn, Harbin 150080, Peoples R China
Keywords
Video super-resolution; Transformer; Multi-scale adaptive flow estimation; Resampling; NETWORKS
DOI
10.1016/j.compeleceng.2025.110176
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Existing video super-resolution (VSR) methods handle inter-frame motion and spatial distortion poorly, especially in high-motion scenes, which leads to loss of detail and degraded reconstruction quality. To address these challenges, this paper proposes a resampling video super-resolution algorithm based on multi-scale guided optical flow. The method combines multi-scale guided optical flow estimation, which handles inter-frame motion, with a resampling deformable convolution module, which handles spatial distortion. Specifically, features are first extracted from low-quality video frames using a convolutional layer, followed by further feature extraction with Residual Swin Transformer Blocks (RSTBs). In the feature alignment module, a multi-scale guided optical flow estimation approach addresses inter-frame motion across different video segments and performs video frame interpolation and super-resolution reconstruction simultaneously. Spatial alignment is then achieved by integrating resampling into the deformable convolution module, which mitigates spatial distortion. Finally, multiple RSTBs extract and fuse features, and pixel rearrangement layers reconstruct high-quality video frames. Experimental results on the REDS, Vid4, and UDM10 datasets show that the method outperforms current state-of-the-art (SOTA) techniques, with improvements of 0.61 dB in Peak Signal-to-Noise Ratio (PSNR) and 0.0121 in Structural Similarity (SSIM), supporting the effectiveness of the method.
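The abstract names two standard building blocks that are easy to illustrate: the pixel rearrangement used for final reconstruction and the PSNR metric used for evaluation. The sketch below is not the authors' implementation. It assumes that "pixel rearrangement" follows the common sub-pixel (pixel shuffle) convention, in which a feature map of shape (C*r*r, H, W) becomes (C, H*r, W*r), and that PSNR is computed against a peak value of 255. The function names `pixel_shuffle` and `psnr` are illustrative, and nested lists stand in for tensors to keep the example dependency-free.

```python
import math


def pixel_shuffle(features, r):
    """Rearrange a (C*r*r, H, W) nested list into (C, H*r, W*r).

    Assumed convention: input channel c*r*r + i*r + j supplies the output
    pixel at row offset i and column offset j inside each r x r cell.
    """
    channels_in = len(features)
    if channels_in % (r * r) != 0:
        raise ValueError("channel count must be divisible by r*r")
    height, width = len(features[0]), len(features[0][0])
    channels_out = channels_in // (r * r)
    out = [[[0.0] * (width * r) for _ in range(height * r)]
           for _ in range(channels_out)]
    for c in range(channels_out):
        for i in range(r):
            for j in range(r):
                src = features[c * r * r + i * r + j]
                for h in range(height):
                    for w in range(width):
                        out[c][h * r + i][w * r + j] = src[h][w]
    return out


def psnr(reference, estimate, peak=255.0):
    """PSNR in dB between two equally sized 2-D images (nested lists)."""
    squared_error = 0.0
    count = 0
    for ref_row, est_row in zip(reference, estimate):
        for ref_px, est_px in zip(ref_row, est_row):
            squared_error += (ref_px - est_px) ** 2
            count += 1
    mse = squared_error / count
    if mse == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(peak ** 2 / mse)


# Four 1x1 channels with r=2 become one 2x2 channel.
upscaled = pixel_shuffle([[[1]], [[2]], [[3]], [[4]]], 2)
print(upscaled)  # [[[1, 2], [3, 4]]]
```

Because PSNR is 10·log10(peak² / MSE), a gain of Δ dB at a fixed peak corresponds to an MSE ratio of 10^(−Δ/10). Under that reading, a 0.61 dB gain would correspond to roughly 13% lower mean squared error, although the abstract does not state how the reported figure is averaged across datasets.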
Pages: 14
Related papers
50 in total
  • [1] Guided filter-based multi-scale super-resolution reconstruction
    Feng, Xiaomei
    Li, Jinjiang
    Hua, Zhen
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2020, 5 (02) : 128 - 140
  • [2] Multi-Scale Video Super-Resolution Transformer With Polynomial Approximation
    Zhang, Fan
    Chen, Gongguan
    Wang, Hua
    Li, Jinjiang
    Zhang, Caiming
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (09) : 4496 - 4506
  • [3] Bidirectional Multi-scale Deformable Attention for Video Super-Resolution
    Zhou, Zhenghua
    Xue, Boxiang
    Wang, Hai
    Zhao, Jianwei
    Multimedia Tools and Applications, 2024, 83 (09) : 27809 - 27830
  • [4] Super-resolution based on multi-scale feature aggregation adversarial networks Multi-Scale Super-Resolution with Adversarial Networks
    Song, Wei
    Li, Shuo
    Liao, Bin
    Ning, Keqing
    PROCEEDINGS OF 2024 3RD INTERNATIONAL CONFERENCE ON CRYPTOGRAPHY, NETWORK SECURITY AND COMMUNICATION TECHNOLOGY, CNSCT 2024, 2024 : 356 - 360
  • [5] Multi-scale Residual Dense Block for Video Super-Resolution
    Cui, Hetao
    Sun, Quansen
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: VISUAL DATA ENGINEERING, PT I, 2019, 11935 : 424 - 434
  • [6] Bidirectional Multi-scale Deformable Attention for Video Super-Resolution
    Zhenghua Zhou
    Boxiang Xue
    Hai Wang
    Jianwei Zhao
    Multimedia Tools and Applications, 2024, 83 : 27809 - 27830
  • [7] Bidirectional Multi-scale Deformable Attention for Video Super-Resolution
    Zhou, Zhenghua
    Xue, Boxiang
    Wang, Hai
    Zhao, Jianwei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (09) : 27809 - 27830
  • [8] Video super-resolution based on multi-scale 3D convolution
    Zhan K.
    Sun Y.
    Li Y.
    Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2021, 48 (05) : 8 - 14
  • [9] Attention-guided video super-resolution with recurrent multi-scale spatial–temporal transformer
    Wei Sun
    Xianguang Kong
    Yanning Zhang
    Complex & Intelligent Systems, 2023, 9 : 3989 - 4002
  • [10] Attention-guided video super-resolution with recurrent multi-scale spatial-temporal transformer
    Sun, Wei
    Kong, Xianguang
    Zhang, Yanning
    COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (04) : 3989 - 4002