Bidirectional Multi-scale Deformable Attention for Video Super-Resolution

被引:0
|
作者
Zhenghua Zhou
Boxiang Xue
Hai Wang
Jianwei Zhao
机构
[1] Zhejiang University of Finance and Economics,School of Data Sciences
[2] China Jiliang University,Department of Data Science, College of Sciences
[3] Murdoch University,Discipline of Engineering and Energy
[4] China Jiliang University,College of Information Engineering
来源
关键词
Video super-resolution; Multi-scale deformable convolution; Multi-scale attention; Bidirectional propagation;
D O I
暂无
中图分类号
学科分类号
摘要
Video super-resolution aims to generate a high-resolution video frame from its low-resolution video sequences. Video super-resolution is still a challenging problem due to performing the temporal frame alignment and spatial feature fusion during the process of spatial-temporal modeling. Existing deep learning based methods have limitations in handling accurate alignment and effective fusion of frames with multi-scale feature information. In this paper, we propose Bidirectional Multi-scale Deformable Attention (BMDA) for video Super-Resolution in terms of propagation, alignment and fusion. More specifically, the developed Deformable Alignment Module (DAM) in BMDA contains two kinds of modules: Multi-scale Deformable Convolution Module (MDCM) and Multi-scale Attention Module (MAM). MDCM is leveraged to deal with the offset information in different scales and align adjacent frames at the feature level, improving the robustness of the alignment among adjacent frames. MAM is designed to extract the local and global features of the aligned features for aggregation, such that the feature information compensation between pixels is achieved. Additionally, in order to make full use of shallow features, dense connection structure between each layer is adopted in the framework of bidirectional propagation to achieve better visual performance on video super-resolution. In particular, our proposed BDAM outperforms BasicVSR by up to 1.28dB in PSNR when batch size is set to 2. Experimental results on public video benchmark datasets demonstrate that the proposed method can achieve superior performance on large motion videos as compared with the state-of-the art methods.
引用
收藏
页码:27809 / 27830
页数:21
相关论文
共 50 条
  • [21] Image super-resolution with multi-scale fractal residual attention network
    Song, Xiaogang
    Liu, Wanbo
    Liang, Li
    Shi, Weiwei
    Xie, Guo
    Lu, Xiaofeng
    Hei, Xinhong
    COMPUTERS & GRAPHICS-UK, 2023, 113 : 21 - 31
  • [22] Multi-scale convolutional attention network for lightweight image super-resolution
    Xie, Feng
    Lu, Pei
    Liu, Xiaoyong
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 95
  • [23] Multi-Scale Residual Channel Attention Network for Face Super-Resolution
    Jin W.
    Chen Y.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2020, 32 (06): : 959 - 970
  • [24] Attention augmented multi-scale network for single image super-resolution
    Chengyi Xiong
    Xiaodi Shi
    Zhirong Gao
    Ge Wang
    Applied Intelligence, 2021, 51 : 935 - 951
  • [25] Attention-guided video super-resolution with recurrent multi-scale spatial-temporal transformer
    Sun, Wei
    Kong, Xianguang
    Zhang, Yanning
    COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (04) : 3989 - 4002
  • [26] Super-resolution based on multi-scale feature aggregation adversarial networks Multi-Scale Super-Resolution with Adversarial Networks
    Song, Wei
    Li, Shuo
    Liao, Bin
    Ning, Keqing
    PROCEEDINGS OF 2024 3RD INTERNATIONAL CONFERENCE ON CRYPTOGRAPHY, NETWORK SECURITY AND COMMUNICATION TECHNOLOGY, CNSCT 2024, 2024, : 356 - 360
  • [27] Image Super-Resolution Using Multi-Scale Space Feature and Deformable Convolutional Network
    Jiang, Guosong
    Lu, Zhengwu
    Tu, Xuping
    Guan, Yurong
    Wang, Qingdong
    IEEE ACCESS, 2021, 9 : 74614 - 74621
  • [28] Single image super-resolution based on multi-scale dense attention network
    Gao, Farong
    Wang, Yong
    Yang, Zhangyi
    Ma, Yuliang
    Zhang, Qizhong
    SOFT COMPUTING, 2023, 27 (06) : 2981 - 2992
  • [29] Multi-scale non-local attention network for image super-resolution
    Wu, Xue
    Zhang, Kaibing
    Hu, Yanting
    He, Xin
    Gao, Xinbo
    SIGNAL PROCESSING, 2024, 218
  • [30] Global attention guided multi-scale network for face image super-resolution
    Jinlu Zhang
    Mingliang Liu
    Xiaohang Wang
    Machine Vision and Applications, 2023, 34