Video Super-Resolution via a Spatio-Temporal Alignment Network

Cited by: 31
Authors
Wen, Weilei [1]
Ren, Wenqi [2, 3]
Shi, Yinghuan [3]
Nie, Yunfeng [4]
Zhang, Jingang [5]
Cao, Xiaochun [2]
Affiliations
[1] Nankai Univ, Coll Comp Sci, TKLNDST, Tianjin 300350, Peoples R China
[2] Sun Yat Sen Univ Shenzhen, Sch Cyber Sci & Technol, Shenzhen 518107, Peoples R China
[3] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing 210093, Peoples R China
[4] Vrije Univ Brussel, Brussels Photon, Dept Appl Phys & Photon, B-1050 Brussels, Belgium
[5] Univ Chinese Acad Sci, Intelligent Imaging Ctr, Beijing 100049, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Superresolution; Motion compensation; Estimation; Integrated optics; Optical imaging; Image reconstruction; Feature extraction; Video super-resolution; temporal consistency; spatio-temporal adaptive filters;
DOI
10.1109/TIP.2022.3146625
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Deep convolutional neural network based video super-resolution (SR) models have achieved significant progress in recent years. Existing deep video SR methods usually rely on optical flow to warp the neighboring frames for temporal alignment. However, accurate estimation of optical flow is quite difficult, and inaccurate flow tends to produce artifacts in the super-resolved results. To address this problem, we propose a novel end-to-end deep convolutional network that dynamically generates spatially adaptive filters for the alignment; these filters are constituted by the local spatio-temporal channels of each pixel. Our method avoids explicit motion compensation and instead uses spatio-temporal adaptive filters to perform the alignment, which effectively fuses the multi-frame information and improves the temporal consistency of the video. Building on the proposed adaptive filter, we develop a reconstruction network that takes the aligned frames as input to restore the high-resolution frames. In addition, we employ residual modules embedded with channel attention as the basic unit to extract more informative features for video SR. Both quantitative and qualitative evaluation results on three public video datasets demonstrate that the proposed method performs favorably against state-of-the-art super-resolution methods in terms of clarity and texture detail.
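As a rough illustration of the alignment idea described in the abstract, the sketch below applies a separate spatio-temporal filter at every output pixel, so that each output value is a weighted sum over a small window in all input frames and no optical flow is computed. It is not the authors' implementation: in the paper the filters are generated by a network, whereas here they are supplied by hand, and the function names `apply_adaptive_filters` and `one_hot_filter`, the window size `k`, the weight ordering, and the border clamping are all assumptions made for this example.

```python
def one_hot_filter(t, dy, dx, t_count, k=3):
    """Hypothetical helper: a filter that copies one pixel at offset
    (dy, dx) from frame t and ignores everything else."""
    r = k // 2
    weights = [0.0] * (t_count * k * k)
    weights[t * k * k + (dy + r) * k + (dx + r)] = 1.0
    return weights


def apply_adaptive_filters(frames, filters, k=3):
    """Fuse T frames with per-pixel spatio-temporal filters.

    frames:  list of T frames, each an H x W list of floats.
    filters: H x W grid; filters[y][x] is a flat list of T*k*k weights
             (ordered by frame, then row, then column) for pixel (y, x).
    Returns one H x W frame.
    """
    t_count = len(frames)
    height, width = len(frames[0]), len(frames[0][0])
    r = k // 2
    out = [[0.0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            weights = filters[y][x]
            acc = 0.0
            idx = 0
            for t in range(t_count):
                for dy in range(-r, r + 1):
                    for dx in range(-r, r + 1):
                        # Assumed border handling: clamp coordinates
                        # so the window never leaves the frame.
                        yy = min(max(y + dy, 0), height - 1)
                        xx = min(max(x + dx, 0), width - 1)
                        acc += weights[idx] * frames[t][yy][xx]
                        idx += 1
            out[y][x] = acc
    return out
```

For example, if a neighboring frame is the reference frame shifted one pixel to the right, giving every pixel the filter `one_hot_filter(t=1, dy=0, dx=1, t_count=2)` pulls that frame back into register away from the border. A learned filter would instead spread its weights over several positions and frames, which is how alignment and multi-frame fusion can happen in a single step.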
Pages: 1761 - 1773
Number of pages: 13
Related papers
50 in total
  • [31] Encoding and decoding spatio-temporal information for super-resolution microscopy
    Lanzano, Luca
    Hernandez, Ivan Coto
    Castello, Marco
    Gratton, Enrico
    Diaspro, Alberto
    Vicidomini, Giuseppe
    NATURE COMMUNICATIONS, 2015, 6
  • [32] Sparse Spatio-Temporal Representation with Adaptive Regularized Dictionaries for Super-Resolution Based Video Coding
    Pan, Zhiming
    Xiong, Hongkai
    2012 DATA COMPRESSION CONFERENCE (DCC), 2012, : 139 - 148
  • [33] Patch-based spatio-temporal super-resolution for video with non-rigid motion
    Salvador, Jordi
    Kochale, Axel
    Schweidler, Siegfried
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2013, 28 (05) : 483 - 493
  • [34] A Novel Zero-Shot Real World Spatio-Temporal Super-Resolution (ZS-RW-STSR) Model for Video Super-Resolution
    Shukla, Ankit
    Upadhyay, Avinash
    Sharma, Manoj
    Saini, Anil
    Fatema, Nuzhat
    Malik, Hasmat
    Afthanorhan, Asyraf
    Hossaini, Mohammad Asef
    IEEE ACCESS, 2024, 12 : 123969 - 123984
  • [35] Deep Video Matting via Spatio-Temporal Alignment and Aggregation
    Sun, Yanan
    Wang, Guanzhi
    Gu, Qiao
    Tang, Chi-Keung
    Tai, Yu-Wing
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 6971 - 6980
  • [36] DFVSR: Directional Frequency Video Super-Resolution via Asymmetric and Enhancement Alignment Network
    Dong, Shuting
    Lu, Feng
    Wu, Zhe
    Yuan, Chun
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 681 - 689
  • [37] Video super-resolution with non-local alignment network
    Zhou, Chao
    Chen, Can
    Ding, Fei
    Zhang, Dengyin
    IET IMAGE PROCESSING, 2021, 15 (08) : 1655 - 1667
  • [38] AttGAN: attention gated generative adversarial network for spatio-temporal super-resolution of ocean phenomena
    Liu, Yanni
    Wang, Xinjie
    Yuan, Chunxin
    Xu, Jiexin
    Wei, Zhiqiang
    Nie, Jie
    INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2024, 17 (01)
  • [39] Spatio-temporal Super-resolution with Photographic and Depth Data using GANs
    Lim, Steffen
    Khan, Sams
    Alessandro, Matteo
    McFall, Kevin
    PROCEEDINGS OF THE 2019 ANNUAL ACM SOUTHEAST CONFERENCE (ACMSE 2019), 2019, : 262 - 263
  • [40] Super-resolution omnidirectional camera images using spatio-temporal analysis
    Kawasaki, H
    Ikeuchi, K
    Sakauchi, M
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 2006, 89 (06) : 47 - 59