iSeeBetter: Spatio-temporal video super-resolution using recurrent generative back-projection networks

被引:0
|
作者
Aman Chadha
John Britto
M. Mani Roja
机构
[1] Stanford University,Department of Computer Science
[2] University of Massachusetts Amherst,Department of Computer Science
[3] University of Mumbai,Department of Electronics and Telecommunication Engineering
来源
关键词
super resolution; video upscaling; frame recurrence; optical flow; generative adversarial networks; convolutional neural networks;
D O I
暂无
中图分类号
学科分类号
摘要
Recently, learning-based models have enhanced the performance of single-image super-resolution (SISR). However, applying SISR successively to each video frame leads to a lack of temporal coherency. Convolutional neural networks (CNNs) outperform traditional approaches in terms of image quality metrics such as peak signal to noise ratio (PSNR) and structural similarity (SSIM). On the other hand, generative adversarial networks (GANs) offer a competitive advantage by being able to mitigate the issue of a lack of finer texture details, usually seen with CNNs when super-resolving at large upscaling factors. We present iSeeBetter, a novel GAN-based spatio-temporal approach to video super-resolution (VSR) that renders temporally consistent super-resolution videos. iSeeBetter extracts spatial and temporal information from the current and neighboring frames using the concept of recurrent back-projection networks as its generator. Furthermore, to improve the “naturality” of the super-resolved output while eliminating artifacts seen with traditional algorithms, we utilize the discriminator from super-resolution generative adversarial network. Although mean squared error (MSE) as a primary loss-minimization objective improves PSNR/SSIM, these metrics may not capture fine details in the image resulting in misrepresentation of perceptual quality. To address this, we use a four-fold (MSE, perceptual, adversarial, and total-variation loss function. Our results demonstrate that iSeeBetter offers superior VSR fidelity and surpasses state-of-the-art performance.
引用
收藏
页码:307 / 317
页数:10
相关论文
共 50 条
  • [21] Spatio-temporal Super-Resolution Using Depth Map
    Awatsu, Yusaku
    Kawai, Norihiko
    Sato, Tomokazu
    Yokoya, Naokazu
    IMAGE ANALYSIS, PROCEEDINGS, 2009, 5575 : 696 - 705
  • [22] Super-resolution reconstruction using spatio-temporal filtering
    Goldberg, N
    Feuer, A
    Goodwin, GC
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2003, 14 (04) : 508 - 525
  • [23] REMOTE SENSING IMAGE SUPER-RESOLUTION VIA ENHANCED BACK-PROJECTION NETWORKS
    Dong, Xiaoyu
    Xi, Zhihong
    Sun, Xu
    Yang, Lina
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 1480 - 1483
  • [24] Efficient Spatio-Temporal Network with Gated Fusion for Video Super-Resolution
    Li, Changyu
    Zhang, Dongyang
    Xie, Ning
    Shao, Jie
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2021, PT V, 2021, 12895 : 640 - 651
  • [25] Lightweight video super-resolution based on hybrid spatio-temporal convolution
    Xia, Zhenping
    Chen, Hao
    Zhang, Yuning
    Cheng, Cheng
    Hu, Fuyuan
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2024, 32 (16): : 2564 - 2576
  • [26] Towards efficient motion-blurred public security video super-resolution based on back-projection networks
    Guo, Kehua
    Guo, Haifu
    Ren, Sheng
    Zhang, Jian
    Li, Xi
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2020, 166
  • [27] Novel Back-projection Framework for Single Image Super-Resolution
    Zhao, Bin
    Gan, Zongliang
    Zhang, Yanbin
    Liu, Feng
    Wang, Huanjuan
    PROCEEDINGS OF 2012 IEEE 11TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) VOLS 1-3, 2012, : 894 - 898
  • [28] Deep iterative residual back-projection networks for single-image super-resolution
    Tian, Chuan
    Hu, Jing
    Wu, Xi
    Wen, Wu
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (02)
  • [29] Medical Video Super-Resolution Based on Asymmetric Back-Projection Network With Multilevel Error Feedback
    Ren, Sheng
    Li, Jianqi
    Guo, Kehua
    Li, Fangfang
    IEEE ACCESS, 2021, 9 : 17909 - 17920
  • [30] DSTnet: Deformable Spatio-Temporal Convolutional Residual Network for Video Super-Resolution
    Khan, Anusha
    Sargano, Allah Bux
    Habib, Zulfiqar
    MATHEMATICS, 2021, 9 (22)