iSeeBetter: Spatio-temporal video super-resolution using recurrent generative back-projection networks

被引:0
|
作者
Aman Chadha
John Britto
M. Mani Roja
机构
[1] Stanford University,Department of Computer Science
[2] University of Massachusetts Amherst,Department of Computer Science
[3] University of Mumbai,Department of Electronics and Telecommunication Engineering
来源
关键词
super resolution; video upscaling; frame recurrence; optical flow; generative adversarial networks; convolutional neural networks;
D O I
暂无
中图分类号
学科分类号
摘要
Recently, learning-based models have enhanced the performance of single-image super-resolution (SISR). However, applying SISR successively to each video frame leads to a lack of temporal coherency. Convolutional neural networks (CNNs) outperform traditional approaches in terms of image quality metrics such as peak signal to noise ratio (PSNR) and structural similarity (SSIM). On the other hand, generative adversarial networks (GANs) offer a competitive advantage by being able to mitigate the issue of a lack of finer texture details, usually seen with CNNs when super-resolving at large upscaling factors. We present iSeeBetter, a novel GAN-based spatio-temporal approach to video super-resolution (VSR) that renders temporally consistent super-resolution videos. iSeeBetter extracts spatial and temporal information from the current and neighboring frames using the concept of recurrent back-projection networks as its generator. Furthermore, to improve the “naturality” of the super-resolved output while eliminating artifacts seen with traditional algorithms, we utilize the discriminator from super-resolution generative adversarial network. Although mean squared error (MSE) as a primary loss-minimization objective improves PSNR/SSIM, these metrics may not capture fine details in the image resulting in misrepresentation of perceptual quality. To address this, we use a four-fold (MSE, perceptual, adversarial, and total-variation loss function. Our results demonstrate that iSeeBetter offers superior VSR fidelity and surpasses state-of-the-art performance.
引用
收藏
页码:307 / 317
页数:10
相关论文
共 50 条
  • [31] Spatio-temporal Super-resolution Network: Enhance Visual Representations for Video Captioning
    Cao, Quanhui
    Tang, Pengjie
    Wang, Hanli
    2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 3125 - 3129
  • [32] A multi-stage spatio-temporal adaptive network for video super-resolution
    Zhang, Yuhang
    Chen, Zhenzhong
    Liu, Shan
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 87
  • [33] Improved Iterative Back Projection for Video Super-Resolution
    Rasti, Pejman
    Demirel, Hasan
    Anbarjafari, Gholamreza
    2014 22ND SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2014, : 552 - 555
  • [34] Bidirectional Temporal-Recurrent Propagation Networks for Video Super-Resolution
    Han, Lei
    Fan, Cien
    Yang, Ye
    Zou, Lian
    ELECTRONICS, 2020, 9 (12) : 1 - 15
  • [35] Morphology Based Iterative Back-Projection for Super-Resolution Reconstruction of Image
    Nayak, Rajashree
    Harshavardhan, Saka
    Patra, Dipti
    PROCEEDINGS ON 2014 2ND INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGY TRENDS IN ELECTRONICS, COMMUNICATION AND NETWORKING (ET2ECN), 2014,
  • [36] Non-local feature back-projection for image super-resolution
    Zhang, Xin
    Liu, Qian
    Li, Xuemei
    Zhou, Yuanfeng
    Zhang, Caiming
    IET IMAGE PROCESSING, 2016, 10 (05) : 398 - 408
  • [37] Progressive back-projection network for COVID-CT super-resolution
    Song, Zhaoyang
    Zhao, Xiaoqiang
    Hui, Yongyong
    Jiang, Hongmei
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2021, 208
  • [38] Multiscale Feature Fusion Back-projection Network for Image Super-resolution
    Sun C.-W.
    Chen X.
    Zidonghua Xuebao/Acta Automatica Sinica, 2021, 47 (07): : 1689 - 1700
  • [39] Spatio-temporal Super-resolution with Photographic and Depth Data using GANs
    Lim, Steffen
    Khan, Sams
    Alessandro, Matteo
    McFall, Kevin
    PROCEEDINGS OF THE 2019 ANNUAL ACM SOUTHEAST CONFERENCE (ACMSE 2019), 2019, : 262 - 263
  • [40] Super-resolution onmidirectional camera images using spatio-temporal analysis
    Kawasaki, H
    Ikeuchi, K
    Sakauchi, M
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 2006, 89 (06): : 47 - 59