iSeeBetter: Spatio-temporal video super-resolution using recurrent generative back-projection networks

被引:0
|
作者
Aman Chadha
John Britto
M. Mani Roja
机构
[1] Stanford University,Department of Computer Science
[2] University of Massachusetts Amherst,Department of Computer Science
[3] University of Mumbai,Department of Electronics and Telecommunication Engineering
来源
关键词
super resolution; video upscaling; frame recurrence; optical flow; generative adversarial networks; convolutional neural networks;
D O I
暂无
中图分类号
学科分类号
摘要
Recently, learning-based models have enhanced the performance of single-image super-resolution (SISR). However, applying SISR successively to each video frame leads to a lack of temporal coherency. Convolutional neural networks (CNNs) outperform traditional approaches in terms of image quality metrics such as peak signal to noise ratio (PSNR) and structural similarity (SSIM). On the other hand, generative adversarial networks (GANs) offer a competitive advantage by being able to mitigate the issue of a lack of finer texture details, usually seen with CNNs when super-resolving at large upscaling factors. We present iSeeBetter, a novel GAN-based spatio-temporal approach to video super-resolution (VSR) that renders temporally consistent super-resolution videos. iSeeBetter extracts spatial and temporal information from the current and neighboring frames using the concept of recurrent back-projection networks as its generator. Furthermore, to improve the “naturality” of the super-resolved output while eliminating artifacts seen with traditional algorithms, we utilize the discriminator from super-resolution generative adversarial network. Although mean squared error (MSE) as a primary loss-minimization objective improves PSNR/SSIM, these metrics may not capture fine details in the image resulting in misrepresentation of perceptual quality. To address this, we use a four-fold (MSE, perceptual, adversarial, and total-variation loss function. Our results demonstrate that iSeeBetter offers superior VSR fidelity and surpasses state-of-the-art performance.
引用
收藏
页码:307 / 317
页数:10
相关论文
共 50 条
  • [1] iSeeBetter: Spatio-temporal video super-resolution using recurrent generative back-projection networks
    Aman Chadha
    John Britto
    M.Mani Roja
    Computational Visual Media, 2020, 6 (03) : 307 - 317
  • [2] iSeeBetter: Spatio-temporal video super-resolution using recurrent generative back-projection networks
    Chadha, Aman
    Britto, John
    Roja, M. Mani
    COMPUTATIONAL VISUAL MEDIA, 2020, 6 (03) : 307 - 317
  • [3] Recurrent Back-Projection Network for Video Super-Resolution
    Haris, Muhammad
    Shakhnarovich, Greg
    Ukita, Norimichi
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3892 - 3901
  • [4] Deep Back-Projection Networks For Super-Resolution
    Haris, Muhammad
    Shakhnarovich, Greg
    Ukita, Norimichi
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1664 - 1673
  • [5] Bidirectional spatio-temporal generative adversarial network for video super-resolution
    Yang, Peng
    Chen, Zhangquan
    Sun, Yuankang
    Hu, Zhongjian
    Li, Bing
    PATTERN ANALYSIS AND APPLICATIONS, 2025, 28 (01)
  • [6] Arbitrary Back-Projection Networks for Image Super-Resolution
    Ma, Tingsong
    Tian, Wenhong
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2020, 19 (04)
  • [7] Towards Efficient Medical Video Super-Resolution based on Deep Back-Projection Networks
    Ren, Sheng
    Guo, Haifu
    Guo, Kehua
    2019 INTERNATIONAL CONFERENCE ON INTERNET OF THINGS (ITHINGS) AND IEEE GREEN COMPUTING AND COMMUNICATIONS (GREENCOM) AND IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING (CPSCOM) AND IEEE SMART DATA (SMARTDATA), 2019, : 682 - 686
  • [8] Spatio-Temporal Fusion Network for Video Super-Resolution
    Li, Huabin
    Zhang, Pingjian
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [9] Video Super-Resolution by Motion Compensated Iterative Back-Projection Approach
    Hsieh, Chen-Chiung
    Huang, Yo-Ping
    Chen, Yu-Yi
    Fuh, Chiou-Shann
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2011, 27 (03) : 1107 - 1122
  • [10] Super-resolution using Neighbor Embedding of Back-projection residuals
    Bevilacqua, Marco
    Roumy, Aline
    Guillemot, Christine
    Morel, Marie-Line Alberi
    2013 18TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2013,