RVSRT: Real-time Video Super Resolution Transformer

被引:0
|
作者
Ou, Linlin [1 ,2 ]
Chen, Yuanping [2 ]
机构
[1] Chinese Acad Sci, Comp Network Informat Ctr, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
关键词
Video super resolution; vision transformer; deep learning;
D O I
10.1117/12.2680156
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video super-resolution is the task of converting low-resolution video to high-resolution video. Existing methods with better intuitive effects are mainly based on convolutional neural networks (CNNs), but the architecture is heavy, resulting in a slow inference structure. Aiming at this problem, this paper proposes a real-time video super-resolution Transformer (RVSRT) can quickly complete the super-resolution task while considering the visual fluency of video frame switching. Unlike traditional methods based on CNNs, this paper does not process video frames separately with different network modules in the temporal domain, but batches adjacent frames through a single UNet-style structure end-to-end Transformer network architecture. Moreover, this paper creatively sets up two-stage interpolation sampling before and after the end-to-end network to maximize the performance of the traditional CV algorithm. The experimental results show that compared with SOTA TMNet [1], RVSRT has only 20% of the network size (2.3M vs 12.3M, parameters) while ensuring comparable performance, and the speed is increased by 80% (26.2 fps vs 14.3 fps, frame size is 720*576).
引用
收藏
页数:5
相关论文
共 50 条
  • [31] Real-Time Super-Resolution System of 4K-Video Based on Deep Learning
    Cao, Yanpeng
    Wang, Chengcheng
    Song, Changjun
    Tang, Yongming
    Li, He
    2021 IEEE 32ND INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP 2021), 2021, : 69 - 76
  • [32] Real-time implementation of super-resolution imaging algorithm
    Hein, C
    ADVANCED SIGNAL PROCESSING ALGORITHMS, ARCHITECTURES, AND IMPLEMENTATIONS VIII, 1998, 3461 : 501 - 511
  • [33] Correlative Real-Time and Super Resolution Imaging of Mitochondrial Dynamics
    Balint, Stefan
    Brede, Norman
    Lakadamyali, Melike
    BIOPHYSICAL JOURNAL, 2012, 102 (03) : 379A - 379A
  • [34] SwinAnomaly: Real-Time Video Anomaly Detection Using Video Swin Transformer and SORT
    Bajgoti, Arpit
    Gupta, Rishik
    Balaji, Prasanalakshmi
    Dwivedi, Rinky
    Siwach, Meena
    Gupta, Deepak
    IEEE ACCESS, 2023, 11 : 111093 - 111105
  • [35] Deformable transformer for endoscopic video super-resolution
    Song, Xiaowei
    Tang, Hui
    Yang, Chunfeng
    Zhou, Guangquan
    Wang, Yangang
    Huang, Xinjun
    Hua, Jie
    Coatrieux, Gouenou
    He, Xiaopu
    Chen, Yang
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 77
  • [36] Space-Time Video Super-Resolution 3D Transformer
    Zheng, Minyan
    Luo, Jianping
    MULTIMEDIA MODELING, MMM 2023, PT II, 2023, 13834 : 374 - 385
  • [37] Real-Time Video Super-Resolution with Spatio-Temporal Modeling and Redundancy-Aware Inference
    Wang, Wenhao
    Liu, Zhenbing
    Lu, Haoxiang
    Lan, Rushi
    Zhang, Zhaoyuan
    SENSORS, 2023, 23 (18)
  • [38] Real-Time Video Super-Resolution on Smartphones with Deep Learning, Mobile AI 2021 Challenge: Report
    Ignatov, Andrey
    Romero, Andres
    Kim, Heewon
    Timofte, Radu
    Ho, Chiu Man
    Meng, Zibo
    Lee, Kyoung Mu
    Chen, Yuxiang
    Wang, Yutong
    Long, Zeyu
    Wang, Chenhao
    Chen, Yifei
    Xu, Boshen
    Gu, Shuhang
    Duan, Lixin
    Wen Li
    Wang Bofei
    Zhang Diankai
    Zheng Chengjian
    Liu Shaoli
    Gao Si
    Zhang Xiaofeng
    Lu Kaidi
    Xu Tianyu
    Zheng Hui
    Gao, Xinbo
    Wang, Xiumei
    Guo, Jiaming
    Zhou, Xueyi
    Hao Jia
    Yan, Youliang
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 2535 - 2544
  • [39] Practical considerations for real-time super-resolution implementation. techniques over video coding platforms
    Callicó, GM
    López, S
    Llopis, RP
    Sethuraman, R
    Núñez, A
    López, JF
    Marrero, M
    Sarmiento, R
    VLSI CIRCUITS AND SYSTEMS II, PTS 1 AND 2, 2005, 5837 : 613 - 627
  • [40] Super-resolution-based enhancement for real-time ultra-low-bit-rate video coding
    Chien, Wei-Jung
    Abousleman, Glen P.
    Karam, Lina J.
    MOBILE MULTIMEDIA/IMAGE PROCESSING FOR MILITARY AND SECURITY APPLICATIONS 2007, 2007, 6579