Study of Spatio-Temporal Modeling in Video Quality Assessment

Cited by: 7
Authors
Fang, Yuming [1]
Li, Zhaoqian [1]
Yan, Jiebin [1]
Sui, Xiangjie [1]
Liu, Hantao [2]
Affiliations
[1] Jiangxi Univ Finance & Econ, Sch Informat Technol, Nanchang 330032, Jiangxi, Peoples R China
[2] Cardiff Univ, Sch Comp Sci & Informat, Cardiff CF24 3AA, Wales
Funding
China Postdoctoral Science Foundation; National Natural Science Foundation of China
Keywords
Video quality assessment; spatio-temporal modeling; recurrent neural network; PREDICTION; DATABASE; FLOW;
DOI
10.1109/TIP.2023.3272480
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Video quality assessment (VQA) has received remarkable attention recently. Most popular VQA models employ recurrent neural networks (RNNs) to capture the temporal quality variation of videos. However, each long-term video sequence is commonly labeled with a single quality score, from which RNNs might not be able to learn long-term quality variation well. What, then, is the real role of RNNs in learning the visual quality of videos? Do they learn spatio-temporal representations as expected, or do they just aggregate spatial features redundantly? In this study, we conduct a comprehensive investigation by training a family of VQA models with carefully designed frame sampling strategies and spatio-temporal fusion methods. Our extensive experiments on four publicly available in-the-wild video quality datasets lead to two main findings. First, the plausible spatio-temporal modeling module (i.e., RNNs) does not facilitate quality-aware spatio-temporal feature learning. Second, sparsely sampled video frames can achieve performance competitive with using all video frames as input. In other words, spatial features play a vital role in capturing video quality variation for VQA. To the best of our knowledge, this is the first work to explore the issue of spatio-temporal modeling in VQA.
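The abstract contrasts sparse frame sampling and simple aggregation of spatial features with RNN-based temporal modeling. The following is a minimal sketch of that idea, not the authors' implementation: `sample_frame_indices` and `pool_quality` are hypothetical helper names, the evenly spaced midpoint sampling is an assumed strategy, and the per-frame scores are toy values standing in for the output of a spatial quality model.

```python
from statistics import mean


def sample_frame_indices(num_frames: int, num_samples: int) -> list[int]:
    """Pick up to num_samples frame indices spread evenly over a video.

    Assumed strategy (not from the paper): split the video into
    num_samples equal-length segments and take each segment's midpoint.
    """
    if num_frames <= 0 or num_samples <= 0:
        raise ValueError("num_frames and num_samples must be positive")
    if num_samples >= num_frames:
        # Nothing to subsample: use every frame.
        return list(range(num_frames))
    step = num_frames / num_samples
    return [int(step * i + step / 2) for i in range(num_samples)]


def pool_quality(frame_scores: list[float], indices: list[int]) -> float:
    """Average the per-frame scores at the sampled indices.

    Mean pooling is a simple stand-in for temporal fusion; it uses no
    recurrent model and ignores frame order.
    """
    return mean(frame_scores[i] for i in indices)


if __name__ == "__main__":
    # Toy per-frame scores for a 100-frame video (hypothetical values).
    scores = [3.0 + 0.01 * i for i in range(100)]
    sparse = sample_frame_indices(len(scores), 4)
    dense = sample_frame_indices(len(scores), len(scores))
    print("sparse indices:", sparse)
    print("sparse estimate:", pool_quality(scores, sparse))
    print("dense estimate:", pool_quality(scores, dense))
```

On a smoothly varying toy signal like this one, the sparse and dense estimates are close. This only illustrates why sparse sampling can be competitive and does not reproduce the paper's experiments.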
Pages: 2693-2702
Number of pages: 10