Learning Generalized Spatial-Temporal Deep Feature Representation for No-Reference Video Quality Assessment

被引:45
|
作者
Chen, Baoliang [1 ]
Zhu, Lingyu [1 ]
Li, Guo [2 ]
Lu, Fangbo [2 ]
Fan, Hongfei [2 ]
Wang, Shiqi [1 ]
机构
[1] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[2] Kingsoft Cloud, Beijing 100000, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Quality assessment; Training; Video recording; Image quality; Streaming media; Nonlinear distortion; Video quality assessment; generalization capability; deep neural networks; temporal aggregation; IMAGE; STATISTICS; DATABASE;
D O I
10.1109/TCSVT.2021.3088505
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this work, we propose a no-reference video quality assessment method, aiming to achieve high-generalization capability in cross-content, -resolution and -frame rate quality prediction. In particular, we evaluate the quality of a video by learning effective feature representations in spatial-temporal domain. In the spatial domain, to tackle the resolution and content variations, we impose the Gaussian distribution constraints on the quality features. The unified distribution can significantly reduce the domain gap between different video samples, resulting in more generalized quality feature representation. Along the temporal dimension, inspired by the mechanism of visual perception, we propose a pyramid temporal aggregation module by involving the short-term and long-term memory to aggregate the frame-level quality. Experiments show that our method outperforms the state-of-the-art methods on cross-dataset settings, and achieves comparable performance on intra-dataset configurations, demonstrating the high-generalization capability of the proposed method. The codes are released at https://github.com/Baoliang93/GSTVQA
引用
收藏
页码:1903 / 1916
页数:14
相关论文
共 50 条
  • [41] Feature Selection for Neural-Network Based No-Reference Video Quality Assessment
    Culibrk, Dubravko
    Kukolj, Dragan
    Vasiljevic, Petar
    Pokric, Maja
    Zlokolica, Vladimir
    ARTIFICIAL NEURAL NETWORKS - ICANN 2009, PT II, 2009, 5769 : 633 - 642
  • [42] Feature-based no-reference video quality assessment using Extra Trees
    Otroshi-Shahreza, Hatef
    Amini, Arash
    Behroozi, Hamid
    IET IMAGE PROCESSING, 2022, 16 (06) : 1531 - 1543
  • [43] No-Reference Hyperspectral Image Quality Assessment via Ranking Feature Learning
    Li, Yuyan
    Dong, Yubo
    Li, Haoyong
    Liu, Danhua
    Xue, Fang
    Gao, Dahua
    REMOTE SENSING, 2024, 16 (10)
  • [44] Learning Based Hybrid No-reference Video Quality Assessment of Compressed Videos
    Fazliani, Yasamin
    Andrade, Ernesto
    Shirani, Shahram
    2019 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2019,
  • [45] Prediction and Modeling for No-Reference Video Quality Assessment based on Machine Learning
    Pedro Lopez, Juan
    Martin, David
    Jimenez, David
    Manuel Menendez, Jose
    2018 14TH INTERNATIONAL CONFERENCE ON SIGNAL IMAGE TECHNOLOGY & INTERNET BASED SYSTEMS (SITIS), 2018, : 56 - 63
  • [46] Yoga Posture Recognition by Learning Spatial-Temporal Feature with Deep Learning Techniques
    Palanimeera, J.
    Ponmozhi, K.
    INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2024, 24 (06)
  • [47] No-reference video quality assessment based on modeling temporal-memory effects
    Pan, Da
    Wang, XueTing
    Shi, Ping
    Yu, ShaoDe
    DISPLAYS, 2021, 70
  • [48] No-Reference Video Quality Assessment Using Statistical Features Along Temporal Trajectory
    Yao, Jie
    Xie, Yongqiang
    Tan, Jianming
    Li, Zhongbo
    Qi, Jin
    Gao, Lanlan
    2012 INTERNATIONAL WORKSHOP ON INFORMATION AND ELECTRONICS ENGINEERING, 2012, 29 : 947 - 951
  • [49] MODELING SPARSE SPATIO-TEMPORAL REPRESENTATIONS FOR NO-REFERENCE VIDEO QUALITY ASSESSMENT
    Shabeer, Muhammed P.
    Bhati, Saurabhchand
    Channappayya, Sumohana S.
    2017 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2017), 2017, : 1220 - 1224
  • [50] STAN: Spatio-Temporal Alignment Network for No-Reference Video Quality Assessment
    Yang, Zhengyi
    Dang, Yuanjie
    Xiang, Jianjun
    Chen, Peng
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT III, 2023, 14256 : 160 - 171