Stereoscopic video quality assessment based on 3D convolutional neural networks

被引:28
|
作者
Yang, Jiachen [1 ]
Zhu, Yinghao [1 ]
Ma, Chaofan [1 ]
Lu, Wen [2 ]
Meng, Qinggang [3 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin, Peoples R China
[2] Xidian Univ, Sch Elect Engn, Xian, Shaanxi, Peoples R China
[3] Loughborough Univ, Dept Comp Sci, Loughborough, Leics, England
基金
中国国家自然科学基金;
关键词
3D convolutional neural networks; Stereoscopic video quality assessment; Quality score fusion; EVALUATOR; IMAGES;
D O I
10.1016/j.neucom.2018.04.072
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The research of stereoscopic video quality assessment (SVQA) plays an important role for promoting the development of stereoscopic video system. Existing SVQA metrics rely on hand-crafted features, which is inaccurate and time-consuming because of the diversity and complexity of stereoscopic video distortion. This paper introduces a 3D convolutional neural networks (CNN) based SVQA framework that can model not only local spatio-temporal information but also global temporal information with cubic difference video patches as input. First, instead of using hand-crafted features, we design a 3D CNN architecture to automatically and effectively capture local spatio-temporal features. Then we employ a quality score fusion strategy considering global temporal clues to obtain final video-level predicted score. Extensive experiments conducted on two public stereoscopic video quality datasets show that the proposed method correlates highly with human perception and outperforms state-of-the-art methods by a large margin. We also show that our 3D CNN features have more desirable property for SVQA than hand-crafted features in previous methods, and our 3D CNN features together with support vector regression (SVR) can further boost the performance. In addition, with no complex preprocessing and GPU acceleration, our proposed method is demonstrated computationally efficient and easy to use. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:83 / 93
页数:11
相关论文
共 50 条
  • [1] VIRTUAL REALITY VIDEO QUALITY ASSESSMENT BASED ON 3D CONVOLUTIONAL NEURAL NETWORKS
    Wu, Pei
    Ding, Wenxin
    You, Zhixiang
    An, Ping
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 3187 - 3191
  • [2] 3D Panoramic Virtual Reality Video Quality Assessment Based on 3D Convolutional Neural Networks
    Yang, Jiachen
    Liu, Tianlin
    Jiang, Bin
    Song, Houbing
    Lu, Wen
    IEEE ACCESS, 2018, 6 : 38669 - 38682
  • [3] MULTI-SCALE FEATURE-GUIDED STEREOSCOPIC VIDEO QUALITY ASSESSMENT BASED ON 3D CONVOLUTIONAL NEURAL NETWORK
    Feng, Yingjie
    Li, Sumei
    Chang, Yongli
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2095 - 2099
  • [4] No-Reference Video Quality Assessment With 3D Shearlet Transform and Convolutional Neural Networks
    Li, Yuming
    Po, Lai-Man
    Cheung, Chun-Ho
    Xu, Xuyuan
    Feng, Litong
    Yuan, Fang
    Cheung, Kwok-Wai
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2016, 26 (06) : 1044 - 1057
  • [5] Stereoscopic 3D video quality assessment based on depth maps and video motion
    Juan Pedro López
    Juan Antonio Rodrigo
    David Jiménez
    José Manuel Menéndez
    EURASIP Journal on Image and Video Processing, 2013
  • [6] Stereoscopic 3D video quality assessment based on depth maps and video motion
    Pedro Lopez, Juan
    Antonio Rodrigo, Juan
    Jimenez, David
    Manuel Menendez, Jose
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2013,
  • [7] Video-based surgical skill assessment using 3D convolutional neural networks
    Isabel Funke
    Sören Torge Mees
    Jürgen Weitz
    Stefanie Speidel
    International Journal of Computer Assisted Radiology and Surgery, 2019, 14 : 1217 - 1225
  • [8] Full-Reference Video Quality Assessment Using Deep 3D Convolutional Neural Networks
    Dendi, Sathya Veera Reddy
    Krishnappa, Gokul
    Channappayya, Sumohana S.
    2019 25TH NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2019,
  • [9] Subjective quality assessment of asymmetric stereoscopic 3D video
    Payman Aflaki
    Miska M. Hannuksela
    Moncef Gabbouj
    Signal, Image and Video Processing, 2015, 9 : 331 - 345
  • [10] Saliency inspired quality assessment of stereoscopic 3D video
    Banitalebi-Dehkordi, Amin
    Nasiopoulos, Panos
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (19) : 26055 - 26082