Stereoscopic video quality assessment based on 3D convolutional neural networks

被引:28
|
作者
Yang, Jiachen [1 ]
Zhu, Yinghao [1 ]
Ma, Chaofan [1 ]
Lu, Wen [2 ]
Meng, Qinggang [3 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin, Peoples R China
[2] Xidian Univ, Sch Elect Engn, Xian, Shaanxi, Peoples R China
[3] Loughborough Univ, Dept Comp Sci, Loughborough, Leics, England
基金
中国国家自然科学基金;
关键词
3D convolutional neural networks; Stereoscopic video quality assessment; Quality score fusion; EVALUATOR; IMAGES;
D O I
10.1016/j.neucom.2018.04.072
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The research of stereoscopic video quality assessment (SVQA) plays an important role for promoting the development of stereoscopic video system. Existing SVQA metrics rely on hand-crafted features, which is inaccurate and time-consuming because of the diversity and complexity of stereoscopic video distortion. This paper introduces a 3D convolutional neural networks (CNN) based SVQA framework that can model not only local spatio-temporal information but also global temporal information with cubic difference video patches as input. First, instead of using hand-crafted features, we design a 3D CNN architecture to automatically and effectively capture local spatio-temporal features. Then we employ a quality score fusion strategy considering global temporal clues to obtain final video-level predicted score. Extensive experiments conducted on two public stereoscopic video quality datasets show that the proposed method correlates highly with human perception and outperforms state-of-the-art methods by a large margin. We also show that our 3D CNN features have more desirable property for SVQA than hand-crafted features in previous methods, and our 3D CNN features together with support vector regression (SVR) can further boost the performance. In addition, with no complex preprocessing and GPU acceleration, our proposed method is demonstrated computationally efficient and easy to use. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:83 / 93
页数:11
相关论文
共 50 条
  • [21] Motion Sickness Prediction in Stereoscopic Videos using 3D Convolutional Neural Networks
    Lee, Tae Min
    Yoon, Jong-Chul
    Lee, In-Kwon
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2019, 25 (05) : 1919 - 1927
  • [22] Objective Quality Assessment of Stereoscopic Video Using Inflated 3D Features
    Hassan Imani
    Md Baharul Islam
    SN Computer Science, 5 (6)
  • [23] Performance Comparison of Subjective Assessment Methods for Stereoscopic 3D Video Quality
    Kawano, Taichi
    Yamagishi, Kazuhisa
    Hayashi, Takanori
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2014, E97B (04) : 738 - 745
  • [24] Multi-viewport based 3D convolutional neural network for 360-degree video quality assessment
    Guo, Jiefeng
    Huang, Lianfen
    Chien, Wei-Che
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (12) : 16813 - 16831
  • [25] Multi-viewport based 3D convolutional neural network for 360-degree video quality assessment
    Jiefeng Guo
    Lianfen Huang
    Wei-Che Chien
    Multimedia Tools and Applications, 2022, 81 : 16813 - 16831
  • [26] A Novel Violent Video Detection Scheme Based on Modified 3D Convolutional Neural Networks
    Song, Wei
    Zhang, Dongliang
    Zha, Xiaobing
    Yu, Jing
    Zheng, Rui
    Wang, Antai
    IEEE ACCESS, 2019, 7 : 39172 - 39179
  • [27] Objective Quality Assessment of 3D Stereoscopic Video Based on Motion Vectors and Depth Map Features
    Mahmood, Sawsen Abdulhadi
    Ghani, Rana Fareed
    2015 7TH COMPUTER SCIENCE AND ELECTRONIC ENGINEERING CONFERENCE (CEEC), 2015, : 179 - 183
  • [28] Smoke Detection on Video Sequences Using 3D Convolutional Neural Networks
    Lin, Gaohua
    Zhang, Yongming
    Xu, Gao
    Zhang, Qixing
    FIRE TECHNOLOGY, 2019, 55 (05) : 1827 - 1847
  • [29] Smoke Detection on Video Sequences Using 3D Convolutional Neural Networks
    Gaohua Lin
    Yongming Zhang
    Gao Xu
    Qixing Zhang
    Fire Technology, 2019, 55 : 1827 - 1847
  • [30] 3D Semantic Mapping Based on Convolutional Neural Networks
    Li, Jing
    Liu, Yanyu
    Wang, Junzheng
    Yan, Min
    Yao, Yanzhi
    2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 9303 - 9308