A Deep Model of Visual Attention for Saliency Detection on 3D Objects

被引:2
|
作者
Rouhafzay, Ghazal [1 ]
Cretu, Ana-Maria [2 ]
Payeur, Pierre [1 ]
机构
[1] Univ Ottawa, Sch Elect Engn & Comp Sci, Ottawa, ON, Canada
[2] Univ Quebec & Outaouais, Dept Engn & Comp Sci, Gatineau, PQ, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Visual processing; Eye fixation; 3D shapes; Convolutional neural network; Class activation mapping;
D O I
10.1007/s11063-023-11180-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A variety of saliency detection techniques have been proposed during the last two decades to determine important regions on the surface of 3D shapes in form of triangular meshes. However, most fail in predicting the regions where human eyes naturally fixate when observing and exploring an object. Taking inspiration from biological studies that enumerate a list of object characteristics revealed in human visual processing and the influence of semantic properties in the emergence of neural responses in human brain, in this work, we propose a deep convolutional neural network architecture using gradient-based class activation mapping to detect saliencies on the surface of 3D objects when classifying them based on their different properties. We further argue that using Pearson Correlation Coefficient is not sufficient for the evaluation of saliency values and therefore propose a novel evaluation technique to determine how reliable is the detection performed by saliency detectors to predict eye fixations. More specifically, this evaluation metric measures the distance between the most salient region detected and the respective location of human eye fixation. Evaluating the results based on visual comparison, as well as using the proposed evaluation technique, demonstrates that our model is successful in predicting the locations where human eye fixates. Results are compared with five state-of-the-art saliency detectors, and our experiments suggest that in average the location of the highest saliency detected by our approach is closer to the location of human eye fixation by about 22.55% to 77.76% in comparison with five state-of-the art method" on a public dataset.
引用
收藏
页码:8847 / 8867
页数:21
相关论文
共 50 条
  • [21] Learning Stereoscopic Visual Attention Model for 3D Video
    Huang, Gang-jian
    Du, Xin
    Zhu, Yun-fang
    2015 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATIONS (CSA), 2015, : 6 - 9
  • [22] View Selection of 3D Objects Based On Saliency Segmentation
    Han, Honglei
    Li, Jing
    Wang, Wencheng
    Zhao, Huiwen
    Hua, Miao
    2014 INTERNATIONAL CONFERENCE ON VIRTUAL REALITY AND VISUALIZATION (ICVRV2014), 2014, : 214 - 219
  • [23] Visual Saliency Detection Framework for 3D Environment using Virtual Reality Devices
    Hong, Shuai
    Cheng, Hui
    Mao, Bo
    PROCEEDINGS OF THE 2018 IEEE 22ND INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN ((CSCWD)), 2018, : 319 - 324
  • [24] Shot Boundary Detection with 3D Depthwise Convolutions and Visual Attention
    Brotons, Miguel Jose Esteve
    Lucendo, Francisco Javier
    Javier, Rodriguez-Juan
    Garcia-Rodriguez, Jose
    SENSORS, 2023, 23 (16)
  • [25] Deep Saliency Mapping for 3D Meshes and Applications
    Nousias, Stavros
    Arvanitis, Gerasimos
    Lalos, Aris
    Moustakas, Konstantinos
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (02)
  • [26] 3D Lane Detection With Attention in Attention
    Gu, Yinchao
    Ma, Chao
    Li, Qian
    Yang, Xiaokang
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 (1104-1108) : 1104 - 1108
  • [27] Image retargeting with a 3D saliency model
    Chen, Yanxiang
    Pan, Yifei
    Song, Minglong
    Wang, Meng
    SIGNAL PROCESSING, 2015, 112 : 53 - 63
  • [28] Model performance for visual attention in real 3D color scenes
    Hügli, H
    Jost, T
    Ouerhani, N
    ARTIFICIAL INTELLIGENCE AND KNOWLEDGE ENGINEERING APPLICATIONS: A BIOINSPIRED APPROACH, PT 2, PROCEEDINGS, 2005, 3562 : 469 - 478
  • [29] Symmetry Detection of 3D Objects
    Jung, Wookyoung
    Yamauchi, Takashi
    JOURNAL OF COGNITIVE SCIENCE, 2011, 12 (01) : 33 - 64
  • [30] VISUAL SALIENCY DRIVEN ERROR PROTECTION FOR 3D VIDEO
    Hewage, Chaminda T. E. R.
    Wang, Junle
    Martini, Maria G.
    Le Callet, Patrick
    ELECTRONIC PROCEEDINGS OF THE 2013 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2013,