Do Humans Look Where Deep Convolutional Neural Networks "Attend"?

被引:5
|
作者
Ebrahimpour, Mohammad K. [1 ]
Ben Falandays, J. [2 ]
Spevack, Samuel [2 ]
Noelle, David C. [1 ,2 ]
机构
[1] Univ Calif, EECS, Merced, CA 95343 USA
[2] Univ Calif, Cognit & Informat Sci, Merced, CA USA
关键词
Visual spatial attention; Computer vision; Convolutional Neural Networks; Densely connected attention maps; Class Activation Maps; Sensitivity analysis;
D O I
10.1007/978-3-030-33723-0_5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep Convolutional Neural Networks (CNNs) have recently begun to exhibit human level performance on some visual perception tasks. Performance remains relatively poor, however, on some vision tasks, such as object detection: specifying the location and object class for all objects in a still image. We hypothesized that this gap in performance may be largely due to the fact that humans exhibit selective attention, while most object detection CNNs have no corresponding mechanism. In examining this question, we investigated some well-known attention mechanisms in the deep learning literature, identifying their weaknesses and leading us to propose a novel attention algorithm called the Densely Connected Attention Model. We then measured human spatial attention, in the form of eye tracking data, during the performance of an analogous object detection task. By comparing the learned representations produced by various CNN architectures with that exhibited by human viewers, we identified some relative strengths and weaknesses of the examined computational attention mechanisms. Some CNNs produced attentional patterns somewhat similar to those of humans. Others focused processing on objects in the foreground. Still other CNN attentional mechanisms produced usefully interpretable internal representations. The resulting comparisons provide insights into the relationship between CNN attention algorithms and the human visual system.
引用
收藏
页码:53 / 65
页数:13
相关论文
共 50 条
  • [1] Do Deep Convolutional Neural Networks Perform Scene Segmentation in a Similar Way Humans Do?
    Seijdel, Noor
    Tsakmakidis, Nikos
    de Haan, Edward H. F.
    Bohte, Sander M.
    Scholte, H. Steven
    PERCEPTION, 2019, 48 : 77 - 78
  • [2] Do Humans and Convolutional Neural Networks Attend to Similar Areas during Scene Classification: Effects of Task and Image Type
    Mueller, Romy
    Duerschmidt, Marcel
    Ullrich, Julian
    Knoll, Carsten
    Weber, Sascha
    Seitz, Steffen
    APPLIED SCIENCES-BASEL, 2024, 14 (06):
  • [3] Configural relations in humans and deep convolutional neural networks
    Baker, Nicholas
    Garrigan, Patrick
    Phillips, Austin
    Kellman, Philip J. J.
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2023, 5
  • [4] Do Humans and Deep Convolutional Neural Networks Use Visual Information Similarly for the Categorization of Natural Scenes?
    De Cesarei, Andrea
    Cavicchi, Shari
    Cristadoro, Giampaolo
    Lippi, Marco
    COGNITIVE SCIENCE, 2021, 45 (06)
  • [5] Object recognition in deep convolutional neural networks is fundamentally different to that in humans
    Lonnqvist, Ben
    Clarke, Alasdair D. F.
    Chakravarthi, Ramakrishna
    I-PERCEPTION, 2019, 10 : 7 - 7
  • [6] Modeling naturalistic face processing in humans with deep convolutional neural networks
    Guo Jiahui
    Ma Feilong
    Castello, Matteo Visconti di Oleggio
    Nastase, Samuel A.
    Haxby, James V.
    Gobbini, M. Ida
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2023, 120 (43)
  • [7] Deep Convolutional Neural Networks
    Gonzalez, Rafael C.
    IEEE SIGNAL PROCESSING MAGAZINE, 2018, 35 (06) : 79 - 87
  • [8] Analyzing and Increasing the Similarity of Humans and Deep Convolutional Neural Networks in Object Recognition
    van Dyck, Leonard
    Denzler, Sebastian
    Gruber, Walter
    PERCEPTION, 2022, 51 : 81 - 81
  • [9] Mooney Face Image Processing in Deep Convolutional Neural Networks Compared to Humans
    Zeman, Astrid
    Leers, Tim
    de Beeck, Hans Op
    JOURNAL OF COMPUTATIONAL NEUROSCIENCE, 2024, 52 : S99 - S100
  • [10] Mooney Face Image Processing in Deep Convolutional Neural Networks Compared to Humans
    Zeman, Astrid
    Leers, Tim
    de Beeck, Hans Op
    JOURNAL OF COMPUTATIONAL NEUROSCIENCE, 2024, 52 : S99 - S100