Visual Navigation with Spatial Attention

被引:48
|
作者
Mayo, Bar [1 ]
Hazan, Tamir [1 ]
Tal, Ayellet [1 ]
机构
[1] Technion, Haifa, Israel
关键词
D O I
10.1109/CVPR46437.2021.01662
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work focuses on object goal visual navigation, aiming at finding the location of an object from a given class, where in each step the agent is provided with an egocentric RGB image of the scene. We propose to learn the agent's policy using a reinforcement learning algorithm. Our key contribution is a novel attention probability model for visual navigation tasks. This attention encodes semantic information about observed objects, as well as spatial information about their place. This combination of the "what" and the "where" allows the agent to navigate toward the sought-after object effectively. The attention model is shown to improve the agent's policy and to achieve state-of-the-art results on commonly-used datasets.
引用
收藏
页码:16893 / 16902
页数:10
相关论文
共 50 条
  • [21] MECHANISMS OF VISUAL-SPATIAL ATTENTION
    HILLYARD, SA
    BULLETIN OF THE PSYCHONOMIC SOCIETY, 1988, 26 (06) : 493 - 493
  • [22] Visual Attention in spatial language comprehension
    Burigo, Michele
    Knoeferle, Pia
    COGNITIVE PROCESSING, 2012, 13 : S38 - S38
  • [23] Decoding Visual Spatial Attention Control
    Meyyappan, Sreenivasan
    Rajan, Abhijit
    Yang, Qiang
    Mangun, George R.
    Ding, Mingzhou
    ENEURO, 2025, 12 (03)
  • [24] Individual Differences in the Allocation of Visual Attention During Navigation
    Keller, Mikayla
    Sutton, Jennifer E.
    CANADIAN JOURNAL OF EXPERIMENTAL PSYCHOLOGY-REVUE CANADIENNE DE PSYCHOLOGIE EXPERIMENTALE, 2022, 76 (01): : 10 - 21
  • [25] Double Graph Attention Networks for Visual Semantic Navigation
    Lyu, Yunlian
    Talebi, Mohammad Sadegh
    NEURAL PROCESSING LETTERS, 2023, 55 (07) : 9019 - 9040
  • [26] Double Graph Attention Networks for Visual Semantic Navigation
    Yunlian Lyu
    Mohammad Sadegh Talebi
    Neural Processing Letters, 2023, 55 : 9019 - 9040
  • [27] Attention and Anticipation in Fast Visual-Inertial Navigation
    Carlone, Luca
    Karaman, Sertac
    IEEE TRANSACTIONS ON ROBOTICS, 2019, 35 (01) : 1 - 20
  • [28] Spatial navigation signals in rodent visual cortex
    Flossmann, Tom
    Rochefort, Nathalie L.
    CURRENT OPINION IN NEUROBIOLOGY, 2021, 67 : 163 - 173
  • [29] AudioGPS: Spatial Audio Navigation with a Minimal Attention Interface
    Simon Holland
    David R. Morse
    Henrik Gedenryd
    Personal and Ubiquitous Computing, 2002, 6 : 253 - 259
  • [30] AudioGPS: Spatial Audio Navigation with a Minimal Attention Interface
    Holland, Simon
    Morse, David R.
    Gedenryd, Henrik
    PERSONAL AND UBIQUITOUS COMPUTING, 2002, 6 (04) : 253 - 259