Visual Navigation with Spatial Attention

被引:48
|
作者
Mayo, Bar [1 ]
Hazan, Tamir [1 ]
Tal, Ayellet [1 ]
机构
[1] Technion, Haifa, Israel
关键词
D O I
10.1109/CVPR46437.2021.01662
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work focuses on object goal visual navigation, aiming at finding the location of an object from a given class, where in each step the agent is provided with an egocentric RGB image of the scene. We propose to learn the agent's policy using a reinforcement learning algorithm. Our key contribution is a novel attention probability model for visual navigation tasks. This attention encodes semantic information about observed objects, as well as spatial information about their place. This combination of the "what" and the "where" allows the agent to navigate toward the sought-after object effectively. The attention model is shown to improve the agent's policy and to achieve state-of-the-art results on commonly-used datasets.
引用
收藏
页码:16893 / 16902
页数:10
相关论文
共 50 条
  • [1] Building Category Graphs Representation with Spatial and Temporal Attention for Visual Navigation
    Hu, Xiaobo
    Lin, Youfang
    Fan, Hehe
    Wang, Shuo
    Wu, Zhihao
    Lv, Kai
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (07)
  • [2] SIMULATION OF SPATIAL MEMORY FOR HUMAN NAVIGATION BASED ON VISUAL ATTENTION IN FLOORPLAN REVIEW
    Shi, Yangming
    Du, Jing
    2019 WINTER SIMULATION CONFERENCE (WSC), 2019, : 2947 - 2956
  • [3] The spatial distribution of visual attention
    Gobell, JL
    Tseng, CH
    Sperling, G
    VISION RESEARCH, 2004, 44 (12) : 1273 - 1296
  • [4] The retinotopy of visual spatial attention
    Tootell, RBH
    Hadjikhani, N
    Hall, EK
    Marrett, S
    Vanduffel, W
    Vaughan, JT
    Dale, AM
    NEURON, 1998, 21 (06) : 1409 - 1422
  • [5] The spatial resolution of visual attention
    Intriligator, J
    Cavanagh, P
    COGNITIVE PSYCHOLOGY, 2001, 43 (03) : 171 - 216
  • [6] Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships
    Lyu, Yunlian
    Shi, Yimin
    Zhang, Xianggang
    NEURAL PROCESSING LETTERS, 2022, 54 (05) : 3979 - 3998
  • [7] Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships
    Yunlian Lyu
    Yimin Shi
    Xianggang Zhang
    Neural Processing Letters, 2022, 54 : 3979 - 3998
  • [8] Visual attention in spatial cueing and visual search
    Baek, Jongsoo
    Dosher, Barbara Anne
    Lu, Zhong-Lin
    JOURNAL OF VISION, 2021, 21 (03): : 1 - 24
  • [9] Decoding attention control and selection in visual spatial attention
    Hong, Xiangfei
    Bo, Ke
    Meyyappan, Sreenivasan
    Tong, Shanbao
    Ding, Mingzhou
    HUMAN BRAIN MAPPING, 2020, 41 (14) : 3900 - 3921
  • [10] Superior Colliculus and Visual Spatial Attention
    Krauzlis, Richard J.
    Lovejoy, Lee P.
    Zenon, Alexandre
    ANNUAL REVIEW OF NEUROSCIENCE, VOL 36, 2013, 36 : 165 - 182