Double Graph Attention Networks for Visual Semantic Navigation

被引:2
|
作者
Lyu, Yunlian [1 ,2 ]
Talebi, Mohammad Sadegh [2 ]
机构
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Xiyuan Ave, Chengdu 611731, Sichuan, Peoples R China
[2] Univ Copenhagen, Dept Comp Sci, Univ Pk 1, DK-2100 Copenhagen, Denmark
关键词
Deep reinforcement learning; Visual navigation; Knowledge graph; Graph convolutional networks; Spatial attention; REINFORCEMENT; LANGUAGE; ROBOT;
D O I
10.1007/s11063-023-11190-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Artificial Intelligence (AI) based on knowledge graphs has been invested in realizing human intelligence like thinking, learning, and logical reasoning. It is a great promise to make AI-based systems not only intelligent but also knowledgeable. In this paper, we investigate knowledge graph based visual semantic navigation using deep reinforcement learning, where an agent reasons actions against targets specified by text words in indoor scenes. The agent perceives its surroundings through egocentric RGB views and learns via trial-and-error. The fundamental problem of visual navigation is efficient learning across different targets and scenes. To obtain an empirical model, we propose a spatial attention model with knowledge graphs, DGVN, which combines both semantic information about observed objects and spatial information about their locations. Our spatial attention model is constructed based on interactions between a 3D global graph and local graphs. The two graphs we adopted encode the spatial relationships between objects and are expected to guide policy search effectively. With the knowledge graph and its robust feature representation using graph convolutional networks, we demonstrate that our agent is able to infer a more plausible attention mechanism for decision-making. Under several experimental metrics, our attention model is shown to achieve superior navigation performance in the AI2-THOR environment.
引用
收藏
页码:9019 / 9040
页数:22
相关论文
共 50 条
  • [31] Signed Graph Attention Networks
    Huang, Junjie
    Shen, Huawei
    Hou, Liang
    Cheng, Xueqi
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: WORKSHOP AND SPECIAL SESSIONS, 2019, 11731 : 566 - 577
  • [32] Frontier Semantic Exploration for Visual Target Navigation
    Yu, Bangguo
    Kasaei, Hamidreza
    Cao, Ming
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 4099 - 4105
  • [33] Bayesian Relational Memory for Semantic Visual Navigation
    Wu, Yi
    Wu, Yuxin
    Tamar, Aviv
    Russell, Stuart
    Gkioxari, Georgia
    Tian, Yuandong
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 2769 - 2779
  • [34] Semantic Mapping and Navigation with Visual Planar Landmarks
    Ko, Dong Wook
    Yi, Chuho
    Suh, Il Hong
    2012 9TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS AND AMBIENT INTELLIGENCE (URAL), 2012, : 255 - 258
  • [35] Visual Representations for Semantic Target Driven Navigation
    Mousavian, Arsalan
    Toshev, Alexander
    Fiser, Marek
    Kosecka, Jana
    Wahid, Ayzaan
    Davidson, James
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 8846 - 8852
  • [36] Semantic Visual Navigation by Watching YouTube Videos
    Chang, Matthew
    Gupta, Arjun
    Gupta, Saurabh
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [37] Visual attention model based on graph
    Zeng, Xiao-Ping
    Wei, Li-Bo
    Liu, Guo-Jin
    Sichuan Daxue Xuebao (Gongcheng Kexue Ban)/Journal of Sichuan University (Engineering Science Edition), 2010, 42 (04): : 125 - 129
  • [38] Semantic Navigation of Keyword Search Based on Knowledge Graph
    Peng, Bo
    Chen, Guohua
    Tang, Yong
    Sun, Saimei
    Sun, Yuxia
    12TH CHINESE CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK AND SOCIAL COMPUTING (CHINESECSCW 2017), 2017, : 189 - 192
  • [39] Neurally-Guided Semantic Navigation in Knowledge Graph
    He, Liang
    Shao, Bin
    Xiao, Yanghua
    Li, Yatao
    Liu, Tie-Yan
    Chen, Enhong
    Xia, Huanhuan
    IEEE TRANSACTIONS ON BIG DATA, 2022, 8 (03) : 607 - 615
  • [40] Semantic Blossom Graph: A new Approach for Visual Graph Exploration
    Rauch, Manuela
    Wozelka, Ralph
    Veas, Eduardo
    Sabol, Vedran
    2014 18TH INTERNATIONAL CONFERENCE ON INFORMATION VISUALISATION (IV), 2014, : 234 - 240