Oracle-SAGE: Planning Ahead in Graph-Based Deep Reinforcement Learning

被引:1
|
作者
Chester, Andrew [1 ]
Dann, Michael [1 ]
Zambetta, Fabio [1 ]
Thangarajah, John [1 ]
机构
[1] RMIT Univ, Sch Comp Technol, Melbourne, Australia
关键词
Reinforcement learning; GNNs; Symbolic planning;
D O I
10.1007/978-3-031-26412-2_4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep reinforcement learning (RL) commonly suffers from high sample complexity and poor generalisation, especially with high-dimensional (image-based) input. Where available (such as some robotic control domains), low dimensional vector inputs outperform their image based counterparts, but it is challenging to represent complex dynamic environments in this manner. Relational reinforcement learning instead represents the world as a set of objects and the relations between them; offering a flexible yet expressive view which provides structural inductive biases to aid learning. Recently relational RL methods have been extended with modern function approximation using graph neural networks (GNNs). However, inherent limitations in the processing model for GNNs result in decreased returns when important information is dispersed widely throughout the graph. We outline a hybrid learning and planning model which uses reinforcement learning to propose and select subgoals for a planning model to achieve. This includes a novel action selection mechanism and loss function to allow training around the non-differentiable planner. We demonstrate our algorithms effectiveness on a range of domains, including MiniHack and a challenging extension of the classic taxi domain.
引用
收藏
页码:52 / 67
页数:16
相关论文
共 50 条
  • [31] Leveraging graph-based learning for credit card fraud detection: a comparative study of classical, deep learning and graph-based approaches
    Harish, Sunisha
    Lakhanpal, Chirag
    Jafari, Amir Hossein
    Neural Computing and Applications, 2024, 36 (34) : 21873 - 21883
  • [32] Learning Region Similarities via Graph-Based Deep Metric Learning
    Zhao, Yunxiang
    Qi, Jianzhong
    Trisedya, Bayu D.
    Su, Yixin
    Zhang, Rui
    Ren, Hongguang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (10) : 10237 - 10250
  • [33] A Knowledge Graph-based Interactive Recommender System Using Reinforcement Learning
    Sun, Ruoxi
    Yan, Jun
    Ren, Fenghui
    2022 TENTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA, CBD, 2022, : 73 - 78
  • [34] DAGA: Dynamics Aware Reinforcement Learning With Graph-Based Rapid Adaptation
    Ji, Jingtian
    Nie, Buqing
    Gao, Yue
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (04) : 2189 - 2196
  • [35] Sample Efficient Reinforcement Learning Using Graph-Based Memory Reconstruction
    Kang Y.
    Zhao E.
    Zang Y.
    Li L.
    Li K.
    Tao P.
    Xing J.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (02): : 751 - 762
  • [36] Knowledge Graph-Based Reinforcement Federated Learning for Chinese Question and Answering
    Xu, Liang
    Chen, Tao
    Hou, Zhaoxiang
    Zhang, Weishan
    Hon, Chitin
    Wang, Xiao
    Wang, Di
    Chen, Long
    Zhu, Wenyin
    Tian, Yunlong
    Ning, Huansheng
    Wang, Fei-Yue
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (01) : 1035 - 1045
  • [37] A Graph-Based Reinforcement Learning Method with Converged State Exploration and Exploitation
    Li, Han
    Chen, Tianding
    Teng, Hualiang
    Jiang, Yingtao
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2019, 118 (02): : 253 - +
  • [38] Graph weeds net: A graph-based deep learning method for weed recognition
    Hu, Kun
    Coleman, Guy
    Zeng, Shan
    Wang, Zhiyong
    Walsh, Michael
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2020, 174
  • [39] Synchrophasor Recovery and Prediction: A Graph-Based Deep Learning Approach
    Yu, James J. Q.
    Hill, David J.
    Li, Victor O. K.
    Hou, Yunhe
    IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (05) : 7348 - 7359
  • [40] Graph-based Fuzz Testing for Deep Learning Inference Engines
    Luo, Weisi
    Chai, Dong
    Run, Xiaoyue
    Wang, Jiang
    Fang, Chunrong
    Chen, Zhenyu
    2021 IEEE/ACM 43RD INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2021), 2021, : 288 - 299