Oracle-SAGE: Planning Ahead in Graph-Based Deep Reinforcement Learning

被引:1
|
作者
Chester, Andrew [1 ]
Dann, Michael [1 ]
Zambetta, Fabio [1 ]
Thangarajah, John [1 ]
机构
[1] RMIT Univ, Sch Comp Technol, Melbourne, Australia
关键词
Reinforcement learning; GNNs; Symbolic planning;
D O I
10.1007/978-3-031-26412-2_4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep reinforcement learning (RL) commonly suffers from high sample complexity and poor generalisation, especially with high-dimensional (image-based) input. Where available (such as some robotic control domains), low dimensional vector inputs outperform their image based counterparts, but it is challenging to represent complex dynamic environments in this manner. Relational reinforcement learning instead represents the world as a set of objects and the relations between them; offering a flexible yet expressive view which provides structural inductive biases to aid learning. Recently relational RL methods have been extended with modern function approximation using graph neural networks (GNNs). However, inherent limitations in the processing model for GNNs result in decreased returns when important information is dispersed widely throughout the graph. We outline a hybrid learning and planning model which uses reinforcement learning to propose and select subgoals for a planning model to achieve. This includes a novel action selection mechanism and loss function to allow training around the non-differentiable planner. We demonstrate our algorithms effectiveness on a range of domains, including MiniHack and a challenging extension of the classic taxi domain.
引用
收藏
页码:52 / 67
页数:16
相关论文
共 50 条
  • [1] iADA*-RL: Anytime Graph-Based Path Planning with Deep Reinforcement Learning for an Autonomous UAV
    Maw, Aye Aye
    Tyan, Maxim
    Nguyen, Tuan Anh
    Lee, Jae-Woo
    APPLIED SCIENCES-BASEL, 2021, 11 (09):
  • [2] Asymmetric Graph-Based Deep Reinforcement Learning for Portfolio Optimization
    Sun, Haoyu
    Liu, Xin
    Bian, Yuxuan
    Zhu, Peng
    Cheng, Dawei
    Liang, Yuqi
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES-APPLIED DATA SCIENCE TRACK, PT IX, ECML PKDD 2024, 2024, 14949 : 174 - 189
  • [3] DeepMigration: Flow Migration for NFV with Graph-based Deep Reinforcement Learning
    Sun, Penghao
    Lan, Julong
    Guo, Zehua
    Zhang, Di
    Chen, Xianfu
    Hu, Yuxiang
    Liu, Zhi
    ICC 2020 - 2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2020,
  • [4] A Graph-Based Deep Reinforcement Learning Approach to Grasping Fully Occluded Objects
    Zuo, Guoyu
    Tong, Jiayuan
    Wang, Zihao
    Gong, Daoxiong
    COGNITIVE COMPUTATION, 2023, 15 (01) : 36 - 49
  • [5] Playing Text-Adventure Games with Graph-Based Deep Reinforcement Learning
    Ammanabrolu, Prithviraj
    Riedl, Mark O.
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 3557 - 3565
  • [6] A Graph-Based Deep Reinforcement Learning Approach to Grasping Fully Occluded Objects
    Guoyu Zuo
    Jiayuan Tong
    Zihao Wang
    Daoxiong Gong
    Cognitive Computation, 2023, 15 : 36 - 49
  • [7] Graph-Based Skill Acquisition For Reinforcement Learning
    Mendonca, Matheus R. F.
    Ziviani, Artur
    Barreto, Andre M. S.
    ACM COMPUTING SURVEYS, 2019, 52 (01)
  • [8] Gargoyles: An Open Source Graph-Based Molecular Optimization Method Based on Deep Reinforcement Learning
    Erikawa, Daiki
    Yasuo, Nobuaki
    Suzuki, Takamasa
    Nakamura, Shogo
    Sekijima, Masakazu
    ACS OMEGA, 2023, 8 (40): : 37431 - 37441
  • [9] Graph-based Deep Reinforcement Learning for Wind Farm Set-Point Optimisation
    Sheehan, H.
    Poole, D.
    Silva Filho, T.
    Bossanyi, E.
    Landberg, L.
    SCIENCE OF MAKING TORQUE FROM WIND, TORQUE 2024, 2024, 2767
  • [10] Graph-based Cluttered Scene Generation and Interactive Exploration using Deep Reinforcement Learning
    Kumar, K. Niranjan
    Essa, Irfan
    Ha, Sehoon
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 7521 - 7527