Symbolic Task Inference in Deep Reinforcement Learning

被引:0
|
作者
Hasanbeig, Hosein [1 ]
Jeppu, Natasha Yogananda [2 ]
Abate, Alessandro [2 ]
Melham, Tom [2 ]
Kroening, Daniel [3 ]
机构
[1] Microsoft Research, United States
[2] Department of Computer Science, University of Oxford, United Kingdom
[3] Amazon, United States
来源
Journal of Artificial Intelligence Research | 2024年 / 80卷
关键词
Deep learning - Intelligent agents;
D O I
10.1613/jair.1.14063
中图分类号
学科分类号
摘要
This paper proposes DeepSynth, a method for effective training of deep reinforcement learning agents when the reward is sparse or non-Markovian, but at the same time progress towards the reward requires achieving an unknown sequence of high-level objectives. Our method employs a novel algorithm for synthesis of compact finite state automata to uncover this sequential structure automatically. We synthesise a human-interpretable automaton from trace data collected by exploring the environment. The state space of the environment is then enriched with the synthesised automaton, so that the generation of a control policy by deep reinforcement learning is guided by the discovered structure encoded in the automaton. The proposed approach is able to cope with both high-dimensional, low-level features and unknown sparse or non-Markovian rewards. We have evaluated DeepSynth’s performance in a set of experiments that includes the Atari game Montezuma’s Revenge, known to be challenging. Compared to approaches that rely solely on deep reinforcement learning, we obtain a reduction of two orders of magnitude in the iterations required for policy synthesis, and a significant improvement in scalability. ©2024 The Authors.
引用
收藏
页码:1099 / 1137
相关论文
共 50 条
  • [21] Optimization of Deep Reinforcement Learning with Hybrid Multi-Task Learning
    Varghese, Nelson Vithayathil
    Mahmoud, Qusay H.
    2021 15TH ANNUAL IEEE INTERNATIONAL SYSTEMS CONFERENCE (SYSCON 2021), 2021,
  • [22] Comparison of multiple reinforcement learning and deep reinforcement learning methods for the task aimed at achieving the goal
    Parak R.
    Matousek R.
    Mendel, 2021, 27 (01) : 1 - 8
  • [23] Deep Explainable Relational Reinforcement Learning: A Neuro-Symbolic Approach
    Hazra, Rishi
    De Raedt, Luc
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT IV, 2023, 14172 : 213 - 229
  • [24] SAGE: Generating Symbolic Goals for Myopic Models in Deep Reinforcement Learning
    Chester, Andrew
    Dann, Michael
    Zambetta, Fabio
    Thangarajah, John
    ADVANCES IN ARTIFICIAL INTELLIGENCE, AI 2023, PT II, 2024, 14472 : 274 - 285
  • [25] Multi-task Deep Reinforcement Learning for Scalable Parallel Task Scheduling
    Zhang, Lingxin
    Qi, Qi
    Wang, Jingyu
    Sun, Haifeng
    Liao, Jianxin
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 2992 - 3001
  • [26] Deep Reinforcement Learning Task Assignment Based on Domain Knowledge
    Liu, Jiayi
    Wang, Gang
    Guo, Xiangke
    Wang, Siyuan
    Fu, Qiang
    IEEE Access, 2022, 10 : 114402 - 114413
  • [27] PROBING TRANSFER IN DEEP REINFORCEMENT LEARNING WITHOUT TASK ENGINEERING
    Rusu, Andrei A.
    Flennerhag, Sebastian
    Rao, Dushyant
    Pascanu, Razvan
    Hadsell, Raia
    CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 199, 2022, 199
  • [28] Accelerating Deep Continuous Reinforcement Learning through Task Simplification
    Kerzel, Matthias
    Mohammadi, Hadi Beik
    Zamani, Mohammad Ali
    Wermter, Stefan
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018, : 139 - 144
  • [29] Task Assignment of UAV Swarms Based on Deep Reinforcement Learning
    Liu, Bo
    Wang, Shulei
    Li, Qinghua
    Zhao, Xinyang
    Pan, Yunqing
    Wang, Changhong
    DRONES, 2023, 7 (05)
  • [30] Federated Deep Reinforcement Learning for Task Participation in Mobile Crowdsensing
    Dongare, Sumedh
    Ortiz, Andrea
    Klein, Anja
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 4436 - 4441