Symbolic Task Inference in Deep Reinforcement Learning

被引:0
|
作者
Hasanbeig, Hosein [1 ]
Jeppu, Natasha Yogananda [2 ]
Abate, Alessandro [2 ]
Melham, Tom [2 ]
Kroening, Daniel [3 ]
机构
[1] Microsoft Research, United States
[2] Department of Computer Science, University of Oxford, United Kingdom
[3] Amazon, United States
关键词
Deep learning - Intelligent agents;
D O I
10.1613/jair.1.14063
中图分类号
学科分类号
摘要
This paper proposes DeepSynth, a method for effective training of deep reinforcement learning agents when the reward is sparse or non-Markovian, but at the same time progress towards the reward requires achieving an unknown sequence of high-level objectives. Our method employs a novel algorithm for synthesis of compact finite state automata to uncover this sequential structure automatically. We synthesise a human-interpretable automaton from trace data collected by exploring the environment. The state space of the environment is then enriched with the synthesised automaton, so that the generation of a control policy by deep reinforcement learning is guided by the discovered structure encoded in the automaton. The proposed approach is able to cope with both high-dimensional, low-level features and unknown sparse or non-Markovian rewards. We have evaluated DeepSynth’s performance in a set of experiments that includes the Atari game Montezuma’s Revenge, known to be challenging. Compared to approaches that rely solely on deep reinforcement learning, we obtain a reduction of two orders of magnitude in the iterations required for policy synthesis, and a significant improvement in scalability. ©2024 The Authors.
引用
收藏
页码:1099 / 1137
相关论文
共 50 条
  • [31] Task-Agnostic Dynamics Priors for Deep Reinforcement Learning
    Du, Yilun
    Narasimhan, Karthik
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [32] Deep Reinforcement Learning Task Assignment Based on Domain Knowledge
    Liu, Jiayi
    Wang, Gang
    Guo, Xiangke
    Wang, Siyuan
    Fu, Qiang
    IEEE ACCESS, 2022, 10 : 114402 - 114413
  • [33] Research on Dependent Task Offloading Based on Deep Reinforcement Learning
    Zhu, Qianwen
    Guo, Juan
    2024 6TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING, ICNLP 2024, 2024, : 705 - 709
  • [34] Task Migration Based on Deep Reinforcement Learning in Mobile Crowdsourcing
    Gao, Yongqiang
    Wang, Zhigang
    Li, Zemin
    Li, Zhenkun
    2022 IEEE INTL CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, BIG DATA & CLOUD COMPUTING, SUSTAINABLE COMPUTING & COMMUNICATIONS, SOCIAL COMPUTING & NETWORKING, ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM, 2022, : 410 - 417
  • [35] Task scheduling for control system based on deep reinforcement learning
    Liu, Yuhao
    Ni, Yuqing
    Dong, Chang
    Chen, Jun
    Liu, Fei
    NEUROCOMPUTING, 2024, 610
  • [36] Simultaneous task and energy planning using deep reinforcement learning
    Wang, Di
    Hu, Mengqi
    Weir, Jeffery D.
    INFORMATION SCIENCES, 2022, 607 : 931 - 946
  • [37] Hierarchical Task and Motion Planning through Deep Reinforcement Learning
    Newaz, Abdullah Al Redwan
    Alam, Tauhidul
    2021 FIFTH IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING (IRC 2021), 2021, : 100 - 105
  • [38] Deep Reinforcement Learning for Task Assignment in Spatial Crowdsourcing and Sensing
    Sun, Lijun
    Yu, Xiaojie
    Guo, Jiachen
    Yan, Yang
    Yu, Xu
    IEEE SENSORS JOURNAL, 2021, 21 (22) : 25323 - 25330
  • [39] Task Offloading Based-on Deep Reinforcement Learning for Microgrid
    Wang, Ye
    Jin, Xianzhi
    Xu, Ren
    Shao, Wenyi
    Lin, Fei
    2022 IEEE 10TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATION AND NETWORKS (ICICN 2022), 2022, : 281 - 285
  • [40] Multi-Agent Deep Reinforcement Learning-Based Inference Task Scheduling and Offloading for Maximum Inference Accuracy under Time and Energy Constraints
    Ben Sada, Abdelkarim
    Khelloufi, Amar
    Naouri, Abdenacer
    Ning, Huansheng
    Aung, Nyothiri
    Dhelim, Sahraoui
    ELECTRONICS, 2024, 13 (13)