Symbolic Task Inference in Deep Reinforcement Learning

被引：0

作者：

Hasanbeig, Hosein ^{[1
]}

Jeppu, Natasha Yogananda ^{[2
]}

Abate, Alessandro ^{[2
]}

Melham, Tom ^{[2
]}

Kroening, Daniel ^{[3
]}

机构：

[1] Microsoft Research, United States

[2] Department of Computer Science, University of Oxford, United Kingdom

[3] Amazon, United States

来源：

Journal of Artificial Intelligence Research | 2024年 / 80卷

关键词：

Deep learning - Intelligent agents;

D O I：

10.1613/jair.1.14063

中图分类号：

学科分类号：

摘要：

This paper proposes DeepSynth, a method for effective training of deep reinforcement learning agents when the reward is sparse or non-Markovian, but at the same time progress towards the reward requires achieving an unknown sequence of high-level objectives. Our method employs a novel algorithm for synthesis of compact finite state automata to uncover this sequential structure automatically. We synthesise a human-interpretable automaton from trace data collected by exploring the environment. The state space of the environment is then enriched with the synthesised automaton, so that the generation of a control policy by deep reinforcement learning is guided by the discovered structure encoded in the automaton. The proposed approach is able to cope with both high-dimensional, low-level features and unknown sparse or non-Markovian rewards. We have evaluated DeepSynth’s performance in a set of experiments that includes the Atari game Montezuma’s Revenge, known to be challenging. Compared to approaches that rely solely on deep reinforcement learning, we obtain a reduction of two orders of magnitude in the iterations required for policy synthesis, and a significant improvement in scalability. ©2024 The Authors.

引用

页码：1099 / 1137

共 50 条

[31] Task-Agnostic Dynamics Priors for Deep Reinforcement Learning
Du, Yilun
Narasimhan, Karthik
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
[32] Deep Reinforcement Learning Task Assignment Based on Domain Knowledge
Liu, Jiayi
Wang, Gang
Guo, Xiangke
Wang, Siyuan
Fu, Qiang
IEEE ACCESS, 2022, 10 : 114402 - 114413
[33] Research on Dependent Task Offloading Based on Deep Reinforcement Learning
Zhu, Qianwen
Guo, Juan
2024 6TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING, ICNLP 2024, 2024, : 705 - 709
[34] Task Migration Based on Deep Reinforcement Learning in Mobile Crowdsourcing
Gao, Yongqiang
Wang, Zhigang
Li, Zemin
Li, Zhenkun
2022 IEEE INTL CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, BIG DATA & CLOUD COMPUTING, SUSTAINABLE COMPUTING & COMMUNICATIONS, SOCIAL COMPUTING & NETWORKING, ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM, 2022, : 410 - 417
[35] Task scheduling for control system based on deep reinforcement learning
Liu, Yuhao
Ni, Yuqing
Dong, Chang
Chen, Jun
Liu, Fei
NEUROCOMPUTING, 2024, 610
[36] Simultaneous task and energy planning using deep reinforcement learning
Wang, Di
Hu, Mengqi
Weir, Jeffery D.
INFORMATION SCIENCES, 2022, 607 : 931 - 946
[37] Hierarchical Task and Motion Planning through Deep Reinforcement Learning
Newaz, Abdullah Al Redwan
Alam, Tauhidul
2021 FIFTH IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING (IRC 2021), 2021, : 100 - 105
[38] Deep Reinforcement Learning for Task Assignment in Spatial Crowdsourcing and Sensing
Sun, Lijun
Yu, Xiaojie
Guo, Jiachen
Yan, Yang
Yu, Xu
IEEE SENSORS JOURNAL, 2021, 21 (22) : 25323 - 25330
[39] Task Offloading Based-on Deep Reinforcement Learning for Microgrid
Wang, Ye
Jin, Xianzhi
Xu, Ren
Shao, Wenyi
Lin, Fei
2022 IEEE 10TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATION AND NETWORKS (ICICN 2022), 2022, : 281 - 285
[40] Multi-Agent Deep Reinforcement Learning-Based Inference Task Scheduling and Offloading for Maximum Inference Accuracy under Time and Energy Constraints
Ben Sada, Abdelkarim
Khelloufi, Amar
Naouri, Abdenacer
Ning, Huansheng
Aung, Nyothiri
Dhelim, Sahraoui
ELECTRONICS, 2024, 13 (13)

← 1 2 3 4 5 →