Symbolic Task Inference in Deep Reinforcement Learning

被引：0

作者：

Hasanbeig, Hosein ^{[1
]}

Jeppu, Natasha Yogananda ^{[2
]}

Abate, Alessandro ^{[2
]}

Melham, Tom ^{[2
]}

Kroening, Daniel ^{[3
]}

机构：

[1] Microsoft Research, United States

[2] Department of Computer Science, University of Oxford, United Kingdom

[3] Amazon, United States

来源：

Journal of Artificial Intelligence Research | 2024年 / 80卷

关键词：

Deep learning - Intelligent agents;

D O I：

10.1613/jair.1.14063

中图分类号：

学科分类号：

摘要：

This paper proposes DeepSynth, a method for effective training of deep reinforcement learning agents when the reward is sparse or non-Markovian, but at the same time progress towards the reward requires achieving an unknown sequence of high-level objectives. Our method employs a novel algorithm for synthesis of compact finite state automata to uncover this sequential structure automatically. We synthesise a human-interpretable automaton from trace data collected by exploring the environment. The state space of the environment is then enriched with the synthesised automaton, so that the generation of a control policy by deep reinforcement learning is guided by the discovered structure encoded in the automaton. The proposed approach is able to cope with both high-dimensional, low-level features and unknown sparse or non-Markovian rewards. We have evaluated DeepSynth’s performance in a set of experiments that includes the Atari game Montezuma’s Revenge, known to be challenging. Compared to approaches that rely solely on deep reinforcement learning, we obtain a reduction of two orders of magnitude in the iterations required for policy synthesis, and a significant improvement in scalability. ©2024 The Authors.

引用

页码：1099 / 1137

共 50 条

[21] Optimization of Deep Reinforcement Learning with Hybrid Multi-Task Learning
Varghese, Nelson Vithayathil
Mahmoud, Qusay H.
2021 15TH ANNUAL IEEE INTERNATIONAL SYSTEMS CONFERENCE (SYSCON 2021), 2021,
[22] Comparison of multiple reinforcement learning and deep reinforcement learning methods for the task aimed at achieving the goal
Parak R.
Matousek R.
Mendel, 2021, 27 (01) : 1 - 8
[23] Deep Explainable Relational Reinforcement Learning: A Neuro-Symbolic Approach
Hazra, Rishi
De Raedt, Luc
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT IV, 2023, 14172 : 213 - 229
[24] SAGE: Generating Symbolic Goals for Myopic Models in Deep Reinforcement Learning
Chester, Andrew
Dann, Michael
Zambetta, Fabio
Thangarajah, John
ADVANCES IN ARTIFICIAL INTELLIGENCE, AI 2023, PT II, 2024, 14472 : 274 - 285
[25] Multi-task Deep Reinforcement Learning for Scalable Parallel Task Scheduling
Zhang, Lingxin
Qi, Qi
Wang, Jingyu
Sun, Haifeng
Liao, Jianxin
2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 2992 - 3001
[26] Deep Reinforcement Learning Task Assignment Based on Domain Knowledge
Liu, Jiayi
Wang, Gang
Guo, Xiangke
Wang, Siyuan
Fu, Qiang
IEEE Access, 2022, 10 : 114402 - 114413
[27] PROBING TRANSFER IN DEEP REINFORCEMENT LEARNING WITHOUT TASK ENGINEERING
Rusu, Andrei A.
Flennerhag, Sebastian
Rao, Dushyant
Pascanu, Razvan
Hadsell, Raia
CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 199, 2022, 199
[28] Accelerating Deep Continuous Reinforcement Learning through Task Simplification
Kerzel, Matthias
Mohammadi, Hadi Beik
Zamani, Mohammad Ali
Wermter, Stefan
2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018, : 139 - 144
[29] Task Assignment of UAV Swarms Based on Deep Reinforcement Learning
Liu, Bo
Wang, Shulei
Li, Qinghua
Zhao, Xinyang
Pan, Yunqing
Wang, Changhong
DRONES, 2023, 7 (05)
[30] Federated Deep Reinforcement Learning for Task Participation in Mobile Crowdsensing
Dongare, Sumedh
Ortiz, Andrea
Klein, Anja
IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 4436 - 4441

← 1 2 3 4 5 →