Reinforcement learning acceleration through autonomous subgoal discovery

被引：0

作者：

Asadi, M ^{[1
]}

Huber, M ^{[1
]}

机构：

[1] Univ Texas, Dept Comp Sci & Engn, Arlington, TX 76019 USA

来源：

MLMTA '05: Proceedings of the International Conference on Machine Learning Models Technologies and Applications | 2005年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper presents two methods by which a reinforcement learning agent can automatically discover certain types of subgoals online and construct hierarchical state and action spaces. By creating useful subgoals while learning, the agent is able to accelerate learning on the current task and to transfer its expertise to other, related tasks through the reuse of its ability to attain subgoals. The presented mechanism then constructs macros action to the discovered subgoals and partitions the state space to accelerate learning time while insuring the achievablility of tasks. Simulations of different state spaces show that the policies in both original MDP and this representation achieve similar results, however the total learning time in the partition space is much smaller than the total amount of time spent on learning in the original state space.

引用

页码：69 / 74

页数：6

共 50 条

[1] Autonomous Reinforcement Learning via Subgoal Curricula
Sharma, Archit
Gupta, Abhishek
Levine, Sergey
Hausman, Karol
Finn, Chelsea
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[2] Interpretable Reinforcement Learning with Multilevel Subgoal Discovery
Demin, Alexander
Ponomaryov, Denis
2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2022, : 251 - 258
[3] Reinforcement learning transfer based on subgoal discovery and subtask similarity
Wang, Hao
Fan, Shunguo
Song, Jinhua
Gao, Yang
Chen, Xingguo
IEEE/CAA Journal of Automatica Sinica, 2014, 1 (03) : 257 - 266
[4] Reinforcement Learning Transfer Based on Subgoal Discovery and Subtask Similarity
Hao Wang
Shunguo Fan
Jinhua Song
Yang Gao
Xingguo Chen
IEEE/CAAJournalofAutomaticaSinica, 2014, 1 (03) : 257 - 266
[5] Subgoal Discovery in Reinforcement Learning Using Local Graph Clustering
Entezari, Negin
Shiri, Mohammad Ebrahim
Moradi, Parham
INTERNATIONAL JOURNAL OF FUTURE GENERATION COMMUNICATION AND NETWORKING, 2011, 4 (03): : 13 - 23
[6] End-to-End Hierarchical Reinforcement Learning With Integrated Subgoal Discovery
Pateria, Shubham
Subagdja, Budhitama
Tan, Ah-Hwee
Quek, Chai
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (12) : 7778 - 7790
[7] Connect-based subgoal discovery for options in hierarchical reinforcement learning
Chen, Fei
Gao, Yang
Chen, Shifu
Ma, Zhenduo
ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 4, PROCEEDINGS, 2007, : 698 - +
[8] Induction of Subgoal Automata for Reinforcement Learning
Furelos-Blanco, Daniel
Law, Mark
Jonsson, Anders
Broda, Krysia
Russo, Alessandra
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 3890 - 3897
[9] Induction and Exploitation of Subgoal Automata for Reinforcement Learning
Furelos-Blanco, Daniel
Law, Mark
Jonsson, Anders
Broda, Krysia
Russo, Alessandra
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2021, 70 : 1031 - 1116
[10] Induction and exploitation of subgoal automata for reinforcement learning
Furelos-Blanco D.
Law M.
Jonsson A.
Broda K.
Russo A.
Journal of Artificial Intelligence Research, 2021, 70 : 1031 - 1116

← 1 2 3 4 5 →