Reinforcement learning acceleration through autonomous subgoal discovery

被引:0
|
作者
Asadi, M [1 ]
Huber, M [1 ]
机构
[1] Univ Texas, Dept Comp Sci & Engn, Arlington, TX 76019 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents two methods by which a reinforcement learning agent can automatically discover certain types of subgoals online and construct hierarchical state and action spaces. By creating useful subgoals while learning, the agent is able to accelerate learning on the current task and to transfer its expertise to other, related tasks through the reuse of its ability to attain subgoals. The presented mechanism then constructs macros action to the discovered subgoals and partitions the state space to accelerate learning time while insuring the achievablility of tasks. Simulations of different state spaces show that the policies in both original MDP and this representation achieve similar results, however the total learning time in the partition space is much smaller than the total amount of time spent on learning in the original state space.
引用
收藏
页码:69 / 74
页数:6
相关论文
共 50 条
  • [1] Autonomous Reinforcement Learning via Subgoal Curricula
    Sharma, Archit
    Gupta, Abhishek
    Levine, Sergey
    Hausman, Karol
    Finn, Chelsea
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [2] Interpretable Reinforcement Learning with Multilevel Subgoal Discovery
    Demin, Alexander
    Ponomaryov, Denis
    2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2022, : 251 - 258
  • [3] Reinforcement learning transfer based on subgoal discovery and subtask similarity
    Wang, Hao
    Fan, Shunguo
    Song, Jinhua
    Gao, Yang
    Chen, Xingguo
    IEEE/CAA Journal of Automatica Sinica, 2014, 1 (03) : 257 - 266
  • [4] Reinforcement Learning Transfer Based on Subgoal Discovery and Subtask Similarity
    Hao Wang
    Shunguo Fan
    Jinhua Song
    Yang Gao
    Xingguo Chen
    IEEE/CAAJournalofAutomaticaSinica, 2014, 1 (03) : 257 - 266
  • [5] Subgoal Discovery in Reinforcement Learning Using Local Graph Clustering
    Entezari, Negin
    Shiri, Mohammad Ebrahim
    Moradi, Parham
    INTERNATIONAL JOURNAL OF FUTURE GENERATION COMMUNICATION AND NETWORKING, 2011, 4 (03): : 13 - 23
  • [6] End-to-End Hierarchical Reinforcement Learning With Integrated Subgoal Discovery
    Pateria, Shubham
    Subagdja, Budhitama
    Tan, Ah-Hwee
    Quek, Chai
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (12) : 7778 - 7790
  • [7] Connect-based subgoal discovery for options in hierarchical reinforcement learning
    Chen, Fei
    Gao, Yang
    Chen, Shifu
    Ma, Zhenduo
    ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 4, PROCEEDINGS, 2007, : 698 - +
  • [8] Induction of Subgoal Automata for Reinforcement Learning
    Furelos-Blanco, Daniel
    Law, Mark
    Jonsson, Anders
    Broda, Krysia
    Russo, Alessandra
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 3890 - 3897
  • [9] Induction and Exploitation of Subgoal Automata for Reinforcement Learning
    Furelos-Blanco, Daniel
    Law, Mark
    Jonsson, Anders
    Broda, Krysia
    Russo, Alessandra
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2021, 70 : 1031 - 1116
  • [10] Induction and exploitation of subgoal automata for reinforcement learning
    Furelos-Blanco D.
    Law M.
    Jonsson A.
    Broda K.
    Russo A.
    Journal of Artificial Intelligence Research, 2021, 70 : 1031 - 1116