Reinforcement learning acceleration through autonomous subgoal discovery

被引:0
|
作者
Asadi, M [1 ]
Huber, M [1 ]
机构
[1] Univ Texas, Dept Comp Sci & Engn, Arlington, TX 76019 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents two methods by which a reinforcement learning agent can automatically discover certain types of subgoals online and construct hierarchical state and action spaces. By creating useful subgoals while learning, the agent is able to accelerate learning on the current task and to transfer its expertise to other, related tasks through the reuse of its ability to attain subgoals. The presented mechanism then constructs macros action to the discovered subgoals and partitions the state space to accelerate learning time while insuring the achievablility of tasks. Simulations of different state spaces show that the policies in both original MDP and this representation achieve similar results, however the total learning time in the partition space is much smaller than the total amount of time spent on learning in the original state space.
引用
收藏
页码:69 / 74
页数:6
相关论文
共 50 条
  • [41] Improving Autonomous Separation Assurance through Distributed Reinforcement Learning with Attention Networks
    Brittain, Marc W.
    Alvarez, Luis E.
    Breeden, Kara
    THIRTY-EIGTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 21, 2024, : 22857 - 22863
  • [42] Autonomous Navigation of the UAV through Deep Reinforcement Learning with Sensor Perception Enhancement
    Zhao S.
    Wang W.
    Li J.
    Huang S.
    Liu S.
    Lolli F.
    Mathematical Problems in Engineering, 2023, 2023
  • [43] Monocular Vision based Autonomous Landing of Quadrotor through Deep Reinforcement Learning
    Xu, Yinbo
    Liu, Zhihong
    Wang, Xiangke
    2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 10014 - 10019
  • [44] Autonomous Vehicle Decision and Control through Reinforcement Learning with Traffic Flow Randomization
    Lin, Yuan
    Xie, Antai
    Liu, Xiao
    MACHINES, 2024, 12 (04)
  • [45] Prediction based segmentation of state space and application to a subgoal finding problem in reinforcement learning
    Nagata, Y
    Ohigashi, Y
    Takahashi, H
    Ishikawa, S
    Omori, T
    Morikawa, K
    SICE 2004 ANNUAL CONFERENCE, VOLS 1-3, 2004, : 2560 - 2565
  • [46] Autonomous robotic nanofabrication with reinforcement learning
    Leinen, Philipp
    Esders, Malte
    Schuett, Kristof T.
    Wagner, Christian
    Mueller, Klaus-Robert
    Tautz, F. Stefan
    SCIENCE ADVANCES, 2020, 6 (36):
  • [47] Reinforcement Learning for Autonomous Aircraft Avoidance
    Keong, Choo Wai
    Shin, Hyo-Sang
    Tsourdos, Antonios
    2019 INTERNATIONAL WORKSHOP ON RESEARCH, EDUCATION AND DEVELOPMENT OF UNMANNED AERIAL SYSTEMS (RED UAS 2019), 2019, : 126 - 131
  • [48] Autonomous Reinforcement Learning with Hierarchical REPS
    Daniel, Christian
    Neumann, Gerhard
    Peters, Jan
    2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [49] Routing an Autonomous Taxi with Reinforcement Learning
    Han, Miyoung
    Senellart, Pierre
    Bressan, Stephane
    Wu, Huayu
    CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 2421 - 2424
  • [50] Autonomous drifting using reinforcement learning
    Orgován L.
    Bécsi T.
    Aradi S.
    Periodica Polytechnica Transportation Engineering, 2021, 49 (03): : 292 - 300