Reinforcement learning acceleration through autonomous subgoal discovery

被引:0
|
作者
Asadi, M [1 ]
Huber, M [1 ]
机构
[1] Univ Texas, Dept Comp Sci & Engn, Arlington, TX 76019 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents two methods by which a reinforcement learning agent can automatically discover certain types of subgoals online and construct hierarchical state and action spaces. By creating useful subgoals while learning, the agent is able to accelerate learning on the current task and to transfer its expertise to other, related tasks through the reuse of its ability to attain subgoals. The presented mechanism then constructs macros action to the discovered subgoals and partitions the state space to accelerate learning time while insuring the achievablility of tasks. Simulations of different state spaces show that the policies in both original MDP and this representation achieve similar results, however the total learning time in the partition space is much smaller than the total amount of time spent on learning in the original state space.
引用
收藏
页码:69 / 74
页数:6
相关论文
共 50 条
  • [21] Learning and Adapting Behavior of Autonomous Vehicles through Inverse Reinforcement Learning
    Trauth, Rainer
    Kaufeld, Marc
    Geisslinger, Maximilian
    Betz, Johannes
    2023 IEEE INTELLIGENT VEHICLES SYMPOSIUM, IV, 2023,
  • [22] Improving the Performance of Autonomous Driving through Deep Reinforcement Learning
    Tammewar, Akshaj
    Chaudhari, Nikita
    Saini, Bunny
    Venkatesh, Divya
    Dharahas, Ganpathiraju
    Vora, Deepali
    Patil, Shruti
    Kotecha, Ketan
    Alfarhood, Sultan
    SUSTAINABILITY, 2023, 15 (18)
  • [23] An autonomous ore packing system through deep reinforcement learning
    Ren, He
    Zhong, Rui
    ADVANCES IN SPACE RESEARCH, 2024, 74 (12) : 6366 - 6383
  • [24] Selecting Subgoal for Social AGV Path Planning by Using Reinforcement Learning
    Wu, Cheng-En
    Tsai, Hsiao-Ping
    2022 23RD IEEE INTERNATIONAL CONFERENCE ON MOBILE DATA MANAGEMENT (MDM 2022), 2022, : 452 - 457
  • [25] Human-Interactive Subgoal Supervision for Efficient Inverse Reinforcement Learning
    Pan, Xinlei
    Shen, Yilin
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), 2018, : 1380 - 1387
  • [26] Selecting Subgoal for Social AGV Path Planning by Using Reinforcement Learning
    Wu, Cheng-En
    Tsai, Hsiao-Ping
    Proceedings - IEEE International Conference on Mobile Data Management, 2022, 2022-June : 452 - 457
  • [27] Subgoal-Based Reward Shaping to Improve Efficiency in Reinforcement Learning
    Okudo, Takato
    Yamada, Seiji
    IEEE ACCESS, 2021, 9 : 97557 - 97568
  • [28] Discovery of Novel GABAAR Allosteric Modulators Through Reinforcement Learning
    Michaeli, Amit
    Lerner, Immanuel
    Zatsepin, Maria
    Mezan, Shaul
    Kilshtain, Alexandra Vardi
    CURRENT PHARMACEUTICAL DESIGN, 2020, 26 (44) : 5713 - 5719
  • [29] Highly valued subgoal generation for efficient goal-conditioned reinforcement learning
    Li, Yao
    Wang, YuHui
    Tan, XiaoYang
    NEURAL NETWORKS, 2025, 181
  • [30] Goal- Driven Autonomous Exploration Through Deep Reinforcement Learning
    Cimurs, Reinis
    Suh, Il Hong
    Lee, Jin Han
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02) : 730 - 737