Reinforcement learning acceleration through autonomous subgoal discovery

被引:0
|
作者
Asadi, M [1 ]
Huber, M [1 ]
机构
[1] Univ Texas, Dept Comp Sci & Engn, Arlington, TX 76019 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents two methods by which a reinforcement learning agent can automatically discover certain types of subgoals online and construct hierarchical state and action spaces. By creating useful subgoals while learning, the agent is able to accelerate learning on the current task and to transfer its expertise to other, related tasks through the reuse of its ability to attain subgoals. The presented mechanism then constructs macros action to the discovered subgoals and partitions the state space to accelerate learning time while insuring the achievablility of tasks. Simulations of different state spaces show that the policies in both original MDP and this representation achieve similar results, however the total learning time in the partition space is much smaller than the total amount of time spent on learning in the original state space.
引用
收藏
页码:69 / 74
页数:6
相关论文
共 50 条
  • [31] Autonomous network cyber offence strategy through deep reinforcement learning
    Sultana, Madeena
    Taylor, Adrian
    Li, Li
    ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS III, 2021, 11746
  • [32] Towards Autonomous Data Ferry Route Design through Reinforcement Learning
    Henkel, Daniel
    Brown, Timothy X.
    2008 IEEE INTERNATIONAL SYMPOSIUM ON A WORLD OF WIRELESS, MOBILE AND MULTIMEDIA NETWORKS, VOLS 1 AND 2, 2008, : 585 - 590
  • [33] Analysis of Autonomous Penetration Testing Through Reinforcement Learning and Recommender Systems
    Moreno, Ariadna Claudia
    Hernandez-Suarez, Aldo
    Sanchez-Perez, Gabriel
    Toscano-Medina, Linda Karina
    Perez-Meana, Hector
    Portillo-Portillo, Jose
    Olivares-Mercado, Jesus
    Garcia Villalba, Luis Javier
    SENSORS, 2025, 25 (01)
  • [34] Inverse Reinforcement Learning via Nonparametric Spatio-Temporal Subgoal Modeling
    Sosic, Adrian
    Zoubir, Abdelhak M.
    Rueckert, Elmar
    Peters, Jan
    Koeppl, Heinz
    JOURNAL OF MACHINE LEARNING RESEARCH, 2018, 19
  • [35] Fine-grained acceleration control for autonomous intersection management using deep reinforcement learning
    Mirzaei, Hamid
    Givargis, Tony
    2017 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTED, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI), 2017,
  • [36] Convergence of reinforcement learning algorithms and acceleration of learning
    Potapov, A
    Ali, MK
    PHYSICAL REVIEW E, 2003, 67 (02):
  • [37] CAUSAL DISCOVERY WITH REINFORCEMENT LEARNING
    Huawei Noah's Ark Lab
    不详
    Int. Conf. Learn. Represent., ICLR,
  • [38] Faster MIL-based Subgoal Identification for Reinforcement Learning by Tuning Fewer Hyperparameters
    Sunel, Saim
    Cilden, Erkin
    Polat, Faruk
    ACM TRANSACTIONS ON AUTONOMOUS AND ADAPTIVE SYSTEMS, 2024, 19 (02)
  • [39] Autonomous Maneuver Decision-Making Through Curriculum Learning and Reinforcement Learning With Sparse Rewards
    Wei, Yujie
    Zhang, Hongpeng
    Wang, Yuan
    Huang, Changqiang
    IEEE ACCESS, 2023, 11 : 73543 - 73555
  • [40] Safe Reinforcement Learning for Autonomous Vehicles through Parallel Constrained Policy Optimization
    Wen, Lu
    Duan, Jingliang
    Li, Shengbo Eben
    Xu, Shaobing
    Peng, Huei
    2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,