Anchor: The achieved goal to replace the subgoal for hierarchical reinforcement learning

被引:9
|
作者
Li, Ruijia [1 ]
Cai, Zhiling [1 ]
Huang, Tianyi [1 ]
Zhu, William [1 ]
机构
[1] Univ Elect Sci & Technol China, Inst Fundamental & Frontier Sci, Chengdu, Peoples R China
基金
中国国家自然科学基金;
关键词
Hierarchical reinforcement learning; Reinforcement learning; Continuous control; Intrinsic motivation;
D O I
10.1016/j.knosys.2021.107128
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hierarchical reinforcement learning (HRL) extends traditional reinforcement learning methods to complex tasks, such as the continuous control task with long horizon. As an effective paradigm for HRL, the subgoal-based HRL method uses subgoals to provide intrinsic motivation which helps the agent to reach the desired goal. However, it is tough to determine the subgoal. In this paper, we present a new concept called anchor to replace the subgoal. Our anchor is selected from the achieved goals of the agent. By the anchor, we propose a new HRL method which encourages the agent to move fast away from the corresponding anchor in the right direction of reaching the desired goal. Specifically, for moving fast, our new method uses an intrinsic reward computed by the distance between the current achieved goal and the corresponding anchor. Meanwhile, for moving in the right direction, it weights the intrinsic reward by the extrinsic rewards collected in the process of moving away from the corresponding anchor. The experiments demonstrate the effectiveness of the proposed method on the continuous control task with long horizon. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Hierarchical Reinforcement Learning Based on Continuous Subgoal Space
    Wang, Chen
    Zeng, Fanyu
    Ge, Shuzhi Sam
    Jiang, Xin
    2020 IEEE INTERNATIONAL CONFERENCE ON REAL-TIME COMPUTING AND ROBOTICS (IEEE-RCAR 2020), 2020, : 74 - 80
  • [2] Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning
    Kim, Junsu
    Seo, Younggyo
    Shin, Jinwoo
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [3] End-to-End Hierarchical Reinforcement Learning With Integrated Subgoal Discovery
    Pateria, Shubham
    Subagdja, Budhitama
    Tan, Ah-Hwee
    Quek, Chai
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (12) : 7778 - 7790
  • [4] Connect-based subgoal discovery for options in hierarchical reinforcement learning
    Chen, Fei
    Gao, Yang
    Chen, Shifu
    Ma, Zhenduo
    ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 4, PROCEEDINGS, 2007, : 698 - +
  • [5] Highly valued subgoal generation for efficient goal-conditioned reinforcement learning
    Li, Yao
    Wang, YuHui
    Tan, XiaoYang
    NEURAL NETWORKS, 2025, 181
  • [6] Induction of Subgoal Automata for Reinforcement Learning
    Furelos-Blanco, Daniel
    Law, Mark
    Jonsson, Anders
    Broda, Krysia
    Russo, Alessandra
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 3890 - 3897
  • [7] Induction and Exploitation of Subgoal Automata for Reinforcement Learning
    Furelos-Blanco, Daniel
    Law, Mark
    Jonsson, Anders
    Broda, Krysia
    Russo, Alessandra
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2021, 70 : 1031 - 1116
  • [8] Robot Subgoal-guided Navigation in Dynamic Crowded Environments with Hierarchical Deep Reinforcement Learning
    Zhang, Tianle
    Liu, Zhen
    Pu, Zhiqiang
    Yi, Jianqiang
    Liang, Yanyan
    Zhang, Du
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2023, 21 (07) : 2350 - 2362
  • [9] Hierarchical extreme learning machine based reinforcement learning for goal localization
    AlDahoul, Nouar
    Htike, Zaw Zaw
    Akmeliawati, Rini
    3RD INTERNATIONAL CONFERENCE ON MECHANICAL, AUTOMOTIVE AND AEROSPACE ENGINEERING 2016, 2017, 184
  • [10] Subgoal Discovery for Hierarchical Dialogue Policy Learning
    Tang, Da
    Li, Xiujun
    Gao, Jianfeng
    Wang, Chong
    Li, Lihong
    Jebara, Tony
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 2298 - 2309