Recruitment-imitation mechanism for evolutionary reinforcement learning

被引:23
|
作者
Lu, Shuai [1 ,2 ]
Han, Shuai [1 ,2 ]
Zhou, Wenbo [1 ,2 ]
Zhang, Junwei [1 ,2 ]
机构
[1] Jilin Univ, Key Lab Symbol Computat & Knowledge Engn, Minist Educ, Changchun 130012, Peoples R China
[2] Jilin Univ, Coll Comp Sci & Technol, Changchun 130012, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Evolutionary reinforcement learning; Reinforcement learning; Evolutionary algorithms; Imitation learning;
D O I
10.1016/j.ins.2020.12.017
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reinforcement learning, evolutionary algorithms and imitation learning are three principal methods to deal with continuous control tasks. Reinforcement learning is sample efficient, yet sensitive to hyperparameters settings and needs efficient exploration; Evolutionary algorithms are stable, but with low sample efficiency; Imitation learning is both sample efficient and stable, however it requires the guidance of expert data. In this paper, we propose Recruitment-imitation Mechanism (RIM) for evolutionary reinforcement learning, a scalable framework that combines advantages of the three methods mentioned above. The core of this framework is a dual-actors and single critic reinforcement learning agent. This agent can recruit high-fitness actors from the population performing evolutionary algorithms, which instructs itself to learn from experience replay buffer. At the same time, low-fitness actors in the evolutionary population can imitate behavior patterns of the reinforcement learning agent and promote their fitness level. Reinforcement and imitation learners in this framework can be replaced with any off-policy actor-critic reinforcement learner and data-driven imitation learner. We evaluate RIM on a series of benchmarks for continuous control tasks in Mujoco. The experimental results show that RIM outperforms prior evolutionary or reinforcement learning methods. The performance of RIM's components is significantly better than components of previous evolutionary reinforcement learning algorithm, and the recruitment using soft update enables reinforcement learning agent to learn faster than that using hard update. (C) 2020 Elsevier Inc. All rights reserved.
引用
收藏
页码:172 / 188
页数:17
相关论文
共 50 条
  • [31] A Penetration Strategy Combining Deep Reinforcement Learning and Imitation Learning
    Wang X.
    Gu K.
    Yuhang Xuebao/Journal of Astronautics, 2023, 44 (06): : 914 - 925
  • [32] Methodologies for Imitation Learning via Inverse Reinforcement Learning: A Review
    Zhang K.
    Yu Y.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2019, 56 (02): : 254 - 261
  • [33] Bridging the Gap Between Imitation Learning and Inverse Reinforcement Learning
    Piot, Bilal
    Geist, Matthieu
    Pietquin, Olivier
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (08) : 1814 - 1826
  • [34] Learning How to Play Bomberman with Deep Reinforcement and Imitation Learning
    Goulart, Icaro
    Paes, Aline
    Clua, Esteban
    ENTERTAINMENT COMPUTING AND SERIOUS GAMES, ICEC-JCSG 2019, 2019, 11863 : 121 - 133
  • [35] Robotic Manipulation with Reinforcement Learning, State Representation Learning, and Imitation Learning
    Chen, Hanxiao
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 15769 - 15770
  • [36] Collaborative Evolutionary Reinforcement Learning
    Khadka, Shauharda
    Majumdar, Somdeb
    Nassar, Tarek
    Dwiel, Zach
    Tumer, Evren
    Miret, Santiago
    Liu, Yinyin
    Tumer, Kagan
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [37] Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism
    Rashidinejad, Paria
    Zhu, Banghua
    Ma, Cong
    Jiao, Jiantao
    Russell, Stuart
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2022, 68 (12) : 8156 - 8196
  • [38] Evolutionary algorithms for reinforcement learning
    Moriarty, DE
    Schultz, AC
    Grefenstette, JJ
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1999, 11 : 241 - 276
  • [39] Evolutionary Reinforcement Learning: A Survey
    Bai, Hui
    Cheng, Ran
    Jin, Yaochu
    Intelligent Computing, 2023, 2
  • [40] Cloud Resource Scheduling With Deep Reinforcement Learning and Imitation Learning
    Guo, Wenxia
    Tian, Wenhong
    Ye, Yufei
    Xu, Lingxiao
    Wu, Kui
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (05): : 3576 - 3586