Adaptive evolution strategy with ensemble of mutations for Reinforcement Learning

被引:17
|
作者
Ajani, Oladayo S. [1 ]
Mallipeddi, Rammohan [1 ]
机构
[1] Kyungpook Natl Univ, Dept Artificial Intelligence, Daegu, South Korea
基金
新加坡国家研究基金会;
关键词
Evolution strategy; Reinforcement Learning; Ensemble; Mutation strategy; Black-box optimization; INITIALIZATION; ADAPTATION;
D O I
10.1016/j.knosys.2022.108624
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Evolving the weights of learning networks through evolutionary computation (neuroevolution) has proven scalable over a range of challenging Reinforcement Learning (RL) control tasks. However, similar to most black-box optimization problems, existing neuroevolution approaches require an additional adaptation process to effectively balance exploration and exploitation through the selection of sensitive hyper-parameters throughout the evolution process. Therefore, these methods are often plagued by the computation complexities of such adaptation processes which often rely on a number of sophisticatedly formulated strategy parameters. In this paper, Evolution Strategy (ES) with a simple yet efficient ensemble of mutation strategies is proposed. Specifically, two distinct mutation strategies coexist throughout the evolution process where each strategy is associated with its own population subset. Consequently, elites for generating a population of offspring are realized by co-evaluation of the combined population. Experiments on testbed of six (6) black-box optimization problems which are generated using a classical control problem and six (6) proven continuous RL agents demonstrate the efficiency of the proposed method in terms of faster convergence and scalability than the canonical ES. Furthermore, the proposed Adaptive Ensemble ES (AEES) shows an average of 5 - 10000x and 10 100x better sample complexity in low and high dimension problems, respectively than their associated base DRL agents.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Adaptive evolution strategy with ensemble of mutations for Reinforcement Learning
    Ajani, Oladayo S.
    Mallipeddi, Rammohan
    Knowledge-Based Systems, 2022, 245
  • [2] Robust Adaptive Ensemble Adversary Reinforcement Learning
    Zhai, Peng
    Hou, Taixian
    Ji, Xiaopeng
    Dong, Zhiyan
    Zhang, Lihua
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04): : 12562 - 12568
  • [3] Memetic Evolution Strategy for Reinforcement Learning
    Qu, Xinghua
    Ong, Yew-Soon
    Hou, Yaqing
    Shen, Xiaobo
    2019 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2019, : 1922 - 1928
  • [4] Ensemble Strategy Based on Deep Reinforcement Learning for Portfolio Optimization
    Su, Xiao
    Zhou, Yalan
    He, Shanshan
    Li, Xiangxia
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT IV, KSEM 2023, 2023, 14120 : 242 - 249
  • [5] Reinforcement learning assisted differential evolution with adaptive resource allocation strategy for multimodal optimization problems
    Ma, Tao
    Zhao, Hong
    Li, Xiangqian
    Yang, Fang
    Liu, Chun-sheng
    Liu, Jing
    SWARM AND EVOLUTIONARY COMPUTATION, 2025, 94
  • [6] A novel carbon price forecasting method based on model matching, adaptive decomposition, and reinforcement learning ensemble strategy
    Cao, Zijie
    Liu, Hui
    ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2023, 30 (13) : 36044 - 36067
  • [7] A novel carbon price forecasting method based on model matching, adaptive decomposition, and reinforcement learning ensemble strategy
    Zijie Cao
    Hui Liu
    Environmental Science and Pollution Research, 2023, 30 : 36044 - 36067
  • [8] A Novel Adaptive Sampling Strategy for Deep Reinforcement Learning
    Liang, Xingxing
    Chen, Li
    Feng, Yanghe
    Liu, Zhong
    Ma, Yang
    Huang, Kuihua
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2021, 20 (02)
  • [9] QFlip: An Adaptive Reinforcement Learning Strategy for the FlipIt Security Game
    Oakley, Lisa
    Oprea, Alina
    DECISION AND GAME THEORY FOR SECURITY, 2019, 11836 : 364 - 384
  • [10] Adaptive Water Environment Optimization Strategy Based on Reinforcement Learning
    Dang T.
    Liu J.
    Computer-Aided Design and Applications, 2024, 21 (S23): : 1 - 18