Multi-strategy self-learning particle swarm optimization algorithm based on reinforcement learning

被引:6
|
作者
Meng, Xiaoding [1 ]
Li, Hecheng [2 ,3 ]
Chen, Anshan [2 ]
机构
[1] Qinghai Normal Univ, Sch Comp Sci & Technol, Xining 810008, Peoples R China
[2] Qinghai Normal Univ, Sch Math & Stat, Xining 810008, Peoples R China
[3] Acad Plateau Sci & Sustainabil, Xining 810008, Peoples R China
基金
中国国家自然科学基金;
关键词
particle swarm optimization; reinforcement learning; multi; -strategy; Q; -learning; GLOBAL OPTIMIZATION; EVOLUTIONARY; MUTATION;
D O I
10.3934/mbe.2023373
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The trade-off between exploitation and exploration is a dilemma inherent to particle swarm optimization (PSO) algorithms. Therefore, a growing body of PSO variants is devoted to solving the balance between the two. Among them, the method of self-adaptive multi-strategy selection plays a crucial role in improving the performance of PSO algorithms but has yet to be well exploited. In this research, with the aid of the reinforcement learning technique to guide the generation of offspring, a novel self-adaptive multi-strategy selection mechanism is designed, and then a multi-strategy selflearning PSO algorithm based on reinforcement learning (MPSORL) is proposed. First, the fitness value of particles is regarded as a set of states that are divided into several state subsets non-uniformly. Second, the epsilon-greedy strategy is employed to select the optimal strategy for each particle. The personal best particle and the global best particle are then updated after executing the strategy. Subsequently, the next state is determined. Thus, the value of the Q-table, as a scheme adopted in self-learning, is reshaped by the reward value, the action and the state in a non-stationary environment. Finally, the proposed algorithm is compared with other state-of-the-art algorithms on two well-known benchmark suites and a real-world problem. Extensive experiments indicate that MPSORL has better performance in terms of accuracy, convergence speed and non-parametric tests in most cases. The multi-strategy selection mechanism presented in the manuscript is effective.
引用
收藏
页码:8498 / 8530
页数:33
相关论文
共 50 条
  • [41] A self-learning discrete salp swarm algorithm based on deep reinforcement learning for dynamic job shop scheduling problem
    Gu, Yiming
    Chen, Ming
    Wang, Liang
    APPLIED INTELLIGENCE, 2023, 53 (15) : 18925 - 18958
  • [42] An Adaptive Online Parameter Control Algorithm for Particle Swarm Optimization Based on Reinforcement Learning
    Liu, Yaxian
    Lu, Hui
    Cheng, Shi
    Shi, Yuhui
    2019 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2019, : 815 - 822
  • [43] Swarm Reinforcement Learning Algorithm Based on Particle Swarm Optimization Whose Personal Bests Have Lifespans
    Iima, Hitoshi
    Kuroe, Yasuaki
    NEURAL INFORMATION PROCESSING, PT 2, PROCEEDINGS, 2009, 5864 : 169 - 178
  • [44] Multi-strategy parallel genetic algorithm based on machine learning
    Zhang Y.
    Zhong H.
    Zhang C.
    Li X.
    Cong J.
    Li, Xinyu (lixinyu@mail.hust.edu.cn), 1600, CIMS (27): : 2921 - 2928
  • [45] Dynamic Multi-swarm Particle Swarm Optimization with Center Learning Strategy
    Zhu, Zijian
    Zhong, Tian
    Wu, Chenhan
    Xue, Bowen
    ADVANCES IN SWARM INTELLIGENCE, ICSI 2022, PT I, 2022, : 141 - 147
  • [46] A novel multi-swarm particle swarm optimization with dynamic learning strategy
    Ye, Wenxing
    Feng, Weiying
    Fan, Suohai
    APPLIED SOFT COMPUTING, 2017, 61 : 832 - 843
  • [47] A novel parallel multi-swarm algorithm based on comprehensive learning particle swarm optimization
    Gulcu, Saban
    Kodaz, Halife
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2015, 45 : 33 - 45
  • [48] A multi-strategy enhanced salp swarm algorithm for global optimization
    Zhang, Hongliang
    Cai, Zhennao
    Ye, Xiaojia
    Wang, Mingjing
    Kuang, Fangjun
    Chen, Huiling
    Li, Chengye
    Li, Yuping
    ENGINEERING WITH COMPUTERS, 2022, 38 (02) : 1177 - 1203
  • [49] A multi-strategy enhanced salp swarm algorithm for global optimization
    Hongliang Zhang
    Zhennao Cai
    Xiaojia Ye
    Mingjing Wang
    Fangjun Kuang
    Huiling Chen
    Chengye Li
    Yuping Li
    Engineering with Computers, 2022, 38 : 1177 - 1203
  • [50] A reinforcement learning level-based particle swarm optimization algorithm for large-scale optimization
    Wang, Feng
    Wang, Xujie
    Sun, Shilei
    INFORMATION SCIENCES, 2022, 602 : 298 - 312