Multi-strategy self-learning particle swarm optimization algorithm based on reinforcement learning

被引：6

作者：

Meng, Xiaoding ^{[1
]}

Li, Hecheng ^{[2
,3
]}

Chen, Anshan ^{[2
]}

机构：

[1] Qinghai Normal Univ, Sch Comp Sci & Technol, Xining 810008, Peoples R China

[2] Qinghai Normal Univ, Sch Math & Stat, Xining 810008, Peoples R China

[3] Acad Plateau Sci & Sustainabil, Xining 810008, Peoples R China

来源：

MATHEMATICAL BIOSCIENCES AND ENGINEERING | 2023年 / 20卷 / 05期

基金：

中国国家自然科学基金;

关键词：

particle swarm optimization; reinforcement learning; multi; -strategy; Q; -learning; GLOBAL OPTIMIZATION; EVOLUTIONARY; MUTATION;

D O I：

10.3934/mbe.2023373

中图分类号：

Q [生物科学];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

The trade-off between exploitation and exploration is a dilemma inherent to particle swarm optimization (PSO) algorithms. Therefore, a growing body of PSO variants is devoted to solving the balance between the two. Among them, the method of self-adaptive multi-strategy selection plays a crucial role in improving the performance of PSO algorithms but has yet to be well exploited. In this research, with the aid of the reinforcement learning technique to guide the generation of offspring, a novel self-adaptive multi-strategy selection mechanism is designed, and then a multi-strategy selflearning PSO algorithm based on reinforcement learning (MPSORL) is proposed. First, the fitness value of particles is regarded as a set of states that are divided into several state subsets non-uniformly. Second, the epsilon-greedy strategy is employed to select the optimal strategy for each particle. The personal best particle and the global best particle are then updated after executing the strategy. Subsequently, the next state is determined. Thus, the value of the Q-table, as a scheme adopted in self-learning, is reshaped by the reward value, the action and the state in a non-stationary environment. Finally, the proposed algorithm is compared with other state-of-the-art algorithms on two well-known benchmark suites and a real-world problem. Extensive experiments indicate that MPSORL has better performance in terms of accuracy, convergence speed and non-parametric tests in most cases. The multi-strategy selection mechanism presented in the manuscript is effective.

引用

页码：8498 / 8530

页数：33

共 50 条

[41] A self-learning discrete salp swarm algorithm based on deep reinforcement learning for dynamic job shop scheduling problem
Gu, Yiming
Chen, Ming
Wang, Liang
APPLIED INTELLIGENCE, 2023, 53 (15) : 18925 - 18958
[42] An Adaptive Online Parameter Control Algorithm for Particle Swarm Optimization Based on Reinforcement Learning
Liu, Yaxian
Lu, Hui
Cheng, Shi
Shi, Yuhui
2019 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2019, : 815 - 822
[43] Swarm Reinforcement Learning Algorithm Based on Particle Swarm Optimization Whose Personal Bests Have Lifespans
Iima, Hitoshi
Kuroe, Yasuaki
NEURAL INFORMATION PROCESSING, PT 2, PROCEEDINGS, 2009, 5864 : 169 - 178
[44] Multi-strategy parallel genetic algorithm based on machine learning
Zhang Y.
Zhong H.
Zhang C.
Li X.
Cong J.
Li, Xinyu (lixinyu@mail.hust.edu.cn), 1600, CIMS (27): : 2921 - 2928
[45] Dynamic Multi-swarm Particle Swarm Optimization with Center Learning Strategy
Zhu, Zijian
Zhong, Tian
Wu, Chenhan
Xue, Bowen
ADVANCES IN SWARM INTELLIGENCE, ICSI 2022, PT I, 2022, : 141 - 147
[46] A novel multi-swarm particle swarm optimization with dynamic learning strategy
Ye, Wenxing
Feng, Weiying
Fan, Suohai
APPLIED SOFT COMPUTING, 2017, 61 : 832 - 843
[47] A novel parallel multi-swarm algorithm based on comprehensive learning particle swarm optimization
Gulcu, Saban
Kodaz, Halife
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2015, 45 : 33 - 45
[48] A multi-strategy enhanced salp swarm algorithm for global optimization
Zhang, Hongliang
Cai, Zhennao
Ye, Xiaojia
Wang, Mingjing
Kuang, Fangjun
Chen, Huiling
Li, Chengye
Li, Yuping
ENGINEERING WITH COMPUTERS, 2022, 38 (02) : 1177 - 1203
[49] A multi-strategy enhanced salp swarm algorithm for global optimization
Hongliang Zhang
Zhennao Cai
Xiaojia Ye
Mingjing Wang
Fangjun Kuang
Huiling Chen
Chengye Li
Yuping Li
Engineering with Computers, 2022, 38 : 1177 - 1203
[50] A reinforcement learning level-based particle swarm optimization algorithm for large-scale optimization
Wang, Feng
Wang, Xujie
Sun, Shilei
INFORMATION SCIENCES, 2022, 602 : 298 - 312

← 1 2 3 4 5 →