Learning Adaptive Differential Evolution Algorithm From Optimization Experiences by Policy Gradient

被引:71
|
作者
Sun, Jianyong [1 ]
Liu, Xin [1 ]
Back, Thomas [2 ]
Xu, Zongben [1 ]
机构
[1] Xi An Jiao Tong Univ, Sch Math & Stat, Xian 710049, Peoples R China
[2] Leiden Univ, Leiden Inst Adv Comp Sci, NL-2300 RA Leiden, Netherlands
基金
美国国家科学基金会; 中国国家自然科学基金;
关键词
Statistics; Sociology; Optimization; Process control; Deep learning; Reinforcement learning; Convergence; Adaptive differential evolution; deep learning; global optimization; policy gradient (PG); reinforcement learning (RL); REAL-PARAMETER OPTIMIZATION; GLOBAL OPTIMIZATION; ADAPTATION; STRATEGY;
D O I
10.1109/TEVC.2021.3060811
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Differential evolution is one of the most prestigious population-based stochastic optimization algorithm for black-box problems. The performance of a differential evolution algorithm depends highly on its mutation and crossover strategy and associated control parameters. However, the determination process for the most suitable parameter setting is troublesome and time consuming. Adaptive control parameter methods that can adapt to problem landscape and optimization environment are more preferable than fixed parameter settings. This article proposes a novel adaptive parameter control approach based on learning from the optimization experiences over a set of problems. In the approach, the parameter control is modeled as a finite-horizon Markov decision process. A reinforcement learning algorithm, named policy gradient, is applied to learn an agent (i.e., parameter controller) that can provide the control parameters of a proposed differential evolution adaptively during the search procedure. The differential evolution algorithm based on the learned agent is compared against nine well-known evolutionary algorithms on the CEC'13 and CEC'17 test suites. Experimental results show that the proposed algorithm performs competitively against these compared algorithms on the test suites.
引用
收藏
页码:666 / 680
页数:15
相关论文
共 50 条
  • [41] Multiobjective Differential Evolution Algorithm with Self-Adaptive Learning Process
    Cichon, Andrzej
    Szlachcic, Ewa
    RECENT ADVANCES IN INTELLIGENT ENGINEERING SYSTEMS, 2012, 378 : 131 - 150
  • [42] Adaptive differential decorrelation: A natural gradient algorithm
    Choi, SJ
    ARTIFICIAL NEURAL NETWORKS - ICANN 2002, 2002, 2415 : 1168 - 1173
  • [43] Teaching-Learning-Based Differential Evolution Algorithm for Optimization Problems
    Zhu, Changming
    Yan, Yan
    Haierhan
    Ni, Jun
    2015 EIGHTH INTERNATIONAL CONFERENCE ON INTERNET COMPUTING FOR SCIENCE AND ENGINEERING (ICICSE), 2015, : 139 - 142
  • [44] Auto Adaptive Differential Evolution Algorithm
    Sharma, Vivek
    Agarwal, Shalini
    Verma, Pawan Kumar
    PROCEEDINGS OF THE 2019 3RD INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2019), 2019, : 958 - 963
  • [45] A Simple Adaptive Differential Evolution Algorithm
    Thangaraj, Radha
    Pant, Millie
    Abraham, Ajith
    2009 WORLD CONGRESS ON NATURE & BIOLOGICALLY INSPIRED COMPUTING (NABIC 2009), 2009, : 456 - +
  • [46] A Hybrid Algorithm of Differential Evolution and Machine Learning for Electromagnetic Structure Optimization
    Chen, Xiao Hui
    Guo, Xin Xin
    Pei, Jin Ming
    Man, Wen Yi
    2017 32ND YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2017, : 755 - 759
  • [47] An Adaptive Multiobjective Differential Evolution Algorithm
    Gu, Fangqing
    Liu, Hai-lin
    JOURNAL OF COMPUTERS, 2013, 8 (02) : 294 - 301
  • [48] A Fuzzy Adaptive Differential Evolution Algorithm
    J. Liu
    J. Lampinen
    Soft Computing, 2005, 9 : 448 - 462
  • [49] A fuzzy adaptive differential evolution algorithm
    Liu, J
    Lampinen, J
    SOFT COMPUTING, 2005, 9 (06) : 448 - 462
  • [50] A fuzzy adaptive differential evolution algorithm
    Liu, JH
    Lampinen, J
    2002 IEEE REGION 10 CONFERENCE ON COMPUTERS, COMMUNICATIONS, CONTROL AND POWER ENGINEERING, VOLS I-III, PROCEEDINGS, 2002, : 606 - 611