GEFWA: Gradient-Enhanced Fireworks Algorithm for Optimizing Convolutional Neural Networks

Authors
Chen, Maiyue [1, 2]
Tan, Ying [1, 2, 3]
Affiliations
[1] Peking Univ, Sch Intelligence Sci & Technol, Beijing, Peoples R China
[2] Peking Univ, Key Lab Machine Perception MOE, Beijing, Peoples R China
[3] Peking Univ, Inst Artificial Intelligence, Beijing, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Fireworks algorithm; Deep learning; Convolutional neural network; Swarm intelligence
DOI
10.1007/978-3-031-36622-2_26
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Evolutionary and swarm intelligence-based black-box optimization algorithms have proven effective in machine learning and are increasingly used, but their reliance on sampling has raised concerns about low sample efficiency. Improving the sample efficiency of conventional black-box optimization algorithms while retaining their strengths is therefore crucial. To this end, we propose the Gradient-Enhanced Fireworks Algorithm (GEFWA), which incorporates first-order gradient information into the population-based fireworks algorithm (FWA). We enhance the explosion operator with the gradient-enhanced explosion (GEE) and use attraction-based cooperation (ABC) for collaboration among fireworks. Experimental results show that GEFWA outperforms traditional first-order stochastic gradient descent-based optimization methods such as Adam and SGD when optimizing convolutional neural networks. These results demonstrate the potential of integrating gradient information into the FWA framework for large-scale machine learning problems.
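The abstract names two ingredients, a gradient-informed explosion and attraction between fireworks, but gives no formulas for them. The following is a minimal sketch of that general idea on a toy quadratic objective, not the paper's GEE or ABC operators. Every function name, the uniform sampling box, the step size, the attraction rate, and the amplitude decay are assumptions chosen only for illustration.

```python
import random

def sphere(x):
    """Toy objective (assumption): sum of squares, minimized at the origin."""
    return sum(v * v for v in x)

def sphere_grad(x):
    """Analytic gradient of the toy objective."""
    return [2.0 * v for v in x]

def gradient_shifted_explosion(center, grad, lr, amplitude, n_sparks, rng):
    """Hypothetical operator: sample sparks in a box around a
    gradient-descent-shifted center instead of around the firework itself."""
    shifted = [c - lr * g for c, g in zip(center, grad)]
    sparks = [shifted]  # keep the plain gradient step as one candidate
    for _ in range(n_sparks):
        sparks.append([s + rng.uniform(-amplitude, amplitude) for s in shifted])
    return sparks

def attract(position, target, rate):
    """Hypothetical cooperation step: move a fraction `rate` toward `target`."""
    return [p + rate * (t - p) for p, t in zip(position, target)]

def run(n_fireworks=3, dim=5, steps=50, seed=0):
    rng = random.Random(seed)
    fireworks = [[rng.uniform(-5.0, 5.0) for _ in range(dim)]
                 for _ in range(n_fireworks)]
    initial_best = min(sphere(fw) for fw in fireworks)
    amplitude = 1.0
    for _ in range(steps):
        survivors = []
        for fw in fireworks:
            candidates = gradient_shifted_explosion(
                fw, sphere_grad(fw), 0.1, amplitude, 10, rng)
            candidates.append(fw)  # elitism: never lose the current firework
            survivors.append(min(candidates, key=sphere))
        best = min(survivors, key=sphere)
        fireworks = []
        for fw in survivors:
            moved = attract(fw, best, 0.2)
            # accept the attracted position only if it is not worse
            fireworks.append(moved if sphere(moved) <= sphere(fw) else fw)
        amplitude *= 0.95  # shrink the sampling box over time (assumption)
    final_best = min(sphere(fw) for fw in fireworks)
    return initial_best, final_best
```

Calling `run()` returns the best objective value before and after the loop. Because each firework keeps its current position as a candidate, the best value cannot get worse from one iteration to the next. In a neural-network setting the analytic gradient would be replaced by a minibatch gradient from backpropagation, which this sketch does not attempt.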
Pages: 323-333
Page count: 11