Efficient Hyperparameter Optimization for Convolution Neural Networks in Deep Learning: A Distributed Particle Swarm Optimization Approach

被引:55
|
作者
Guo, Yu [1 ]
Li, Jian-Yu [1 ]
Zhan, Zhi-Hui [1 ]
机构
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China
关键词
Convolution neural network (CNN); deep learning; distributed particle swarm optimization algorithm (DPSO); hyperparameter; particle swarm optimization (PSO); ALGORITHM;
D O I
10.1080/01969722.2020.1827797
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Convolution neural network (CNN) is a kind of powerful and efficient deep learning approach that has obtained great success in many real-world applications. However, due to its complex network structure, the intertwining of hyperparameters, and the time-consuming procedure for network training, finding an efficient network configuration for CNN is a challenging yet tough work. To efficiently solve the hyperparameters setting problem, this paper proposes a distributed particle swarm optimization (DPSO) approach, which can optimize the hyperparameters to find high-performing CNNs. Compared to tedious, historical-experience-based, and personal-preference-based manual designs, the proposed DPSO approach can evolve the hyperparameters automatically and globally to obtain promising CNNs, which provides a new idea and approach for finding the global optimal hyperparameter combination. Moreover, by cooperating with the distributed computing techniques, the DPSO approach can have a considerable speedup when compared with the traditional particle swarm optimization (PSO) algorithm. Extensive experiments on widely-used image classification benchmarks have verified that the proposed DPSO approach can effectively find the CNN model with promising performance, and at the same time, has greatly reduced the computational time when compared with traditional PSO.
引用
收藏
页码:36 / 57
页数:22
相关论文
共 50 条
  • [21] Multiagent Reinforcement Learning for Hyperparameter Optimization of Convolutional Neural Networks
    Iranfar, Arman
    Zapater, Marina
    Atienza, David
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 41 (04) : 1034 - 1047
  • [22] Hyperparameter optimization of neural networks based on Q-learning
    Qi, Xin
    Xu, Bing
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (04) : 1669 - 1676
  • [23] Hyperparameter optimization of neural networks based on Q-learning
    Xin Qi
    Bing Xu
    Signal, Image and Video Processing, 2023, 17 : 1669 - 1676
  • [24] Supervised Learning of Fuzzy ARTMAP Neural Networks Through Particle Swarm Optimization
    Granger, Eric
    Henniges, Philippe
    Sabourin, Robert
    Oliveira, Luiz S.
    JOURNAL OF PATTERN RECOGNITION RESEARCH, 2007, 2 (01): : 27 - 60
  • [25] Parameters Optimization of Deep Learning Models using Particle Swarm Optimization
    Qolomany, Basheer
    Maabreh, Majdi
    Al-Fuqaha, Ala
    Gupta, Ajay
    Benhaddou, Driss
    2017 13TH INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING CONFERENCE (IWCMC), 2017, : 1285 - 1290
  • [26] Application of Particle Swarm Optimization in Fussy Neural Networks
    Wang, Qingnian
    Yan, Kun
    Wan, Xiaofeng
    Yuan, Meiling
    FIFTH INTERNATIONAL CONFERENCE ON INFORMATION ASSURANCE AND SECURITY, VOL 1, PROCEEDINGS, 2009, : 158 - 161
  • [27] Training Neural Networks by Continuation Particle Swarm Optimization
    Rojas-Delgado, Jairo
    Trujillo-Rasua, Rafael
    PROGRESS IN ARTIFICIAL INTELLIGENCE AND PATTERN RECOGNITION, IWAIPR 2018, 2018, 11047 : 59 - 67
  • [28] PARTICLE SWARM OPTIMIZATION FOR NEURAL NETWORK LEARNING ENHANCEMENT
    Hamed, Haza Nuzly Abdull
    Shamsuddin, Siti Mariyam
    Salim, Naomie
    JURNAL TEKNOLOGI, 2008, 49
  • [29] Scour modeling using deep neural networks based on hyperparameter optimization
    Asim, Mohammed
    Rashid, Adnan
    Ahmad, Tanvir
    ICT EXPRESS, 2022, 8 (03): : 357 - 362
  • [30] The optimal combination: Grammatical swarm, particle swarm optimization and neural networks
    de Mingo Lopez, Luis Fernando
    Gomez Blas, Nuria
    Arteta, Alberto
    JOURNAL OF COMPUTATIONAL SCIENCE, 2012, 3 (1-2) : 46 - 55