Efficient Hyperparameter Optimization for Convolution Neural Networks in Deep Learning: A Distributed Particle Swarm Optimization Approach

被引:55
|
作者
Guo, Yu [1 ]
Li, Jian-Yu [1 ]
Zhan, Zhi-Hui [1 ]
机构
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China
关键词
Convolution neural network (CNN); deep learning; distributed particle swarm optimization algorithm (DPSO); hyperparameter; particle swarm optimization (PSO); ALGORITHM;
D O I
10.1080/01969722.2020.1827797
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Convolution neural network (CNN) is a kind of powerful and efficient deep learning approach that has obtained great success in many real-world applications. However, due to its complex network structure, the intertwining of hyperparameters, and the time-consuming procedure for network training, finding an efficient network configuration for CNN is a challenging yet tough work. To efficiently solve the hyperparameters setting problem, this paper proposes a distributed particle swarm optimization (DPSO) approach, which can optimize the hyperparameters to find high-performing CNNs. Compared to tedious, historical-experience-based, and personal-preference-based manual designs, the proposed DPSO approach can evolve the hyperparameters automatically and globally to obtain promising CNNs, which provides a new idea and approach for finding the global optimal hyperparameter combination. Moreover, by cooperating with the distributed computing techniques, the DPSO approach can have a considerable speedup when compared with the traditional particle swarm optimization (PSO) algorithm. Extensive experiments on widely-used image classification benchmarks have verified that the proposed DPSO approach can effectively find the CNN model with promising performance, and at the same time, has greatly reduced the computational time when compared with traditional PSO.
引用
收藏
页码:36 / 57
页数:22
相关论文
共 50 条
  • [31] Video deepfake detection using Particle Swarm Optimization improved deep neural networks
    Leandro Cunha
    Li Zhang
    Bilal Sowan
    Chee Peng Lim
    Yinghui Kong
    Neural Computing and Applications, 2024, 36 : 8417 - 8453
  • [32] Video deepfake detection using Particle Swarm Optimization improved deep neural networks
    Cunha, Leandro
    Zhang, Li
    Lim, Chee Peng
    Sowan, Bilal
    Kong, Yinghui
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (15): : 8417 - 8453
  • [33] An Efficient Particle Swarm Optimization with Multidimensional Mean Learning
    Li, Wei
    Meng, Xiang
    Huang, Ying
    Yang, Junhui
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2021, 35 (03)
  • [34] Optimization Strategies for Urban Waterlogging Warning in Complex Environments: Based on Particle Swarm Optimization and Deep Neural Networks
    Hu, Xiande
    Gu, Fenfei
    ADVANCES IN CIVIL ENGINEERING, 2024, 2024
  • [35] An Efficient Multisensor Hybrid Data Fusion Approach Based on Artificial Neural Networks and Particle Swarm Optimization Algorithms
    Ihonock, Luc Eyembe
    Essiben, Jean-Francois Dikoundou
    Diboma, Benjamin Salomon
    Yong, Joe Suk
    INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2024, 2024
  • [36] Distributed parallel deep learning with a hybrid backpropagation-particle swarm optimization for community detection in large complex networks
    Al-Andoli, Mohammed Nasser
    Tan, Shing Chiang
    Cheah, Wooi Ping
    INFORMATION SCIENCES, 2022, 600 : 94 - 117
  • [37] Hyperparameter Optimization of Deep Learning Networks for Classification of Breast Histopathology Images
    Lin, Cheng-Jian
    Jeng, Shiou-Yun
    Lee, Chin-Ling
    SENSORS AND MATERIALS, 2021, 33 (01) : 315 - 325
  • [38] Task Allocation in Industrial Edge Networks with Particle Swarm Optimization and Deep Reinforcement Learning
    Buschmann, Philippe
    Shorim, Mostafa H. M.
    Helm, Max
    Broering, Arne
    Carle, Georg
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON THE INTERNET OF THINGS 2022, IOT 2022, 2022, : 239 - 247
  • [39] Using Particle Swarm Optimization with Gradient Descent for Parameter Learning in Convolutional Neural Networks
    Wessels, Steven
    van der Haar, Dustin
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2021, 2021, 12702 : 119 - 128
  • [40] Gaussian-Distributed Particle Swarm Optimization: A Novel Gaussian Particle Swarm Optimization
    Lee, Joon-Woo
    Lee, Ju-Jang
    2013 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), 2013, : 1122 - 1127