Hyperparameter optimization of neural networks based on Q-learning

被引:0
|
作者
Xin Qi
Bing Xu
机构
[1] The Hong Kong Polytechnic University,Department of Aeronautical and Aviation Engineering
来源
关键词
Hyperparameter optimization; Q-learning; Neural networks; Markov decision process;
D O I
暂无
中图分类号
学科分类号
摘要
Machine learning algorithms are sensitive to hyperparameters, and hyperparameter optimization techniques are often computationally expensive, especially for complex deep neural networks. In this paper, we use Q-learning algorithm to search for good hyperparameter configurations for neural networks, where the learning agent searches for the optimal hyperparameter configuration by continuously updating the Q-table to optimize hyperparameter tuning strategy. We modify the initial states and termination conditions of Q-learning to improve search efficiency. The experimental results on hyperparameter optimization of a convolutional neural network and a bidirectional long short-term memory network show that our method has higher search efficiency compared with tree of Parzen estimators, random search and genetic algorithm and can find out the optimal or near-optimal hyperparameter configuration of neural network models with minimum number of trials.
引用
收藏
页码:1669 / 1676
页数:7
相关论文
共 50 条
  • [21] Optimization of user behavior based handover using fuzzy Q-learning for LTE networks
    Rana D. Hegazy
    Omar A. Nasr
    Hanan A. Kamal
    Wireless Networks, 2018, 24 : 481 - 495
  • [22] Multipath TCP Path Scheduling Optimization Based on Q-Learning in Vehicular Heterogeneous Networks
    Zhao, Haitao
    Zhang, Mengkang
    Yu, Hongsu
    Mao, Tianqi
    Zhu, Hongbo
    2018 10TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2018,
  • [23] Q-learning with recurrent neural networks as a controller for the inverted pendulum problem
    Onat, A
    Kita, H
    Nishikawa, Y
    ICONIP'98: THE FIFTH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING JOINTLY WITH JNNS'98: THE 1998 ANNUAL CONFERENCE OF THE JAPANESE NEURAL NETWORK SOCIETY - PROCEEDINGS, VOLS 1-3, 1998, : 837 - 840
  • [24] Online Hyperparameter Optimization for Streaming Neural Networks
    Gunasekara, Nuwan
    Gomes, Heitor Murilo
    Pfahringer, Bernhard
    Bifet, Albert
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [25] Parallel hyperparameter optimization of spiking neural networks
    Firmin, Thomas
    Boulet, Pierre
    Talbi, El-Ghazali
    NEUROCOMPUTING, 2024, 609
  • [26] An effective algorithm for hyperparameter optimization of neural networks
    Diaz, G. I.
    Fokoue-Nkoutche, A.
    Nannicini, G.
    Samulowitz, H.
    IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2017, 61 (4-5)
  • [27] Neural Q-learning for solving PDEs
    Cohen, Samuel N.
    Jiang, Deqing
    Sirignano, Justin
    JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [28] A Turbo Q-Learning (TQL) for Energy Efficiency Optimization in Heterogeneous Networks
    Wang, Xiumin
    Li, Lei
    Li, Jun
    Li, Zhengquan
    ENTROPY, 2020, 22 (09)
  • [29] A Population-Based Hybrid Approach for Hyperparameter Optimization of Neural Networks
    Japa, Luis
    Serqueira, Marcello
    Mendonca, Israel
    Aritsugi, Masayoshi
    Bezerra, Eduardo
    Gonzalez, Pedro Henrique
    IEEE ACCESS, 2023, 11 : 50752 - 50768
  • [30] Scour modeling using deep neural networks based on hyperparameter optimization
    Asim, Mohammed
    Rashid, Adnan
    Ahmad, Tanvir
    ICT EXPRESS, 2022, 8 (03): : 357 - 362