Evolution and Role of Optimizers in Training Deep Learning Models

被引:2
|
作者
Wen, XiaoHao [1 ]
Zhou, MengChu [2 ,3 ]
机构
[1] Guangxi Normal Univ, Guilin 541004, Peoples R China
[2] Zhejiang Gongshang Univ, Sch Informat & Elect Engn, Hangzhou 310018, Peoples R China
[3] New Jersey Inst Technol, Helen & John C Hartmann Dept Elect & Comp Engn, Newark, NJ 07102 USA
关键词
D O I
10.1109/JAS.2024.124806
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To perform well, deep learning (DL) models have to be trained well. Which optimizer should be adopted? We answer this question by discussing how optimizers have evolved from traditional methods like gradient descent to more advanced techniques to address challenges posed by high-dimensional and non-convex problem space. Ongoing challenges include their hyperparameter sensitivity, balancing between convergence and generalization performance, and improving interpretability of optimization processes. Researchers continue to seek robust, efficient, and universally applicable optimizers to advance the field of DL across various domains.
引用
收藏
页码:2039 / 2042
页数:4
相关论文
共 50 条
  • [41] Performance Analysis and Characterization of Training Deep Learning Models on Mobile Device
    Liu, Jie
    Liu, Jiawen
    Du, Wan
    Li, Dong
    2019 IEEE 25TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2019, : 506 - 515
  • [42] Performance Analysis of Different Optimizers for Deep Learning-Based Image Recognition
    Postalcioglu, Seda
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2020, 34 (02)
  • [43] Impact of Deep Learning Optimizers and Hyperparameter Tuning on the Performance of Bearing Fault Diagnosis
    Lee, Seongjae
    Kim, Taehyoun
    IEEE ACCESS, 2023, 11 : 55046 - 55070
  • [44] AdaSwarm: Augmenting Gradient-Based Optimizers in Deep Learning With Swarm Intelligence
    Mohapatra, Rohan
    Saha, Snehanshu
    Coello, Carlos A. Coello
    Bhattacharya, Anwesh
    Dhavala, Soma S.
    Saha, Sriparna
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2022, 6 (02): : 329 - 340
  • [45] Characterize and Compare the Performance of Deep Learning Optimizers in Recurrent Neural Network Architectures
    Zaeed, Mohammad
    Islam, Tanzima Z.
    Indic, Vladimir
    2024 IEEE 48TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE, COMPSAC 2024, 2024, : 39 - 44
  • [46] On the Influence of Optimizers in Deep Learning-Based Side-Channel Analysis
    Perin, Guilherme
    Picek, Stjepan
    SELECTED AREAS IN CRYPTOGRAPHY, 2021, 12804 : 615 - 636
  • [47] Selecting the best optimizers for deep learning-based medical image segmentation
    Mortazi, Aliasghar
    Cicek, Vedat
    Keles, Elif
    Bagci, Ulas
    FRONTIERS IN RADIOLOGY, 2023, 3
  • [48] Role of deep learning models and analytics in industrial multimedia environment
    Qureshi, Nawab Muhammad Faseeh
    Menon, Varun G.
    Bashir, Ali Kashif
    Mumtaz, Shahid
    Mehmood, Irfan
    MULTIMEDIA SYSTEMS, 2023, 29 (03) : 1663 - 1664
  • [49] Role of deep learning models and analytics in industrial multimedia environment
    Nawab Muhammad Faseeh Qureshi
    Varun G. Menon
    Ali Kashif Bashir
    Shahid Mumtaz
    Irfan Mehmood
    Multimedia Systems, 2023, 29 : 1663 - 1664
  • [50] Distributed Training for Deep Learning Models On An Edge Computing Network Using Shielded Reinforcement Learning
    Sen, Tanmoy
    Shen, Haiying
    2022 IEEE 42ND INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2022), 2022, : 581 - 591