Research on three-step accelerated gradient algorithm in deep learning

被引:0
|
作者
Lian, Yongqiang [1 ,2 ]
Tang, Yincai [1 ]
Zhou, Shirong [1 ]
机构
[1] East China Normal Univ, Sch Stat, KLATASDS MOE, Shanghai, Peoples R China
[2] East China Normal Univ, Sch Stat, KLATASDS MOE, Shanghai 200062, Peoples R China
基金
中国国家自然科学基金;
关键词
Accelerated algorithm; backpropagation; deep learning; learning rate; momentum; stochastic gradient descent;
D O I
10.1080/24754269.2020.1846414
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Gradient descent (GD) algorithm is the widely used optimisation method in training machine learning and deep learning models. In this paper, based on GD, Polyak's momentum (PM), and Nesterov accelerated gradient (NAG), we give the convergence of the algorithms from an initial value to the optimal value of an objective function in simple quadratic form. Based on the convergence property of the quadratic function, two sister sequences of NAG's iteration and parallel tangent methods in neural networks, the three-step accelerated gradient (TAG) algorithm is proposed, which has three sequences other than two sister sequences. To illustrate the performance of this algorithm, we compare the proposed algorithm with the three other algorithms in quadratic function, high-dimensional quadratic functions, and nonquadratic function. Then we consider to combine the TAG algorithm to the backpropagation algorithm and the stochastic gradient descent algorithm in deep learning. For conveniently facilitate the proposed algorithms, we rewite the R package 'neuralnet' and extend it to 'supneuralnet'. All kinds of deep learning algorithms in this paper are included in 'supneuralnet' package. Finally, we show our algorithms are superior to other algorithms in four case studies.
引用
收藏
页码:40 / 57
页数:18
相关论文
共 50 条
  • [31] A Three-Step Automated Segmentation Method for Early Cervical Cancer MRI Images Based on Deep Learning
    Xiong, Liu
    Chen, Chunxia
    Lin, Yongping
    Song, Zhiyu
    Su, Jialin
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2025, 35 (01)
  • [32] A three-step health services research approach to improve prescribing
    Rose, Adam J.
    McCullough, Megan B.
    Jasuja, Guneet K.
    HEALTHCARE-THE JOURNAL OF DELIVERY SCIENCE AND INNOVATION, 2018, 6 (02): : 135 - 138
  • [33] A Three-Step Deep Neural Network Methodology for Exchange Rate Forecasting
    Carlos Figueroa-Garcia, Juan
    Lopez-Santana, Eduyn
    Franco-Franco, Carlos
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2017, PT I, 2017, 10361 : 786 - 795
  • [34] A three-step optimization-based algorithm for home healthcare delivery
    Guo, Jia
    Bard, Jonathan F.
    SOCIO-ECONOMIC PLANNING SCIENCES, 2023, 87
  • [35] Three-step iterative algorithm for multivalued nonepansive mappings in CAT(κ) spaces
    Saluja, G.S.
    UPB Scientific Bulletin, Series A: Applied Mathematics and Physics, 2019, 81 (03): : 53 - 66
  • [36] A new three-step iterative algorithm for solving the split feasibility problem
    Feng, Meiling
    Shi, Luoyi
    Chen, Rudong
    UPB Scientific Bulletin, Series C: Electrical Engineering and Computer Science, 2019, 81 (01): : 93 - 102
  • [37] Approach to Management of Insomnia in Primary Care Settings: A Three-step Algorithm
    Gulati, Meghal
    Samantray, Swayanka
    Panda, Udit Kumar
    Pattnaik, Jigyansa Ipsita
    Ravan, J. P. R.
    INDIAN JOURNAL OF PSYCHIATRY, 2024, 66 : S54 - S55
  • [38] Three-step Harmonic Solvmanifolds
    Chal Benson
    Tracy L. Payne
    Gail Ratcliff
    Geometriae Dedicata, 2003, 101 : 103 - 127
  • [39] A three-step plan for antibiotics
    不详
    NATURE, 2014, 509 (7502) : 533 - 533
  • [40] A three-step synthesis of halomon
    Sotokawa, T
    Noda, T
    Pi, S
    Hirama, M
    ANGEWANDTE CHEMIE-INTERNATIONAL EDITION, 2000, 39 (19) : 3430 - +