Research on three-step accelerated gradient algorithm in deep learning

Cited by: 0
Authors
Lian, Yongqiang [1 ,2 ]
Tang, Yincai [1 ]
Zhou, Shirong [1 ]
Affiliations
[1] East China Normal Univ, Sch Stat, KLATASDS MOE, Shanghai, Peoples R China
[2] East China Normal Univ, Sch Stat, KLATASDS MOE, Shanghai 200062, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Accelerated algorithm; backpropagation; deep learning; learning rate; momentum; stochastic gradient descent;
DOI
10.1080/24754269.2020.1846414
Chinese Library Classification (CLC) number
O21 [Probability theory and mathematical statistics]; C8 [Statistics];
Discipline classification code
020208 ; 070103 ; 0714 ;
Abstract
The gradient descent (GD) algorithm is a widely used optimisation method for training machine learning and deep learning models. In this paper, based on GD, Polyak's momentum (PM), and Nesterov accelerated gradient (NAG), we give the convergence of the algorithms from an initial value to the optimal value of an objective function in simple quadratic form. Based on the convergence property of the quadratic function, the two sister sequences of NAG's iteration, and parallel tangent methods in neural networks, we propose the three-step accelerated gradient (TAG) algorithm, which has three sequences rather than two sister sequences. To illustrate the performance of this algorithm, we compare it with the three other algorithms on a quadratic function, high-dimensional quadratic functions, and a nonquadratic function. We then consider combining the TAG algorithm with the backpropagation algorithm and the stochastic gradient descent algorithm in deep learning. To facilitate use of the proposed algorithms, we rewrite the R package 'neuralnet' and extend it to 'supneuralnet'. All the deep learning algorithms in this paper are included in the 'supneuralnet' package. Finally, we show that our algorithms are superior to other algorithms in four case studies.
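The three baseline methods named in the abstract (GD, PM, and NAG) can be compared on a simple quadratic objective with a minimal sketch like the one below. The objective f(x) = 0.5 * sum(a_i * x_i^2), the curvatures, the step size, the momentum coefficient, and all function names are assumptions chosen for illustration and are not taken from the paper. The TAG update is not reproduced here because the abstract does not specify it.

```python
# Illustrative comparison of GD, Polyak's momentum (PM), and Nesterov
# accelerated gradient (NAG) on a diagonal quadratic. All constants and
# names are hypothetical; this is not the paper's experimental setup.

def f(x, a):
    """Objective f(x) = 0.5 * sum(a_i * x_i^2), minimised at x = 0."""
    return 0.5 * sum(ai * xi * xi for ai, xi in zip(a, x))

def grad(x, a):
    """Gradient of f: component i is a_i * x_i."""
    return [ai * xi for ai, xi in zip(a, x)]

def gd(x0, a, lr, steps):
    """Plain gradient descent: x <- x - lr * grad(x)."""
    x = list(x0)
    for _ in range(steps):
        g = grad(x, a)
        x = [xi - lr * gi for xi, gi in zip(x, g)]
    return x

def polyak(x0, a, lr, mu, steps):
    """Polyak's momentum: v <- mu * v - lr * grad(x); x <- x + v."""
    x = list(x0)
    v = [0.0] * len(x)
    for _ in range(steps):
        g = grad(x, a)
        v = [mu * vi - lr * gi for vi, gi in zip(v, g)]
        x = [xi + vi for xi, vi in zip(x, v)]
    return x

def nag(x0, a, lr, mu, steps):
    """NAG: same as PM, but the gradient is taken at the look-ahead point."""
    x = list(x0)
    v = [0.0] * len(x)
    for _ in range(steps):
        look = [xi + mu * vi for xi, vi in zip(x, v)]
        g = grad(look, a)
        v = [mu * vi - lr * gi for vi, gi in zip(v, g)]
        x = [xi + vi for xi, vi in zip(x, v)]
    return x

# Assumed ill-conditioned problem: curvatures 1 and 100.
A = [1.0, 100.0]
X0 = [1.0, 1.0]
LR, MU, STEPS = 0.009, 0.9, 200

results = {
    "GD": f(gd(X0, A, LR, STEPS), A),
    "PM": f(polyak(X0, A, LR, MU, STEPS), A),
    "NAG": f(nag(X0, A, LR, MU, STEPS), A),
}
for name, value in results.items():
    print(name, value)
```

With these assumed settings, the two momentum methods end at a lower objective value than plain GD after the same number of steps, which is the kind of gap the accelerated methods in the abstract aim to widen further.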
Pages: 40-57
Number of pages: 18