Accelerating neural network training using weight extrapolations

被引:91
|
作者
Kamarthi, SV [1 ]
Pittner, S [1 ]
机构
[1] Northeastern Univ, Dept Mech Ind & Mfg Engn, Snell Eng Ctr 334, Boston, MA 02115 USA
关键词
multilayer neural networks; backpropagation algorithm; convergence acceleration; extrapolation methods; parameter estimation; linear regression; conjugate gradient method; relative entropy;
D O I
10.1016/S0893-6080(99)00072-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The backpropagation (BP) algorithm for training feedforward neural networks has proven robust even for difficult problems. However, its high performance results are attained at the expense of a long training time to adjust the network parameters, which can be discouraging in many real-world applications. Even on relatively simple problems, standard BP often requires a lengthy training process in which the complete set of training examples is processed hundreds or thousands of times. In this paper, a universal acceleration technique for the BP algorithm based on extrapolation of each individual interconnection weight is presented. This extrapolation procedure is easy to implement and is activated only a few times in between iterations of the conventional BP algorithm. This procedure, unlike earlier acceleration procedures, minimally alters the computational structure of the BP algorithm. The viability of this new approach is demonstrated on three examples. The results suggest that it leads to significant savings in computation time of the standard BP algorithm. Moreover, the solution computed by the proposed approach is always located in close proximity to the one obtained by the conventional BP procedure. Hence, the proposed method provides a real acceleration of the BP algorithm without degrading the usefulness of its solutions. The performance of the new method is also compared with that of the conjugate gradient algorithm, which is an improved and faster version of the BP algorithm. (C) 1999 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:1285 / 1299
页数:15
相关论文
共 50 条
  • [1] Accelerating convolutional neural network training using ProMoD backpropagation algorithm
    Gurhanli, Ahmet
    IET IMAGE PROCESSING, 2020, 14 (13) : 2957 - 2964
  • [2] Accelerating Neural Network Training: A Brief Review
    Nokhwal, Sahil
    Chilakalapudi, Priyanka
    Donekal, Preeti
    Nokhwal, Suman
    Pahune, Saurabh
    Chaudhary, Ankit
    2024 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS, METAHEURISTICS & SWARM INTELLIGENCE, ISMSI 2024, 2024, : 31 - 35
  • [3] FPRaker: A Processing Element For Accelerating Neural Network Training
    Awad, Omar Mohamed
    Mahmoud, Mostafa
    Edo, Isak
    Zadeh, Ali Hadi
    Bannon, Ciaran
    Jayarajan, Anand
    Pekhimenko, Gennady
    Moshovos, Andreas
    PROCEEDINGS OF 54TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, MICRO 2021, 2021, : 857 - 869
  • [4] Accelerating Data Loading in Deep Neural Network Training
    Yang, Chih-Chieh
    Cong, Guojing
    2019 IEEE 26TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS (HIPC), 2019, : 235 - 245
  • [5] Neural network weight training by mutation
    Jenkins, W. M.
    COMPUTERS & STRUCTURES, 2006, 84 (31-32) : 2107 - 2112
  • [6] Centered Weight Normalization in Accelerating Training of Deep Neural Networks
    Huang, Lei
    Liu, Xianglong
    Liu, Yang
    Lang, Bo
    Tao, Dacheng
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2822 - 2830
  • [7] Accelerating Neural Network Training using Arbitrary Precision Approximating Matrix Multiplication Algorithms
    Ballard, Grey
    Weissenberger, Jack
    Zhang, Luoping
    50TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOP PROCEEDINGS - ICPP WORKSHOPS '21, 2021,
  • [8] GraphNorm: A Principled Approach to Accelerating Graph Neural Network Training
    Cai, Tianle
    Luo, Shengjie
    Xu, Keyulu
    He, Di
    Liu, Tie-Yan
    Wang, Liwei
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [9] fuseGNN: Accelerating Graph Convolutional Neural Network Training on GPGPU
    Chen, Zhaodong
    Yan, Mingyu
    Zhu, Maohua
    Deng, Lei
    Li, Guoqi
    Li, Shuangchen
    Xie, Yuan
    2020 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED-DESIGN (ICCAD), 2020,
  • [10] Accelerating Neural Network Training with Processing-in-Memory GPU
    Fei, Xiang
    Han, Jianhui
    Huang, Jianqiang
    Zheng, Weimin
    Zhang, Youhui
    2022 22ND IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2022), 2022, : 414 - 421