Accelerating neural network training using weight extrapolations

被引：91

作者：

Kamarthi, SV ^{[1
]}

Pittner, S ^{[1
]}

机构：

[1] Northeastern Univ, Dept Mech Ind & Mfg Engn, Snell Eng Ctr 334, Boston, MA 02115 USA

来源：

NEURAL NETWORKS | 1999年 / 12卷 / 09期

关键词：

multilayer neural networks; backpropagation algorithm; convergence acceleration; extrapolation methods; parameter estimation; linear regression; conjugate gradient method; relative entropy;

D O I：

10.1016/S0893-6080(99)00072-6

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The backpropagation (BP) algorithm for training feedforward neural networks has proven robust even for difficult problems. However, its high performance results are attained at the expense of a long training time to adjust the network parameters, which can be discouraging in many real-world applications. Even on relatively simple problems, standard BP often requires a lengthy training process in which the complete set of training examples is processed hundreds or thousands of times. In this paper, a universal acceleration technique for the BP algorithm based on extrapolation of each individual interconnection weight is presented. This extrapolation procedure is easy to implement and is activated only a few times in between iterations of the conventional BP algorithm. This procedure, unlike earlier acceleration procedures, minimally alters the computational structure of the BP algorithm. The viability of this new approach is demonstrated on three examples. The results suggest that it leads to significant savings in computation time of the standard BP algorithm. Moreover, the solution computed by the proposed approach is always located in close proximity to the one obtained by the conventional BP procedure. Hence, the proposed method provides a real acceleration of the BP algorithm without degrading the usefulness of its solutions. The performance of the new method is also compared with that of the conjugate gradient algorithm, which is an improved and faster version of the BP algorithm. (C) 1999 Elsevier Science Ltd. All rights reserved.

引用

页码：1285 / 1299

页数：15

共 50 条

[1] Accelerating convolutional neural network training using ProMoD backpropagation algorithm
Gurhanli, Ahmet
IET IMAGE PROCESSING, 2020, 14 (13) : 2957 - 2964
[2] Accelerating Neural Network Training: A Brief Review
Nokhwal, Sahil
Chilakalapudi, Priyanka
Donekal, Preeti
Nokhwal, Suman
Pahune, Saurabh
Chaudhary, Ankit
2024 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS, METAHEURISTICS & SWARM INTELLIGENCE, ISMSI 2024, 2024, : 31 - 35
[3] FPRaker: A Processing Element For Accelerating Neural Network Training
Awad, Omar Mohamed
Mahmoud, Mostafa
Edo, Isak
Zadeh, Ali Hadi
Bannon, Ciaran
Jayarajan, Anand
Pekhimenko, Gennady
Moshovos, Andreas
PROCEEDINGS OF 54TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, MICRO 2021, 2021, : 857 - 869
[4] Accelerating Data Loading in Deep Neural Network Training
Yang, Chih-Chieh
Cong, Guojing
2019 IEEE 26TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS (HIPC), 2019, : 235 - 245
[5] Neural network weight training by mutation
Jenkins, W. M.
COMPUTERS & STRUCTURES, 2006, 84 (31-32) : 2107 - 2112
[6] Centered Weight Normalization in Accelerating Training of Deep Neural Networks
Huang, Lei
Liu, Xianglong
Liu, Yang
Lang, Bo
Tao, Dacheng
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2822 - 2830
[7] Accelerating Neural Network Training using Arbitrary Precision Approximating Matrix Multiplication Algorithms
Ballard, Grey
Weissenberger, Jack
Zhang, Luoping
50TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOP PROCEEDINGS - ICPP WORKSHOPS '21, 2021,
[8] GraphNorm: A Principled Approach to Accelerating Graph Neural Network Training
Cai, Tianle
Luo, Shengjie
Xu, Keyulu
He, Di
Liu, Tie-Yan
Wang, Liwei
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
[9] fuseGNN: Accelerating Graph Convolutional Neural Network Training on GPGPU
Chen, Zhaodong
Yan, Mingyu
Zhu, Maohua
Deng, Lei
Li, Guoqi
Li, Shuangchen
Xie, Yuan
2020 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED-DESIGN (ICCAD), 2020,
[10] Accelerating Neural Network Training with Processing-in-Memory GPU
Fei, Xiang
Han, Jianhui
Huang, Jianqiang
Zheng, Weimin
Zhang, Youhui
2022 22ND IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2022), 2022, : 414 - 421

← 1 2 3 4 5 →