Distributed Asynchronous Optimization of Convolutional Neural Networks

Cited by: 0
Authors
Chan, William [1]
Lane, Ian [1, 2]
Affiliations
[1] Carnegie Mellon Univ, Elect & Comp Engn, Pittsburgh, PA 15213 USA
[2] Carnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA
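Keywords
deep neural network; distributed optimization
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract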
Recently, deep Convolutional Neural Networks have been shown to outperform Deep Neural Networks for acoustic modelling, producing state-of-the-art accuracy in speech recognition tasks. Convolutional models provide increased model robustness through the use of pooling invariance and weight sharing across spectrum and time. However, training convolutional models is a computationally expensive optimization procedure, especially when combined with large training corpora. In this paper, we present a novel algorithm for scalable training of deep Convolutional Neural Networks across multiple GPUs. Our distributed asynchronous stochastic gradient descent algorithm incorporates sparse gradients, momentum and gradient decay to accelerate the training of these networks. Our approach is stable, requiring neither warm-starting nor excessively large minibatches. It enables convolutional models to be trained efficiently across multiple GPUs, scaling asynchronously across 5 GPU workers with 68% efficiency.
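To put the closing figure in concrete terms, the short sketch below converts the reported efficiency into an implied speedup. It assumes the common definition of parallel efficiency as speedup divided by the number of workers; the abstract does not state which definition the authors use, and the function name `implied_speedup` is hypothetical.

```python
def implied_speedup(num_workers: int, efficiency: float) -> float:
    """Speedup over a single worker.

    Assumption (not stated in the abstract): efficiency = speedup / num_workers.
    """
    return num_workers * efficiency


# Figures taken from the abstract: 5 GPU workers at 68% efficiency.
speedup = implied_speedup(num_workers=5, efficiency=0.68)
print(f"Implied speedup over one GPU: {speedup:.1f}x")
```

Under that assumption, 5 workers at 68% efficiency correspond to roughly a 3.4x speedup over a single GPU.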
Pages: 1073-1077
Page count: 5