Dissecting Convolutional Neural Networks for Runtime and Scalability Prediction

Cited by: 0
Authors
Beringer, Tim [1]
Mazaheri, Arya [1]
Stock, Jakob [1]
Wolf, Felix [1]
Affiliations
[1] Tech Univ Darmstadt, Darmstadt, Hesse, Germany
Keywords
Artificial intelligence; deep neural networks; convolution; performance modeling; distributed training; scalability
DOI
10.1145/3673038.3673107
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Given the computational complexity of deep neural networks (DNNs), accurate prediction of their training and inference time using performance modeling is crucial for efficient infrastructure planning and DNN development. However, existing methods often predict only the inference time and rely on exhaustive benchmarking and fine-tuning, making them time-consuming and restricted in scope. As a remedy, we propose ConvMeter, a novel yet simple performance model that considers the inherent characteristics of DNNs, such as architecture, dataset, and target hardware, which strongly affect their runtime and scalability. Our performance model, which has been thoroughly tested on convolutional neural networks (ConvNets), a class of DNNs widely used for image analysis, predicts both inference and training time, the latter on one or more compute nodes. Experiments with various ConvNets demonstrate that our runtime predictions for the inference and training phases achieve average error rates of less than 20% and 18%, respectively, making the assessment of ConvNets regarding efficiency and scalability straightforward.
Pages: 168-178
Number of pages: 11