Dissecting Convolutional Neural Networks for Runtime and Scalability Prediction

Cited by: 0

Authors:

Beringer, Tim [1]

Mazaheri, Arya [1]

Stock, Jakob [1]

Wolf, Felix [1]

Affiliations:

[1] Tech Univ Darmstadt, Darmstadt, Hesse, Germany

Source:

53RD INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2024 | 2024

Keywords:

Artificial intelligence; deep neural networks; convolution; performance modeling; distributed training; scalability

DOI:

10.1145/3673038.3673107

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology]

Discipline classification code:

0812

Abstract:

Given the computational complexity of deep neural networks (DNNs), accurate prediction of their training and inference time using performance modeling is crucial for efficient infrastructure planning and DNN development. However, existing methods often predict only the inference time and rely on exhaustive benchmarking and fine-tuning, making them time-consuming and restricted in scope. As a remedy, we propose ConvMeter, a novel yet simple performance model that considers the inherent characteristics of DNNs, such as architecture, dataset, and target hardware, which strongly affect their runtime and scalability. Our performance model, which has been thoroughly tested on convolutional neural networks (ConvNets), a class of DNNs widely used for image analysis, predicts both inference and training time, the latter on one or more compute nodes. Experiments with various ConvNets demonstrate that our runtime predictions for the inference and training phases achieve average error rates of less than 20% and 18%, respectively, making the assessment of ConvNets regarding efficiency and scalability straightforward.


Pages: 168-178

Page count: 11

