Dynamic Neural Network to Enable Run-Time Trade-off between Accuracy and Latency

被引:0
|
作者
Yang, Li [1 ]
Fan, Deliang [1 ]
机构
[1] Arizona State Univ, Tempe, AZ 85281 USA
基金
美国国家科学基金会;
关键词
dynamic neural networks;
D O I
10.1145/3394885.3431628
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
To deploy powerful deep neural network (DNN) into smart, but resource limited IoT devices, many prior works have been proposed to compress DNN to reduce the network size and computation complexity with negligible accuracy degradation, such as weight quantization, network pruning, convolution decomposition, etc. However, by utilizing conventional DNN compression methods, a smaller, but fixed, network is generated from a relative large background model to achieve resource limited hardware acceleration. However, such optimization lacks the ability to adjust its structure in real-time to adapt for a dynamic computing hardware resource allocation and workloads. In this paper, we mainly review our two prior works [13, 15] to tackle this challenge, discussing how to construct a dynamic DNN by means of either uniform or non-uniform sub-nets generation methods. Moreover, to generate multiple non-uniform sub-nets, [15] needs to fully retrain the background model for each sub-net individually, named as multi-path method. To reduce the training cost, in this work, we further propose a single-path sub-nets generation method that can sample multiple sub-nets in different epochs within one training round. The constructed dynamic DNN, consisting of multiple sub-nets, provides the ability to run-time trade-off the inference accuracy and latency according to hardware resources and environment requirements. In the end, we study the the dynamic DNNs with different sub-nets generation methods on both CIFAR-10 and ImageNet dataset. We also present the run-time tuning of accuracy and latency on both GPU and CPU.
引用
收藏
页码:587 / 592
页数:6
相关论文
共 50 条
  • [31] THE CASCADE NEURAL-NETWORK MODEL AND A SPEED-ACCURACY TRADE-OFF OF ARM MOVEMENT
    HIRAYAMA, M
    KAWATO, M
    JORDAN, MI
    JOURNAL OF MOTOR BEHAVIOR, 1993, 25 (03) : 162 - 174
  • [32] Realistic Simulation of Extraterrestrial Legged Robot in Trade-off between Accuracy and Simulation Time
    Yoo, Yong-Ho
    Ahmed, Mohammed
    Bartsch, Sebastian
    Kirchner, Frank
    IECON 2010 - 36TH ANNUAL CONFERENCE ON IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2010,
  • [33] Improving the trade-off between simulation time and accuracy in efficiency calibrations with the code DETEFF
    Cornejo Diaz, N.
    Jurado Vargas, M.
    APPLIED RADIATION AND ISOTOPES, 2010, 68 (7-8) : 1413 - 1417
  • [34] On the Trade-Off Between Multi-Level Security Classification Accuracy and Training Time
    Engelstad, Paal
    2015 THIRD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, MODELLING AND SIMULATION (AIMS 2015), 2015, : 349 - 355
  • [35] On the Trade-Off Between Efficiency and Precision of Neural Abstraction
    Edwards, Alec
    Giacobbe, Mirco
    Abate, Alessandro
    QUANTITATIVE EVALUATION OF SYSTEMS, QEST 2023, 2023, 14287 : 152 - 171
  • [36] The speed-accuracy trade-off in space-time
    Hsieh, Tsung-Yu
    Liu, Yeou-Teh
    Newell, Karl M.
    JOURNAL OF SPORT & EXERCISE PSYCHOLOGY, 2011, 33 : S75 - S75
  • [37] Trade-off between accuracy and interpretability for predictive in silico modeling
    Johansson, Ulf
    Sonstrod, Cecilia
    Norinder, Ulf
    Bostrom, Henrik
    FUTURE MEDICINAL CHEMISTRY, 2011, 3 (06) : 647 - 663
  • [38] On the Trade-off Between Accuracy and Delay in Cooperative UWB Navigation
    Garcia, Gabriel E.
    Muppirisetty, L. Srikar
    Wymeersch, Henk
    2013 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2013, : 1603 - 1608
  • [39] Analysing the trade-off between comprehensibility and accuracy in mimetic models
    Blanco-Vega, R
    Hernández-Orallo, J
    Ramírez-Quintana, MJ
    DISCOVERY SCIENCE, PROCEEDINGS, 2004, 3245 : 338 - 346
  • [40] The Trade-Off between Accuracy and Accessibility of Syphilis Screening Assays
    Smit, Pieter W.
    Mabey, David
    Changalucha, John
    Mngara, Julius
    Clark, Benjamin
    Andreasen, Aura
    Todd, Jim
    Urassa, Mark
    Zaba, Basia
    Peeling, Rosanna W.
    PLOS ONE, 2013, 8 (09):