Dynamic Neural Network to Enable Run-Time Trade-off between Accuracy and Latency

被引:0
|
作者
Yang, Li [1 ]
Fan, Deliang [1 ]
机构
[1] Arizona State Univ, Tempe, AZ 85281 USA
基金
美国国家科学基金会;
关键词
dynamic neural networks;
D O I
10.1145/3394885.3431628
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
To deploy powerful deep neural network (DNN) into smart, but resource limited IoT devices, many prior works have been proposed to compress DNN to reduce the network size and computation complexity with negligible accuracy degradation, such as weight quantization, network pruning, convolution decomposition, etc. However, by utilizing conventional DNN compression methods, a smaller, but fixed, network is generated from a relative large background model to achieve resource limited hardware acceleration. However, such optimization lacks the ability to adjust its structure in real-time to adapt for a dynamic computing hardware resource allocation and workloads. In this paper, we mainly review our two prior works [13, 15] to tackle this challenge, discussing how to construct a dynamic DNN by means of either uniform or non-uniform sub-nets generation methods. Moreover, to generate multiple non-uniform sub-nets, [15] needs to fully retrain the background model for each sub-net individually, named as multi-path method. To reduce the training cost, in this work, we further propose a single-path sub-nets generation method that can sample multiple sub-nets in different epochs within one training round. The constructed dynamic DNN, consisting of multiple sub-nets, provides the ability to run-time trade-off the inference accuracy and latency according to hardware resources and environment requirements. In the end, we study the the dynamic DNNs with different sub-nets generation methods on both CIFAR-10 and ImageNet dataset. We also present the run-time tuning of accuracy and latency on both GPU and CPU.
引用
收藏
页码:587 / 592
页数:6
相关论文
共 50 条
  • [41] Dynamic Optimization of the Latency Throughput Trade-off in Parallel Chain Distributed Ledgers
    Campones, Pedro
    Domingos, Henrique
    39TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2024, 2024, : 226 - 234
  • [42] Trade-off between sample size and accuracy: Case of dynamic measurements under interval uncertainty
    Nguyen, Hung T.
    Kosheleva, Olga
    Kreinovich, Vladik
    Ferson, Scott
    INTERVAL / PROBABILISTIC UNCERTAINTY AND NON-CLASSICAL LOGICS, 2008, 46 : 45 - +
  • [43] Trade-off between cost and accuracy in large-scale surface water dynamic modeling
    Getirana, Augusto
    Peters-Lidard, Christa
    Rodell, Matthew
    Bates, Paul D.
    WATER RESOURCES RESEARCH, 2017, 53 (06) : 4942 - 4955
  • [44] Semantic similarity controllers: On the trade-off between accuracy and interpretability
    Martinez-Gil, Jorge
    Chaves-Gonzalez, Jose Manuel
    KNOWLEDGE-BASED SYSTEMS, 2021, 234
  • [45] The latency accuracy trade-off and optimization in implied volatility-based trading systems
    Guo, Guoxiang
    Qi, Yuanyuan
    Lai, Sirui
    Liu, Zhi
    Yen, Joseph
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 221
  • [46] Neural mechanisms for executive control of speed-accuracy trade-off
    Reppert, Thomas R.
    Heitz, Richard P.
    Schall, Jeffrey D.
    CELL REPORTS, 2023, 42 (11):
  • [47] ACCURACY-ROBUSTNESS TRADE-OFF FOR POSITIVELY WEIGHTED NEURAL NETWORKS
    Neacsu, Ana
    Pesquet, Jean-Christophe
    Burileanu, Corneliu
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 8389 - 8393
  • [48] Accuracy/Speed Trade-off Technique for Dynamic Programing Based Algorithms
    Bonny, Talal
    2016 5TH INTERNATIONAL CONFERENCE ON ELECTRONIC DEVICES, SYSTEMS AND APPLICATIONS (ICEDSA), 2016,
  • [49] Efficiency-Accuracy Trade-off for Spectrum Sensing in Cognitive Network
    Sun Wujian
    Dai Weiguo
    Zhang Lei
    Liu Yang
    Li Ou
    PROCEEDINGS OF THE 2012 SECOND INTERNATIONAL CONFERENCE ON INSTRUMENTATION & MEASUREMENT, COMPUTER, COMMUNICATION AND CONTROL (IMCCC 2012), 2012, : 1136 - 1141
  • [50] Trade-off between seed dispersal in space and time
    Chen, Si-Chong
    Poschlod, Peter
    Antonelli, Alexandre
    Liu, Udayangani
    Dickie, John B.
    ECOLOGY LETTERS, 2020, 23 (11) : 1635 - 1642