A Review of State-of-the-art Mixed-Precision Neural Network Frameworks

Cited by: 0
Authors
Rakka, Mariam [1]
Fouda, Mohammed E. [2]
Khargonekar, Pramod [1]
Kurdahi, Fadi [1]
Affiliations
[1] Univ Calif Irvine, Ctr Embedded & Cyber Phys Syst, Irvine, CA 92697 USA
[2] Rain Neuromorph Inc, San Francisco, CA 94110 USA
Keywords
Deep neural networks; mixed-precision neural networks; edge inference; quantization; computational complexity; ALGORITHMS;
DOI
10.1109/TPAMI.2024.3394390
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Mixed-precision Deep Neural Networks (DNNs) provide an efficient solution for hardware deployment, especially under resource constraints, while maintaining model accuracy. Identifying the ideal bit precision for each layer, however, remains a challenge given the vast array of models, datasets, and quantization schemes, leading to an expansive search space. Recent literature has addressed this challenge, resulting in several promising frameworks. This paper offers a comprehensive overview of the standard quantization classifications prevalent in existing studies. A detailed survey of current mixed-precision frameworks is provided, with an in-depth comparative analysis highlighting their respective merits and limitations. The paper concludes with insights into potential avenues for future research in this domain.
Pages: 7793 - 7812
Page count: 20
Related papers
50 in total
  • [41] Reverse Logistics Network Design: A State-of-the-art Literature Review
    Chanintrakul, Piyawat
    Mondragon, Adrian E. Coronado
    Lalwani, Chandra
    ICPOM2008: PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE OF PRODUCTION AND OPERATION MANAGEMENT, VOLUMES 1-3, 2008, : 1310 - 1315
  • [42] A State-of-the-Art Review on Synchrophasor Applications to Power Network Protection
    Prabhu, M. S.
    Nayak, Paresh Kumar
    ADVANCES IN POWER SYSTEMS AND ENERGY MANAGEMENT, 2018, 436
  • [43] Optimized co-scheduling of mixed-precision neural network accelerator for real-time multitasking applications
    Jiang, Wei
    Song, Ziwei
    Zhan, Jinyu
    He, Zhiyuan
    Wen, Xiangyu
    Jiang, Ke
    JOURNAL OF SYSTEMS ARCHITECTURE, 2020, 110 (110)
  • [44] Entropy-Driven Mixed-Precision Quantization for Deep Network Design
    Sun, Zhenhong
    Ge, Ce
    Wang, Junyan
    Lin, Ming
    Chen, Hesen
    Li, Hao
    Sun, Xiuyu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [45] Mixed-precision weights network for field-programmable gate array
    Fuengfusin, Ninnart
    Tamukoh, Hakaru
    PLOS ONE, 2021, 16 (05):
  • [46] Comparing State-of-the-Art Neural Network Ensemble Methods in Soccer Predictions
    Mendes-Neves, Tiago
    Mendes-Moreira, Joao
    FOUNDATIONS OF INTELLIGENT SYSTEMS (ISMIS 2020), 2020, 12117 : 139 - 149
  • [47] Neural-network-based target tracking state-of-the-art survey
    Amoozegar, F
    OPTICAL ENGINEERING, 1998, 37 (03) : 836 - 846
  • [48] A comparison between state-of-the-art and neural network modelling of solar collectors
    Fischer, Stephan
    Frey, Patrick
    Druck, Harald
    SOLAR ENERGY, 2012, 86 (11) : 3268 - 3277
  • [49] State-of-the-Art Review: Neurosyphilis
    Hamill, Matthew M.
    Ghanem, Khalil G.
    Tuddenham, Susan
    CLINICAL INFECTIOUS DISEASES, 2024, 78 (05) : e57 - e68
  • [50] VIDEOARTHROSCOPY - REVIEW AND STATE-OF-THE-ART
    WHELAN, JM
    JACKSON, DW
    ARTHROSCOPY, 1992, 8 (03) : 311 - 319