A Review of State-of-the-art Mixed-Precision Neural Network Frameworks

被引:0
|
作者
Rakka, Mariam [1 ]
Fouda, Mohammed E. [2 ]
Khargonekar, Pramod [1 ]
Kurdahi, Fadi [1 ]
机构
[1] Univ Calif Irvine, Ctr Embedded & Cyber Phys Syst, Irvine, CA 92697 USA
[2] Rain Neuromorph Inc, San Francisco, CA 94110 USA
关键词
Deep neural networks; mixed-precision neural networks; edge inference; quantization; computational complexity; ALGORITHMS;
D O I
10.1109/TPAMI.2024.3394390
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mixed-precision Deep Neural Networks (DNNs) provide an efficient solution for hardware deployment, especially under resource constraints, while maintaining model accuracy. Identifying the ideal bit precision for each layer, however, remains a challenge given the vast array of models, datasets, and quantization schemes, leading to an expansive search space. Recent literature has addressed this challenge, resulting in several promising frameworks. This paper offers a comprehensive overview of the standard quantization classifications prevalent in existing studies. A detailed survey of current mixed-precision frameworks is provided, with an in-depth comparative analysis highlighting their respective merits and limitations. The paper concludes with insights into potential avenues for future research in this domain.
引用
收藏
页码:7793 / 7812
页数:20
相关论文
共 50 条
  • [21] Mixed-Precision Network Quantization for Infrared Small Target Segmentation
    Li, Boyang
    Wang, Longguang
    Wang, Yingqian
    Wu, Tianhao
    Lin, Zaiping
    Li, Miao
    An, Wei
    Guo, Yulan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 12
  • [22] State-of-the-art review
    Langhoff-Roos, Jens
    ACTA OBSTETRICIA ET GYNECOLOGICA SCANDINAVICA, 2016, 95 (09) : 963 - 964
  • [23] STATE-OF-THE-ART REVIEW
    Filipiak, Krzysztof J.
    KARDIOLOGIA POLSKA, 2013, 71 (05)
  • [24] DeepBurning-MixQ: An Open Source Mixed-Precision Neural Network Accelerator Design Framework for FPGAs
    Luo, Erjing
    Huang, Haitong
    Liu, Cheng
    Li, Guoyu
    Yang, Bing
    Wang, Ying
    Li, Huawei
    Li, Xiaowei
    2023 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN, ICCAD, 2023,
  • [25] AutoMPQ: Automatic Mixed-Precision Neural Network Search via Few-Shot Quantization Adapter
    Xu, Ke
    Shao, Xiangyang
    Tian, Ye
    Yang, Shangshang
    Zhang, Xingyi
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, : 1 - 13
  • [26] State-of-the-Art Review on Mixed Reality Applications in the AECO Industry
    Cheng, Jack C. P.
    Chen, Keyu
    Chen, Weiwei
    JOURNAL OF CONSTRUCTION ENGINEERING AND MANAGEMENT, 2020, 146 (02)
  • [27] CMQ: Crossbar-Aware Neural Network Mixed-Precision Quantization via Differentiable Architecture Search
    Peng, Jie
    Liu, Haijun
    Zhao, Zhongjin
    Li, Zhiwei
    Liu, Sen
    Li, Qingjiang
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 41 (11) : 4124 - 4133
  • [28] Deep Neural Network-Based Intrusion Detection in Internet of Things: A State-of-the-Art Review
    Li, Zhiqi
    Fang, Weidong
    Zhu, Chunsheng
    Chen, Wentao
    Gao, Zhiwei
    Jiang, Xinhang
    Zhang, Wuxiong
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT III, ICIC 2024, 2024, 14864 : 13 - 23
  • [29] A Review of State-of-the-Art on Enabling Additive Manufacturing Processes for Precision Medicine
    Awad, Atheer
    Goyanes, Alvaro
    Basit, Abdul W. W.
    Zidan, Ahmed S. S.
    Xu, Changxue
    Li, Wei
    Narayan, Roger J. J.
    Chen, Roland K. K.
    JOURNAL OF MANUFACTURING SCIENCE AND ENGINEERING-TRANSACTIONS OF THE ASME, 2023, 145 (01):
  • [30] Advances in precision micro/nano-electroforming: a state-of-the-art review
    Zhang, Honggang
    Zhang, Nan
    Gilchrist, Michael
    Fang, Fengzhou
    JOURNAL OF MICROMECHANICS AND MICROENGINEERING, 2020, 30 (10)