A Review of State-of-the-art Mixed-Precision Neural Network Frameworks

被引:0
|
作者
Rakka, Mariam [1 ]
Fouda, Mohammed E. [2 ]
Khargonekar, Pramod [1 ]
Kurdahi, Fadi [1 ]
机构
[1] Univ Calif Irvine, Ctr Embedded & Cyber Phys Syst, Irvine, CA 92697 USA
[2] Rain Neuromorph Inc, San Francisco, CA 94110 USA
关键词
Deep neural networks; mixed-precision neural networks; edge inference; quantization; computational complexity; ALGORITHMS;
D O I
10.1109/TPAMI.2024.3394390
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mixed-precision Deep Neural Networks (DNNs) provide an efficient solution for hardware deployment, especially under resource constraints, while maintaining model accuracy. Identifying the ideal bit precision for each layer, however, remains a challenge given the vast array of models, datasets, and quantization schemes, leading to an expansive search space. Recent literature has addressed this challenge, resulting in several promising frameworks. This paper offers a comprehensive overview of the standard quantization classifications prevalent in existing studies. A detailed survey of current mixed-precision frameworks is provided, with an in-depth comparative analysis highlighting their respective merits and limitations. The paper concludes with insights into potential avenues for future research in this domain.
引用
收藏
页码:7793 / 7812
页数:20
相关论文
共 50 条
  • [31] Precision Medicine in the Management of Dilated Cardiomyopathy JACC State-of-the-Art Review
    Fatkin, Diane
    Huttner, Inken G.
    Kovacic, Jason C.
    Seidman, J. G.
    Seidman, Christine E.
    JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2019, 74 (23) : 2921 - 2938
  • [32] Mixed-precision quantization-aware training for photonic neural networks
    Kirtas, Manos
    Passalis, Nikolaos
    Oikonomou, Athina
    Moralis-Pegios, Miltos
    Giamougiannis, George
    Tsakyridis, Apostolos
    Mourgias-Alexandris, George
    Pleros, Nikolaos
    Tefas, Anastasios
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (29): : 21361 - 21379
  • [33] Mixed-precision quantization-aware training for photonic neural networks
    Manos Kirtas
    Nikolaos Passalis
    Athina Oikonomou
    Miltos Moralis-Pegios
    George Giamougiannis
    Apostolos Tsakyridis
    George Mourgias-Alexandris
    Nikolaos Pleros
    Anastasios Tefas
    Neural Computing and Applications, 2023, 35 : 21361 - 21379
  • [34] A state-of-the-art review of image motion deblurring techniques in precision agriculture
    Yu, Huihui
    Li, Daoliang
    Chen, Yingyi
    HELIYON, 2023, 9 (06)
  • [35] Towards Mixed-Precision Quantization of Neural Networks via Constrained Optimization
    Chen, Weihan
    Wang, Peisong
    Cheng, Jian
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 5330 - 5339
  • [36] Mixed-precision quantization for neural networks based on error limit (Invited)
    Li Y.
    Guo Z.
    Liu K.
    Sun X.
    Hongwai yu Jiguang Gongcheng/Infrared and Laser Engineering, 2022, 51 (04):
  • [37] Mixed-precision Quantization with Dynamical Hessian Matrix for Object Detection Network
    Yang, Zerui
    Fei, Wen
    Dai, Wenrui
    Li, Chenglin
    Zou, Junni
    Xiong, Hongkai
    2021 INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2021,
  • [38] Sustainable development of process facilities: State-of-the-art review of pollution prevention frameworks
    Hossain, Khandoker A.
    Khan, Faisal I.
    Hawboldt, Kelly
    JOURNAL OF HAZARDOUS MATERIALS, 2008, 150 (01) : 4 - 20
  • [39] Molecular design of covalent organic frameworks for seawater desalination: A state-of-the-art review
    Jrad, Asmaa
    Olson, Mark A.
    Trabolsi, Ali
    CHEM, 2023, 9 (06): : 1413 - 1451
  • [40] Empowering the Vehicular Network with RIS Technology: A State-of-the-Art Review
    Naaz, Farheen
    Nauman, Ali
    Khurshaid, Tahir
    Kim, Sung-Won
    SENSORS, 2024, 24 (02)