Mixed-precision Deep Neural Network Quantization With Multiple Compression Rates

被引:0
|
作者
Wang, Xuanda [1 ]
Fei, Wen [1 ]
Dai, Wenrui [1 ]
Li, Chenglin [1 ]
Zou, Junni [1 ]
Xiong, Hongkai [1 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/DCC55655.2023.00075
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
引用
收藏
页码:371 / 371
页数:1
相关论文
共 50 条
  • [31] A Review of State-of-the-art Mixed-Precision Neural Network Frameworks
    Rakka, Mariam
    Fouda, Mohammed E.
    Khargonekar, Pramod
    Kurdahi, Fadi
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 7793 - 7812
  • [32] Dynamic Split Computing-Aware Mixed-Precision Quantization for Efficient Deep Edge Intelligence
    Nagamatsu, Naoki
    Hara-Azumi, Yuko
    2023 IEEE 22ND INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, BIGDATASE, CSE, EUC, ISCI 2023, 2024, : 2538 - 2545
  • [33] ADAPTIVE LAYERWISE QUANTIZATION FOR DEEP NEURAL NETWORK COMPRESSION
    Zhu, Xiaotian
    Zhou, Wengang
    Li, Houqiang
    2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2018,
  • [34] Quantization Aware Factorization for Deep Neural Network Compression
    Cherniuk, Daria
    Abukhovich, Stanislav
    Phan, Anh-Huy
    Oseledets, Ivan
    Cichocki, Andrzej
    Gusak, Julia
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2024, 81 : 973 - 988
  • [35] Quantization Aware Factorization for Deep Neural Network Compression
    Cherniuk, Daria
    Abukhovich, Stanislav
    Phan, Anh-Huy
    Oseledets, Ivan
    Cichocki, Andrzej
    Gusak, Julia
    Journal of Artificial Intelligence Research, 2024, 81 : 973 - 988
  • [36] Evaluating the Impact of Mixed-Precision on Fault Propagation for Deep Neural Networks on GPUs
    Dos Santos, Fernando Fernandes
    Rech, Paolo
    Kritikakou, Angeliki
    Sentieys, Olivier
    2022 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2022), 2022, : 327 - 327
  • [37] Generalizable Mixed-Precision Quantization via Attribution Rank Preservation
    Wang, Ziwei
    Xiao, Han
    Lu, Jiwen
    Zhou, Jie
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 5271 - 5280
  • [38] MetaMix: Meta-State Precision Searcher for Mixed-Precision Activation Quantization
    Kim, Han-Byul
    Lee, Joo Hyung
    Yoo, Sungjoo
    Kim, Hong-Seok
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 12, 2024, : 13132 - 13141
  • [39] Mixed-precision architecture based on computational memory for training deep neural networks
    Nandakumar, S. R.
    Le Gallo, Manuel
    Boybat, Irem
    Rajendran, Bipin
    Sebastian, Abu
    Eleftheriou, Evangelos
    2018 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2018,
  • [40] Learning Generalizable Mixed-Precision Quantization via Attribution Imitation
    Wang, Ziwei
    Xiao, Han
    Zhou, Jie
    Lu, Jiwen
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (11) : 5101 - 5123