Non-Zero Grid for Accurate 2-Bit Additive Power-of-Two CNN Quantization

被引:1
|
作者
Kim, Young Min [1 ]
Han, Kyunghyun [2 ]
Lee, Wai-Kong [3 ]
Chang, Hyung Jin [4 ]
Hwang, Seong Oun [3 ]
机构
[1] Gachon Univ, Dept IT Convergence Engn, Seongnam 13120, South Korea
[2] Hongik Univ, Dept Elect & Comp Engn, Sejong 30016, South Korea
[3] Gachon Univ, Dept Comp Engn, Seongnam 13120, South Korea
[4] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, England
基金
新加坡国家研究基金会;
关键词
Quantization (signal); Deep learning; Convolutional neural networks; Gaussian distribution; Mathematical models; Internet of Things; Computational modeling; Quantization; deep learning; convolutional neural network;
D O I
10.1109/ACCESS.2023.3259959
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Quantization is an effective technique to reduce the memory and computational complexity of CNNs. Recent advances utilize additive powers-of-two to perform non-uniform quantization, which resembles a normal distribution and shows better performance than uniform quantization. With powers-of-two quantization, the computational complexity is also largely reduced because the slow multiplication operations are replaced with lightweight shift operations. However, there are serious problems in the previously proposed grid formulation for 2-bit quantization. In particular, these powers-of-two schemes produce zero values, generating significant training error and causing low accuracy. In addition, due to improper grid formulation, they also fallback to uniform quantization when the quantization level reaches 2-bit. Due to these reasons, on large CNN like ResNet-110, these powers-of-two schemes may not even train properly. To resolve these issues, we propose a new non-zero grid formulation that enables 2-bit non-uniform quantization and allow the CNN to be trained successfully in every attempt, even for a large network. The proposed technique quantizes weight as power-of-two values and projects it close to the mean area through a simple constant product on the exponential part. This allows our quantization scheme to closely resemble a non-uniform quantization at 2-bit, enabling successful training at 2-bit quantization, which is not found in the previous work. The proposed technique achieves 70.57% accuracy on the CIFAR-100 dataset trained with ResNet-110. This result is 6.24% higher than the additive powers-of-two scheme which only achieves 64.33% accuracy. Beside achieving higher accuracy, our work also maintains the same memory and computational efficiency with the original additive powers-of-two scheme.
引用
收藏
页码:32051 / 32060
页数:10
相关论文
共 11 条
  • [1] DenseShift : Towards Accurate and Efficient Low-Bit Power-of-Two Quantization
    Li, Xinlin
    Liu, Bang
    Yang, Rui Heng
    Courville, Vanessa
    Xing, Chao
    Nia, Vahid Partovi
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 16964 - 16974
  • [2] Generalized quantization scheme for two-person non-zero sum games
    Nawaz, A
    Toor, AH
    JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 2004, 37 (47): : 11457 - 11463
  • [3] A Hardware-Friendly Low-Bit Power-of-Two Quantization Method for CNNs and Its FPGA Implementation
    Sui, Xuefu
    Lv, Qunbo
    Bai, Yang
    Zhu, Baoyu
    Zhi, Liangjie
    Yang, Yuanbo
    Tan, Zheng
    SENSORS, 2022, 22 (17)
  • [4] Covariance Matrix Recovery From One-Bit Data With Non-Zero Quantization Thresholds: Algorithm and Performance Analysis
    Xiao, Yu-Hang
    Huang, Lei
    Ramirez, David
    Qian, Cheng
    So, Hing Cheung
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2023, 71 : 4060 - 4076
  • [5] REDUCTION AND 2ND QUANTIZATION OF GENERALIZED ELECTROMAGNETIC-FIELDS FOR NON-ZERO MASS SYSTEM
    JOSHI, DC
    RAJPUT, BS
    INDIAN JOURNAL OF PURE & APPLIED PHYSICS, 1980, 18 (12) : 988 - 992
  • [6] P2 -ViT: Power-of-Two Post-Training Quantization and Acceleration for Fully Quantized Vision Transformer
    Shi, Huihong
    Cheng, Xin
    Mao, Wendong
    Wang, Zhongfeng
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2024, 32 (09) : 1704 - 1717
  • [7] Lattice simulation of two-color QCD with Nf=2 at non-zero baryon density
    Braguta, V. V.
    Kotov, A. Yu
    Nikolaev, A. A.
    Valgushev, S. N.
    15TH INTERNATIONAL CONFERENCE ON STRANGENESS IN QUARK MATTER (SQM2015), 2016, 668
  • [8] SECOND QUANTIZATION AND INTERACTION OF ELECTROMAGNETIC-FIELDS FOR NON-ZERO MASS SYSTEM IN ANGULAR-MOMENTUM BASIS .2.
    PARKASH, O
    SINGH, B
    RAJPUT, BS
    INDIAN JOURNAL OF PHYSICS AND PROCEEDINGS OF THE INDIAN ASSOCIATION FOR THE CULTIVATION OF SCIENCE, 1974, 48 (06): : 509 - 519
  • [9] 2-Bit Phase Quantization Using Mixed Polarization-Rotation/Non-Polarization- Rotation Reflection Modes for Beam-Steerable Reflectarrays
    Luyen, Hung
    Booske, John H.
    Behdad, Nader
    IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION, 2020, 68 (12) : 7937 - 7946
  • [10] MATRIX-METHOD FOR FINDING SETS OF CONTIGUOUS NON-ZERO ELEMENTS IN A TWO-DIMENSIONAL ARRAY .2.
    CAMPBELL, D
    HIGGINS, J
    PATTERN RECOGNITION, 1988, 21 (05) : 451 - 453