PSQ: An Automatic Search Framework for Data-Free Quantization on PIM-based Architecture

被引:1
|
作者
Liu, Fangxin [1 ,2 ]
Yang, Ning [1 ,2 ]
Jiang, Li [1 ,2 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] Shanghai Qi Zhi Inst, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/ICCD58817.2023.00084
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Crossbar-based Process-In-Memory (PIM) architecture has been considered as a promising solution for Deep Neural Networks (DNNs) acceleration. Due to the ever increasing model size and computational budget of DNNs, model compression is a critical step for the deployment of DNNs. However, when deploying DNNs in PIM architectures, fine-grained quantization on DNN weight matrices is not easy due to the inflexible data path inside the crossbar. To this end, in this paper, we study the feasibility and efficiency of a novel fine-grained quantization scheme called PSQ for PIM-based design. The scheme tightly combines the search principle of quantization and the PIM architecture to provide smooth hardware-friendly quantization. We leverage the weight locality and the variety of weight distributions in different blocks to facilitate the fine-grained quantization process. Meanwhile, we propose a lightweight search framework to adaptively allocate the quantization parameters (e.g., scale, bitwidth, etc.). During the search process, suitable quantization parameters are assigned directly to each fine-grained block, keeping the weight distributions before and after quantization as close as possible, thus minimizing the quantization errors. Our evaluation shows that the proposed PSQ achieves 3.5x reduction in occupied crossbars while the accuracy loss is negligible. What's more, PSQ can perform such a process in just a few seconds on a single CPU, without model retraining and expensive computation.
引用
收藏
页码:507 / 514
页数:8
相关论文
共 50 条
  • [31] SPIQ: Data-Free Per-Channel Static Input Quantization
    Yvinec, Edouard
    Dapogny, Arnaud
    Cord, Matthieu
    Bailly, Kevin
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 3858 - 3867
  • [32] Data-Free Quantization with Accurate Activation Clipping and Adaptive Batch Normalization
    He, Yefei
    Zhang, Luoming
    Wu, Weijia
    Zhou, Hong
    NEURAL PROCESSING LETTERS, 2023, 55 (08) : 10555 - 10568
  • [33] Diverse Sample Generation: Pushing the Limit of Generative Data-Free Quantization
    Qin, Haotong
    Ding, Yifu
    Zhang, Xiangguo
    Wang, Jiakai
    Liu, Xianglong
    Lu, Jiwen
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) : 11689 - 11706
  • [34] Causal-DFQ: Causality Guided Data-free Network Quantization
    Shang, Yuzhang
    Xu, Bingxin
    Liu, Gaowen
    Kompella, Ramana Rao
    Yan, Yan
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 17391 - 17400
  • [35] ACQ: Improving generative data-free quantization via attention correction
    Li, Jixing
    Guo, Xiaozhou
    Dai, Benzhe
    Gong, Guoliang
    Jin, Min
    Chen, Gang
    Mao, Wenyu
    Lu, Huaxiang
    PATTERN RECOGNITION, 2024, 152
  • [36] Data-Free Quantization with Accurate Activation Clipping and Adaptive Batch Normalization
    Yefei He
    Luoming Zhang
    Weijia Wu
    Hong Zhou
    Neural Processing Letters, 2023, 55 : 10555 - 10568
  • [37] Data-Free Low-Bit Quantization for Remote Sensing Object Detection
    Zhang, Ruiyan
    Jiang, Xiujie
    An, Junshe
    Cui, Tianshu
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [38] Unified Data-Free Compression: Pruning and Quantization without Fine-Tuning
    Bai, Shipeng
    Chen, Jun
    Shen, Xintian
    Qian, Yixuan
    Liu, Yong
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 5853 - 5862
  • [39] Data-Free Backdoor Removal Based on Channel Lipschitzness
    Zheng, Runkai
    Tang, Rongjun
    Li, Jianze
    Liu, Li
    COMPUTER VISION - ECCV 2022, PT V, 2022, 13665 : 175 - 191
  • [40] Data-Free Sketch-Based Image Retrieval
    Chaudhuri, Abhra
    Bhunia, Ayan Kumar
    Song, Yi-Zhe
    Dutta, Anjan
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 12084 - 12093