PSQ: An Automatic Search Framework for Data-Free Quantization on PIM-based Architecture

被引:1
|
作者
Liu, Fangxin [1 ,2 ]
Yang, Ning [1 ,2 ]
Jiang, Li [1 ,2 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] Shanghai Qi Zhi Inst, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/ICCD58817.2023.00084
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Crossbar-based Process-In-Memory (PIM) architecture has been considered as a promising solution for Deep Neural Networks (DNNs) acceleration. Due to the ever increasing model size and computational budget of DNNs, model compression is a critical step for the deployment of DNNs. However, when deploying DNNs in PIM architectures, fine-grained quantization on DNN weight matrices is not easy due to the inflexible data path inside the crossbar. To this end, in this paper, we study the feasibility and efficiency of a novel fine-grained quantization scheme called PSQ for PIM-based design. The scheme tightly combines the search principle of quantization and the PIM architecture to provide smooth hardware-friendly quantization. We leverage the weight locality and the variety of weight distributions in different blocks to facilitate the fine-grained quantization process. Meanwhile, we propose a lightweight search framework to adaptively allocate the quantization parameters (e.g., scale, bitwidth, etc.). During the search process, suitable quantization parameters are assigned directly to each fine-grained block, keeping the weight distributions before and after quantization as close as possible, thus minimizing the quantization errors. Our evaluation shows that the proposed PSQ achieves 3.5x reduction in occupied crossbars while the accuracy loss is negligible. What's more, PSQ can perform such a process in just a few seconds on a single CPU, without model retraining and expensive computation.
引用
收藏
页码:507 / 514
页数:8
相关论文
共 50 条
  • [1] LerGAN: A Zero-free, Low Data Movement and PIM-based GAN Architecture
    Mao, Haiyu
    Song, Mingcong
    Li, Tao
    Dai, Yuting
    Shu, Jiwu
    2018 51ST ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO), 2018, : 669 - 681
  • [2] Introspection in a massively parallel PIM-based architecture
    Zima, HP
    PARALLEL COMPUTING: SOFTWARE TECHNOLOGY, ALGORITHMS, ARCHITECTURES AND APPLICATIONS, 2004, 13 : 441 - 448
  • [3] Adaptive Data-Free Quantization
    Qian, Biao
    Wang, Yang
    Hong, Richang
    Wang, Meng
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 7960 - 7968
  • [4] Dual-discriminator adversarial framework for data-free quantization
    Li, Zhikai
    Ma, Liping
    Long, Xianlei
    Xiao, Junrui
    Gu, Qingyi
    NEUROCOMPUTING, 2022, 511 : 67 - 77
  • [5] AutoReCon: Neural Architecture Search-based Reconstruction for Data-free Compression
    Zhu, Baozhou
    Hofstee, Peter
    Peltenburg, Johan
    Lee, Jinho
    Alars, Zaid
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 3470 - 3476
  • [6] Data-Free Neural Architecture Search via Recursive Label Calibration
    Liu, Zechun
    Shen, Zhiqiang
    Long, Yun
    Xing, Eric
    Cheng, Kwang-Ting
    Leichner, Chas
    COMPUTER VISION, ECCV 2022, PT XXIV, 2022, 13684 : 391 - 406
  • [7] LrGAN: A Compact and Energy Efficient PIM-Based Architecture for GAN Training
    Mao, Haiyu
    Shu, Jiwu
    Song, Mingcong
    Li, Tao
    IEEE TRANSACTIONS ON COMPUTERS, 2021, 70 (09) : 1427 - 1442
  • [8] Data-Free Network Quantization With Adversarial Knowledge Distillation
    Choi, Yoojin
    Choi, Jihwan
    El-Khamy, Mostafa
    Lee, Jungwon
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 3047 - 3057
  • [9] META-BNS FOR ADVERSARIAL DATA-FREE QUANTIZATION
    Fu, Siming
    Wang, Hualiang
    Cao, Yuchen
    Hu, Haoji
    Peng, Bo
    Tan, Wenming
    Ye, Tingqun
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 4038 - 4042
  • [10] Diversifying Sample Generation for Accurate Data-Free Quantization
    Zhang, Xiangguo
    Qin, Haotong
    Ding, Yifu
    Gong, Ruihao
    Yan, Qinghua
    Tao, Renshuai
    Li, Yuhang
    Yu, Fengwei
    Liu, Xianglong
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 15653 - 15662