Layer-Wise Data-Free CNN Compression

Cited by: 1
Authors
Horton, Maxwell [1]
Jin, Yanzi [1]
Farhadi, Ali [1]
Rastegari, Mohammad [1]
Affiliations
[1] Apple, Cupertino, CA 95014 USA
DOI
10.1109/ICPR56361.2022.9956237
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We present a computationally efficient method for compressing a trained neural network without using real data. We break the problem of data-free network compression into independent layer-wise compressions. We show how to efficiently generate layer-wise training data using only a pretrained network. We use this data to perform independent layer-wise compressions on the pretrained network. We also show how to precondition the network to improve the accuracy of our layer-wise compression method. We present results for layer-wise compression using quantization and pruning. When quantizing, we compress with higher accuracy than related works while using orders of magnitude less compute. When compressing MobileNetV2 and evaluating on ImageNet, our method outperforms existing methods for quantization at all bit-widths, achieving a +0.34% improvement in 8-bit quantization, and a stronger improvement at lower bit-widths (up to a +28.50% improvement at 5 bits). When pruning, we outperform baselines of a similar compute envelope, achieving 1.5 times the sparsity rate at the same accuracy. We also show how to combine our efficient method with high-compute generative methods to improve upon their results.
Pages: 2019-2026
Number of pages: 8
Related papers
50 in total
  • [31] Layer-Wise Personalized Federated Learning with Hypernetwork
    Zhu, Suxia
    Liu, Tianyu
    Sun, Guanglu
    NEURAL PROCESSING LETTERS, 2023, 55 (09) : 12273 - 12287
  • [32] Quantification and Analysis of Layer-wise and Pixel-wise Information Discarding
    Ma, Haotian
    Zhang, Hao
    Zhou, Fan
    Zhang, Yinqing
    Zhang, Quanshi
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [33] A LAYER-WISE ANALYSIS FOR FREE-VIBRATION OF THICK COMPOSITE CYLINDRICAL-SHELLS
    HUANG, KH
    DASGUPTA, A
    JOURNAL OF SOUND AND VIBRATION, 1995, 186 (02) : 207 - 222
  • [34] Layer-wise dynamic stiffness solution for free vibration analysis of laminated composite plates
    Boscolo, M.
    Banerjee, J. R.
    JOURNAL OF SOUND AND VIBRATION, 2014, 333 (01) : 200 - 227
  • [35] Analytical solution for free vibration analysis of composite plates with layer-wise displacement assumptions
    Boscolo, M.
    COMPOSITE STRUCTURES, 2013, 100 : 493 - 510
  • [36] LSMQ: A Layer-Wise Sensitivity-Based Mixed-Precision Quantization Method for Bit-Flexible CNN Accelerator
    Huang, Yimin
    Chen, Kai
    Shao, Zhuang
    Bai, Yichuan
    Huang, Yafeng
    Du, Yuan
    Du, Li
    Wang, Zhongfeng
    18TH INTERNATIONAL SOC DESIGN CONFERENCE 2021 (ISOCC 2021), 2021, : 256 - 257
  • [37] Efficient Layer-Wise N:M Sparse CNN Accelerator with Flexible SPEC: Sparse Processing Element Clusters
    Xie, Xiaoru
    Zhu, Mingyu
    Lu, Siyuan
    Wang, Zhongfeng
    MICROMACHINES, 2023, 14 (03)
  • [38] Path-Weights and Layer-Wise Relevance Propagation for Explainability of ANNs with fMRI Data
    Marques dos Santos, Jose Diogo
    Marques dos Santos, Jose Paulo
    MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE, LOD 2023, PT II, 2024, 14506 : 433 - 448
  • [39] Explaining Deep Learning Models for Tabular Data Using Layer-Wise Relevance Propagation
    Ullah, Ihsan
    Rios, Andre
    Gala, Vaibhav
    Mckeever, Susan
    APPLIED SCIENCES-BASEL, 2022, 12 (01):
  • [40] Selective Information Control and Layer-Wise Partial Collective Compression for Multi-Layered Neural Networks
    Kamimura, Ryotaro
    INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, ISDA 2021, 2022, 418 : 121 - 131