Filter Pruning via Geometric Median for Deep Convolutional Neural Networks Acceleration

Cited by: 792
Authors
He, Yang [1]
Liu, Ping [1, 2]
Wang, Ziwei [3]
Hu, Zhilan [4]
Yang, Yi [1, 5]
Affiliations
[1] Univ Technol Sydney, CAI, Sydney, NSW, Australia
[2] JD Com, Beijing, Peoples R China
[3] CETC, Informat Sci Acad, Beijing, Peoples R China
[4] Huawei, Shenzhen, Guangdong, Peoples R China
[5] Baidu Res, Beijing, Peoples R China
DOI
10.1109/CVPR.2019.00447
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Previous works utilized the "smaller-norm-less-important" criterion to prune filters with smaller norm values in a convolutional neural network. In this paper, we analyze this norm-based criterion and point out that its effectiveness depends on two requirements that are not always met: (1) the norm deviation of the filters should be large; (2) the minimum norm of the filters should be small. To solve this problem, we propose a novel filter pruning method, namely Filter Pruning via Geometric Median (FPGM), to compress the model regardless of those two requirements. Unlike previous methods, FPGM compresses CNN models by pruning filters with redundancy, rather than those with "relatively less" importance. When applied to two image classification benchmarks, our method validates its usefulness and strengths. Notably, on CIFAR-10, FPGM reduces FLOPs by more than 52% on ResNet-110 with even a 2.69% relative accuracy improvement. Moreover, on ILSVRC-2012, FPGM reduces FLOPs by more than 42% on ResNet-101 without a top-5 accuracy drop, which advances the state of the art. Code is publicly available on GitHub: https://github.com/he-y/filter-pruning-geometric-median
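The contrast the abstract draws between the norm-based criterion and the redundancy-based one can be illustrated with a minimal NumPy sketch. This is not the authors' code: the function names are hypothetical, and ranking filters by their summed Euclidean distance to all other filters in the layer is an assumed stand-in for "closest to the geometric median", since the abstract does not spell out the exact computation.

```python
import numpy as np

def select_filters_near_geometric_median(weights, num_prune):
    """Return indices of the filters most replaceable by the others.

    weights: array of shape (num_filters, in_channels, k, k).
    Assumption: a filter whose summed distance to all other filters is
    smallest lies closest to the layer's geometric median, so it is
    treated as redundant.
    """
    flat = weights.reshape(weights.shape[0], -1)
    # Pairwise Euclidean distances via broadcasting: (N, 1, D) - (1, N, D).
    diffs = flat[:, None, :] - flat[None, :, :]
    dist = np.sqrt((diffs ** 2).sum(axis=-1))
    total = dist.sum(axis=1)
    return np.argsort(total)[:num_prune]

def select_filters_smallest_norm(weights, num_prune):
    """Baseline "smaller-norm-less-important" criterion, for comparison."""
    flat = weights.reshape(weights.shape[0], -1)
    norms = np.linalg.norm(flat, axis=1)
    return np.argsort(norms)[:num_prune]

# Toy layer: five 1x1x1 filters with values 1, 4, 5, 6, 10.
toy = np.array([1.0, 4.0, 5.0, 6.0, 10.0]).reshape(5, 1, 1, 1)
print(select_filters_near_geometric_median(toy, 1).tolist())  # → [2]
print(select_filters_smallest_norm(toy, 1).tolist())          # → [0]
```

On the toy layer the two criteria disagree: the norm-based rule drops the filter with value 1, while the distance-based rule drops the filter with value 5, which sits in the middle of the others. The pairwise computation uses O(N² · D) memory, which is acceptable for a sketch but may need chunking for wide layers.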
Pages: 4335-4344
Number of pages: 10
Related papers
50 items in total
  • [21] Activation Pruning of Deep Convolutional Neural Networks
    Ardakani, Arash
    Condo, Carlo
    Gross, Warren J.
    2017 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2017), 2017, : 1325 - 1329
  • [22] Filter Pruning using Hierarchical Group Sparse Regularization for Deep Convolutional Neural Networks
    Mitsuno, Kakeru
    Kurita, Takio
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 1089 - 1095
  • [23] Batch-Normalization-based Soft Filter Pruning for Deep Convolutional Neural Networks
    Xu, Xiaozhou
    Chen, Qiming
    Xie, Lei
    Su, Hongye
    16TH IEEE INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2020), 2020, : 951 - 956
  • [24] Gate Decorator: Global Filter Pruning Method for Accelerating Deep Convolutional Neural Networks
    You, Zhonghui
    Yan, Kun
    Ye, Jinmian
    Ma, Meng
    Wang, Ping
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [25] Layer Pruning via Fusible Residual Convolutional Block for Deep Neural Networks
    Xu P.
    Cao J.
    Sun W.
    Li P.
    Wang Y.
    Zhang X.
    Beijing Daxue Xuebao (Ziran Kexue Ban)/Acta Scientiarum Naturalium Universitatis Pekinensis, 2022, 58 (05): : 801 - 807
  • [26] Structured Pruning for Deep Convolutional Neural Networks via Adaptive Sparsity Regularization
    Shao, Tuanjie
    Shin, Dongkun
    2022 IEEE 46TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2022), 2022, : 982 - 987
  • [27] Empirical evaluation of filter pruning methods for acceleration of convolutional neural network
    Dheeraj Kumar
    Mayuri A. Mehta
    Vivek C. Joshi
    Rachana S. Oza
    Ketan Kotecha
    Jerry Chun-Wei Lin
    Multimedia Tools and Applications, 2024, 83 : 54699 - 54727
  • [28] Empirical evaluation of filter pruning methods for acceleration of convolutional neural network
    Kumar, Dheeraj
    Mehta, Mayuri A.
    Joshi, Vivek C.
    Oza, Rachana S.
    Kotecha, Ketan
    Lin, Jerry Chun-Wei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (18) : 54699 - 54727
  • [29] Structured Pruning for Deep Convolutional Neural Networks: A Survey
    He, Yang
    Xiao, Lingao
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (05) : 2900 - 2919
  • [30] A Filter Rank Based Pruning Method for Convolutional Neural Networks
    Liu, Hao
    Guan, Zhenyu
    Lei, Peng
    2021 IEEE 20TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2021), 2021, : 1318 - 1322