Filter Pruning via Geometric Median for Deep Convolutional Neural Networks Acceleration

Cited by: 792
Authors
He, Yang [1]
Liu, Ping [1, 2]
Wang, Ziwei [3]
Hu, Zhilan [4]
Yang, Yi [1, 5]
Affiliations
[1] Univ Technol Sydney, CAI, Sydney, NSW, Australia
[2] JD Com, Beijing, Peoples R China
[3] CETC, Informat Sci Acad, Beijing, Peoples R China
[4] Huawei, Shenzhen, Guangdong, Peoples R China
[5] Baidu Res, Beijing, Peoples R China
DOI
10.1109/CVPR.2019.00447
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Previous works used the "smaller-norm-less-important" criterion to prune filters with smaller norm values in a convolutional neural network. In this paper, we analyze this norm-based criterion and point out that its effectiveness depends on two requirements that are not always met: (1) the norm deviation of the filters should be large; (2) the minimum norm of the filters should be small. To solve this problem, we propose a novel filter pruning method, namely Filter Pruning via Geometric Median (FPGM), to compress the model regardless of those two requirements. Unlike previous methods, FPGM compresses CNN models by pruning redundant filters, rather than those with "relatively less" importance. When applied to two image classification benchmarks, our method validates its usefulness and strengths. Notably, on CIFAR-10, FPGM reduces FLOPs by more than 52% on ResNet-110 while even improving relative accuracy by 2.69%. Moreover, on ILSVRC-2012, FPGM reduces FLOPs by more than 42% on ResNet-101 without a drop in top-5 accuracy, which advances the state of the art. Code is publicly available on GitHub: https://github.com/he-y/filter-pruning-geometric-median
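The contrast the abstract draws between norm-based and redundancy-based selection can be illustrated with a small sketch. The abstract does not spell out the selection rule, so this example assumes that a filter counts as "redundant" when its summed Euclidean distance to the other filters in the same layer is smallest, that is, when it lies closest to the layer's geometric median. The function names, the toy layer, and the `prune_ratio` parameter are hypothetical and are not taken from the authors' repository.

```python
import math


def select_redundant_filters(filters, prune_ratio):
    """Return indices of the filters nearest the layer's geometric median.

    Assumption (not stated in the abstract): redundancy is scored by the
    sum of Euclidean distances from one filter to every filter in the layer.
    A small total means the filter is well represented by the others.
    """
    n_prune = int(len(filters) * prune_ratio)
    totals = [sum(math.dist(f, g) for g in filters) for f in filters]
    ranked = sorted(range(len(filters)), key=lambda i: totals[i])
    return sorted(ranked[:n_prune])


def select_small_norm_filters(filters, prune_ratio):
    """Baseline: return indices of the filters with the smallest L2 norm."""
    n_prune = int(len(filters) * prune_ratio)
    norms = [math.sqrt(sum(x * x for x in f)) for f in filters]
    ranked = sorted(range(len(filters)), key=lambda i: norms[i])
    return sorted(ranked[:n_prune])


# Hypothetical toy layer: each filter is flattened to a 2-element vector.
# Filters 0-2 are near-duplicates; filter 3 is distinct but has a small norm.
layer = [
    [2.0, 2.0],
    [2.1, 1.9],
    [1.9, 2.1],
    [-0.5, 0.0],
]

redundant = select_redundant_filters(layer, prune_ratio=0.25)
small_norm = select_small_norm_filters(layer, prune_ratio=0.25)
print("geometric-median choice:", redundant)
print("norm-based choice:", small_norm)
```

On this toy layer the two criteria disagree: the distance-based rule picks filter 0, one of the three near-duplicates, while the norm-based rule picks filter 3, the only filter that differs from the rest. Real convolutional filters would be flattened 4-D weight tensors, and a practical implementation would use a tensor library rather than nested lists.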
Pages: 4335-4344
Number of pages: 10