Automatic Group-Based Structured Pruning for Deep Convolutional Networks

被引:4
|
作者
Wei, Hang [1 ]
Wang, Zulin [1 ]
Hua, Gengxin [2 ]
Sun, Jinjing [1 ]
Zhao, Yunfu [2 ]
机构
[1] Beihang Univ, Sch Elect & Informat Engn, Beijing 100000, Peoples R China
[2] Beijing Inst Control Engn, Beijing 100000, Peoples R China
基金
中国国家自然科学基金;
关键词
Pruning; convolutional neural networks; group convolution; reinforcement learning;
D O I
10.1109/ACCESS.2022.3227619
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Structured pruning methods have been used in several convolutional neural networks (CNNs). However, group-based structured pruning is a challenging task. In previous methods, the number of groups is manually determined for all layers, which is suboptimal. Moreover, which kernels should be appropriately removed? Model accuracy may be significantly reduced when a large number of kernels are deleted. To address these challenges, we propose an automatic group-based structured pruning method with reinforcement learning, named AGSPRL, which can generate pruned models with different compression rates automatically. We first develop a reinforcement learning (RL) framework to learn the pruning rate for group-based channel pruning layer by layer. Then, based on the learned kernel pruning rate, we propose an efficient group configuration algorithm to adaptively determine the number of groups for each convolution layer. Finally, we introduce a channel pruning method with an attention mechanism as a tiny auxiliary filter selector for each group to dynamically determine which part of the kernels should be selected into the group convolution and which part of the kernels should be removed. To demonstrate the efficiency of our method, we apply it to a variety of CNNs in classification and detection datasets. The experimental results show that the AGSPRL not only adaptively but also accurately configures the number of groups. The accuracy is reduced by less than 1%. Moreover, compared to other state-of-the-art methods, AGSPRL is more effective and has less accuracy loss.
引用
收藏
页码:128824 / 128834
页数:11
相关论文
共 50 条
  • [1] Structured Pruning of Deep Convolutional Neural Networks
    Anwar, Sajid
    Hwang, Kyuyeon
    Sung, Wonyong
    ACM JOURNAL ON EMERGING TECHNOLOGIES IN COMPUTING SYSTEMS, 2017, 13 (03)
  • [2] Structured Pruning for Deep Convolutional Neural Networks: A Survey
    He, Yang
    Xiao, Lingao
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (05) : 2900 - 2919
  • [3] Automatic Pruning Rate Derivation for Structured Pruning of Deep Neural Networks
    Sakai, Yasufumi
    Iwakawa, Akinori
    Tabaru, Tsuguchika
    Inoue, Atsuki
    Kawaguchi, Hiroshi
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 2561 - 2567
  • [4] Structured Pruning for Deep Convolutional Neural Networks via Adaptive Sparsity Regularization
    Shao, Tuanjie
    Shin, Dongkun
    2022 IEEE 46TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2022), 2022, : 982 - 987
  • [5] Leveraging Structured Pruning of Convolutional Neural Networks
    Tessier, Hugo
    Gripon, Vincent
    Leonardon, Mathieu
    Arzel, Matthieu
    Bertrand, David
    Hannagan, Thomas
    2022 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS), 2022, : 174 - 179
  • [6] GAP: A group-based automatic pruning algorithm via convolution kernel fusion
    Chen, Dingfu
    Lin, Kangwei
    Deng, Qingxu
    NEUROCOMPUTING, 2024, 609
  • [7] STRUCTURED PRUNING FOR GROUP REGULARIZED CONVOLUTIONAL NEURAL NETWORKS VIA DYNAMIC REGULARIZATION FACTOR
    Li, Feng
    Li, Bo
    Zhu, Meijiao
    Ma, Junchi
    Yuan, Jinlong
    JOURNAL OF INDUSTRIAL AND MANAGEMENT OPTIMIZATION, 2025, 21 (02) : 1440 - 1455
  • [8] Heuristic-based automatic pruning of deep neural networks
    Choudhary, Tejalal
    Mishra, Vipul
    Goswami, Anurag
    Sarangapani, Jagannathan
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (06): : 4889 - 4903
  • [9] Heuristic-based automatic pruning of deep neural networks
    Tejalal Choudhary
    Vipul Mishra
    Anurag Goswami
    Jagannathan Sarangapani
    Neural Computing and Applications, 2022, 34 : 4889 - 4903
  • [10] Activation Pruning of Deep Convolutional Neural Networks
    Ardakani, Arash
    Condo, Carlo
    Gross, Warren J.
    2017 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2017), 2017, : 1325 - 1329