A Study of Filter Duplication for CNNs Filter Pruning

被引:0
|
作者
Ikuta, Ryosuke [1 ]
Yata, Noriko [1 ]
Manabe, Yoshitsugu [1 ]
机构
[1] Chiba Univ, 1-33 Yayoicho,Inage Ku, Chiba, Chiba 2638522, Japan
关键词
CNN; pruning; redundancy; filter duplication;
D O I
10.1117/12.3018876
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Convolutional Neural Networks (CNNs) have demonstrated great success in image recognition, but most trained models are over-parameterized, and models can be compressed with only a slight performance degradation. Pruning is one of the lightweight techniques of networks, which obtains a model with a lower computational cost of inference by removing filters selectively that do not contribute to the performance. While various methods have been proposed to identify unimportant filters, determining the number of filters to be removed at each layer without causing a significant loss of accuracy is an open problem. This paper proposes a "filter duplication" approach to reduce the accuracy degradation caused by pruning, especially in higher compression ratio ranges. Filter duplication replaces unimportant filters with critical filters in a pre-trained model based on the measured importance of each convolutional layer before pruning. In experiments using mainstream CNN models and datasets, we confirmed that filter duplication improves the accuracy of the pruned model, especially with higher compression ratios. In addition, the proposed method can reflect the structural redundancy of the network to the compression ratio of each layer, providing a more efficient compression. The results show that duplicating an appropriate number of critical filters for each layer improves the robustness of the network against pruning, and optimization of duplication methods is desirable.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Filter Pruning by Switching to Neighboring CNNs With Good Attributes
    He, Yang
    Liu, Ping
    Zhu, Linchao
    Yang, Yi
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (10) : 8044 - 8056
  • [2] FPWT: Filter pruning via wavelet transform for CNNs
    Liu, Yajun
    Fan, Kefeng
    Zhou, Wenju
    NEURAL NETWORKS, 2024, 179
  • [3] Stability Based Filter Pruning for Accelerating Deep CNNs
    Singh, Pravendra
    Kadi, Vinay Sameer Raja
    Verma, Nikhil
    Namboodiri, Vinay P.
    2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 1166 - 1174
  • [4] COMPRESSING AUDIO CNNS WITH GRAPH CENTRALITY BASED FILTER PRUNING
    King, James A.
    Singh, Arshdeep
    Plumbley, Mark D.
    2023 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, WASPAA, 2023,
  • [5] Pruning Filter in Filter
    Meng, Fanxu
    Cheng, Hao
    Li, Ke
    Luo, Huixiang
    Guo, Xiaowei
    Lu, Guangming
    Sun, Xing
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [6] Interspace Pruning: Using Adaptive Filter Representations to Improve Training of Sparse CNNs
    Wimmer, Paul
    Mehnert, Jens
    Condurache, Alexandru
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 12517 - 12527
  • [7] Compressing CNNs Using Multilevel Filter Pruning for the Edge Nodes of Multimedia Internet of Things
    Liu, Xingang
    Wu, Lishuai
    Dai, Cheng
    Chao, Han-Chieh
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (14) : 11041 - 11051
  • [8] FALF ConvNets: Fatuous auxiliary loss based filter-pruning for efficient deep CNNs
    Singh, Pravendra
    Kadi, Vinay Sameer Raja
    Namboodiri, Vinay P.
    IMAGE AND VISION COMPUTING, 2020, 93
  • [9] Filter Sketch for Network Pruning
    Lin, Mingbao
    Cao, Liujuan
    Li, Shaojie
    Ye, Qixiang
    Tian, Yonghong
    Liu, Jianzhuang
    Tian, Qi
    Ji, Rongrong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (12) : 7091 - 7100
  • [10] Partial filter duplication:: A solution for noise tolerant FIR filter
    San Julián, AM
    Palacio, FC
    2004 INTERNATIONAL CONFERENCE ON COMMUNICATION, CIRCUITS, AND SYSTEMS, VOLS 1 AND 2: VOL 1: COMMUNICATION THEORY AND SYSTEMS - VOL 2: SIGNAL PROCESSING, CIRCUITS AND SYSTEMS, 2004, : 1451 - 1455