Pruning-and-distillation: One-stage joint compression framework for CNNs via clustering

被引:4
|
作者
Niu, Tao [1 ]
Teng, Yinglei [1 ]
Jin, Lei [1 ]
Zou, Panpan [1 ]
Liu, Yiding [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Filter pruning; Clustering; Knowledge distillation; Deep neural networks; NEURAL-NETWORKS;
D O I
10.1016/j.imavis.2023.104743
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Network pruning and knowledge distillation, as two effective network compression techniques, have drawn extensive attention due to their success in reducing model complexity. However, previous works regard them as two independent methods and combine them in an isolated manner rather than joint, leading to a sub-optimal optimization. In this paper, we propose a collaborative compression scheme named Pruningand-Distillation via Clustering (PDC), which integrates pruning and distillation into an end-to-end single-stage framework that takes both advantages of them. Specifically, instead of directly deleting or zeroing out unimportant filters within each layer, we reconstruct them based on clustering, which preserves the learned features as much as possible. The guidance from the teacher is integrated into the pruning process to further improve the generalization of pruned model, which alleviates the randomness caused by reconstruction to some extent. After convergence, we can equivalently remove reconstructed filters within each cluster through the proposed channel addition operation. Benefiting from such equivalence, we no longer require the time-consuming finetuning step to regain accuracy. Extensive experiments on CIFAR-10/100 and ImageNet datasets show that our method achieves the best trade-off between performance and complexity compared with other state-of-theart algorithms. For example, for ResNet-110, we achieve a 61.5% FLOPs reduction with even 0.14% top-1 accuracy increase on CIFAR-10 and remove 55.2% FLOPs with only 0.32% accuracy drop on CIFAR-100. & COPY; 2023 Elsevier B.V. All rights reserved.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] A Light-weighted One-stage Framework for Speech Enhancement
    Chen, Zhuangqi
    Zhang, Pingjian
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [32] One-Stage Multi-view Clustering with Hierarchical Attributes Extraction
    Mi, Yong
    Dai, Jian
    Ren, Zhenwen
    You, Xiaojian
    Wang, Yanlong
    COGNITIVE COMPUTATION, 2023, 15 (02) : 552 - 564
  • [33] One-Stage Multi-view Clustering with Hierarchical Attributes Extraction
    Yong Mi
    Jian Dai
    Zhenwen Ren
    Xiaojian You
    Yanlong Wang
    Cognitive Computation, 2023, 15 : 552 - 564
  • [34] One-Stage Percutaneous Treatment in a Patient with Pelvic and Vertebral Compression Fractures
    Sedat, Jacques
    Chau, Yves
    Razafidratsiva, Cesar
    Bronsard, Nicolas
    de Peretti, Fernand
    CARDIOVASCULAR AND INTERVENTIONAL RADIOLOGY, 2010, 33 (01) : 219 - 222
  • [35] One-Stage Percutaneous Treatment in a Patient with Pelvic and Vertebral Compression Fractures
    Jacques Sedat
    Yves Chau
    Cesar Razafidratsiva
    Nicolas Bronsard
    Fernand de Peretti
    CardioVascular and Interventional Radiology, 2010, 33 : 219 - 222
  • [36] One-Stage Periprosthetic Joint Infection Reimbursement-Is It Worth The Effort?
    Fehring, Keith A.
    Curtin, Brian M.
    Springer, Bryan D.
    Fehring, Thomas K.
    JOURNAL OF ARTHROPLASTY, 2019, 34 (09): : 2072 - 2074
  • [37] ABOS: an attention-based one-stage framework for person search
    Chen, Yuqi
    Han, Dezhi
    Cui, Mingming
    Wu, Zhongdai
    Chang, Chin-Chen
    EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2022, 2022 (01)
  • [38] ABOS: an attention-based one-stage framework for person search
    Yuqi Chen
    Dezhi Han
    Mingming Cui
    Zhongdai Wu
    Chin-Chen Chang
    EURASIP Journal on Wireless Communications and Networking, 2022
  • [39] Scalable one-stage multi-view subspace clustering with dictionary learning
    Guo, Wei
    Wang, Zhe
    Chi, Ziqiu
    Xu, Xinlei
    Li, Dongdong
    Wu, Songyang
    KNOWLEDGE-BASED SYSTEMS, 2023, 259
  • [40] Open-Vocabulary One-Stage Detection with Hierarchical Visual-Language Knowledge Distillation
    Mal, Zongyang
    Luo, Guan
    Gao, Jin
    Li, Liang
    Chen, Yuxin
    Wang, Shaoru
    Zhang, Congxuan
    Hu, Weiming
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 14054 - 14063