A Unified-Model via Block Coordinate Descent for Learning the Importance of Filter

被引:0
|
作者
Li, Qinghua [1 ]
Zhang, Xue [1 ]
Li, Cuiping [1 ]
Chen, Hong [1 ]
机构
[1] Renmin Univ China, Informat Sch, Beijing, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Deep Learning; Pruning Models; Accelerating Deep CNNs;
D O I
10.1145/3460426.3463627
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep Convolutional Neural Networks (CNNs) are increasingly used in multimedia retrieval, and accelerating Deep CNNs has recently received an ever-increasing research focus. Among various approaches proposed in the literature, filter pruning has been regarded as a promising solution, which is due to its advantage in significant speedup and memory reduction of both network model and intermediate feature maps. Many works have been proposed to find unimportant filters, and then prune it for accelerating Deep CNNs. However, they mainly focus on using heuristic methods to evaluate the importance of filters, such as the statistical information of filters (e.g., prune filter with small l(2)-norm), which may be not perfect. In this paper, we propose a novel filter pruning method, namely A Unified-Model via Block Coordinate Descent for Learning the Importance of Filter (U-BCD). The importance of the filters in our U-BCD is learned by optimizing method. We can simultaneously learn the filter parameters and the importance of filters by block coordinate descent method. When applied to two image classification benchmarks, the effectiveness of our U-BCD is validated. Notably, on CIFAR-10, our U-BCD reduces more than 57% FLOPs on ResNet-110 with even 0.08% relative accuracy improvement, and also achieve state-of-the-art results on ILSVRC-2012.
引用
收藏
页码:192 / 200
页数:9
相关论文
共 40 条
  • [21] Parallel Randomized Block Coordinate Descent for Neural Probabilistic Language Model with High-Dimensional Output Targets
    Liu, Xin
    Yan, Junchi
    Wang, Xiangfeng
    Zha, Hongyuan
    PATTERN RECOGNITION (CCPR 2016), PT II, 2016, 663 : 334 - 348
  • [22] Learning Model-Based Sparsity via Projected Gradient Descent
    Bahmani, Sohail
    Boufounos, Petros T.
    Raj, Bhiksha
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2016, 62 (04) : 2092 - 2099
  • [23] Shaped pattern synthesis for hybrid analog-digital arrays via manifold optimization-enabled block coordinate descent
    Li, Hongtao
    Ding, Zhoupeng
    Chen, Shengyao
    Feng, Qi
    Ran, Longyao
    Liu, Zhong
    SIGNAL PROCESSING, 2024, 228
  • [24] Economic Decision Model and Algorithm for Large Machine Tamping Maintenance of Ballasted Track Based on Block Coordinate Descent Method
    Qu J.
    Guo Z.
    Yang F.
    Xu F.
    Zhongguo Tiedao Kexue/China Railway Science, 2023, 44 (02): : 32 - 41
  • [25] Efficient learning of decision-making models: A penalty block coordinate descent algorithm for data-driven inverse optimization
    Gupta, Rishabh
    Zhang, Qi
    COMPUTERS & CHEMICAL ENGINEERING, 2023, 170
  • [26] Unified Neural Topic Model via Contrastive Learning and Term Weighting
    Han, Sungwon
    Shin, Mingi
    Park, Sungkyu
    Jung, Changwook
    Cha, Meeyoung
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1802 - 1817
  • [27] Unified model-free interaction screening via CV-entropy filter
    Xiong, Wei
    Chen, Yaxian
    Ma, Shuangge
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2023, 180
  • [28] Active-set based block coordinate descent algorithm in group LASSO for self-exciting threshold autoregressive model
    Nasir, Muhammad Jaffri Mohd
    Khan, Ramzan Nazim
    Nair, Gopalan
    Nur, Darfiana
    STATISTICAL PAPERS, 2024, 65 (05) : 2973 - 3006
  • [29] On the Importance of Feedback for Categorization: Revisiting Category Learning Experiments Using an Adaptive Filter Model
    Marchant, Nicolas
    Chaigneau, Sergio E.
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-ANIMAL LEARNING AND COGNITION, 2022, 48 (04) : 295 - 306
  • [30] Knowledge graph extension with a pre-trained language model via unified learning method
    Choi, Bonggeun
    Ko, Youngjoong
    KNOWLEDGE-BASED SYSTEMS, 2023, 262