A Unified-Model via Block Coordinate Descent for Learning the Importance of Filter

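Authors
Li, Qinghua [1]
Zhang, Xue [1]
Li, Cuiping [1]
Chen, Hong [1]
Affiliations
[1] Renmin University of China, School of Information, Beijing, People's Republic of China
Funding
Beijing Natural Science Foundation; National Natural Science Foundation of China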
Keywords
Deep Learning; Pruning Models; Accelerating Deep CNNs
DOI
10.1145/3460426.3463627
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Deep Convolutional Neural Networks (CNNs) are increasingly used in multimedia retrieval, and accelerating deep CNNs has recently attracted growing research attention. Among the approaches proposed in the literature, filter pruning is regarded as a promising solution because it yields significant speedup and reduces the memory footprint of both the network model and the intermediate feature maps. Many works have been proposed to find unimportant filters and then prune them to accelerate deep CNNs. However, they mainly rely on heuristics to evaluate filter importance, such as statistical information about the filters (e.g., pruning filters with a small ℓ2-norm), which may not be optimal. In this paper, we propose a novel filter pruning method, namely A Unified-Model via Block Coordinate Descent for Learning the Importance of Filter (U-BCD). In U-BCD, the importance of each filter is learned through optimization: the filter parameters and the filter importance are learned simultaneously by block coordinate descent. The effectiveness of U-BCD is validated on two image classification benchmarks. Notably, on CIFAR-10, U-BCD reduces FLOPs by more than 57% on ResNet-110 with a 0.08% relative accuracy improvement, and it also achieves state-of-the-art results on ILSVRC-2012.
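For orientation, the heuristic baseline that the abstract contrasts against (pruning filters with a small ℓ2-norm) can be sketched as follows. This is not U-BCD itself. It is a minimal NumPy illustration in which the function names, the `prune_ratio` parameter, and the random toy weights are all assumptions made for this example.

```python
import numpy as np

def l2_filter_ranking(weights):
    """Rank convolution filters by l2-norm, smallest first.

    weights: array of shape (out_channels, in_channels, kH, kW).
    Returns (indices sorted by ascending norm, per-filter norms).
    """
    flat = weights.reshape(weights.shape[0], -1)   # one row per filter
    norms = np.linalg.norm(flat, axis=1)           # l2-norm of each filter
    return np.argsort(norms), norms

def prune_smallest(weights, prune_ratio):
    """Drop the given fraction of filters with the smallest l2-norm.

    prune_ratio is a hypothetical knob for this sketch, not a value
    taken from the paper.
    """
    order, _ = l2_filter_ranking(weights)
    n_prune = int(weights.shape[0] * prune_ratio)
    keep = np.sort(order[n_prune:])                # keep original filter order
    return weights[keep], keep

# Toy layer: 8 filters of shape 3x3x3, two of them scaled to be tiny.
rng = np.random.default_rng(0)
w = rng.normal(size=(8, 3, 3, 3))
w[2] *= 0.01
w[5] *= 0.02

pruned, kept = prune_smallest(w, prune_ratio=0.25)
print(pruned.shape, kept)
```

In this toy layer the two down-scaled filters (indices 2 and 5) are the ones removed. The abstract's point is that such a fixed statistic may not reflect true importance, which is why U-BCD learns importance jointly with the filter parameters instead.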
Pages: 192-200
Page count: 9