A Unified-Model via Block Coordinate Descent for Learning the Importance of Filter

被引:0
|
作者
Li, Qinghua [1 ]
Zhang, Xue [1 ]
Li, Cuiping [1 ]
Chen, Hong [1 ]
机构
[1] Renmin Univ China, Informat Sch, Beijing, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Deep Learning; Pruning Models; Accelerating Deep CNNs;
D O I
10.1145/3460426.3463627
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep Convolutional Neural Networks (CNNs) are increasingly used in multimedia retrieval, and accelerating Deep CNNs has recently received an ever-increasing research focus. Among various approaches proposed in the literature, filter pruning has been regarded as a promising solution, which is due to its advantage in significant speedup and memory reduction of both network model and intermediate feature maps. Many works have been proposed to find unimportant filters, and then prune it for accelerating Deep CNNs. However, they mainly focus on using heuristic methods to evaluate the importance of filters, such as the statistical information of filters (e.g., prune filter with small l(2)-norm), which may be not perfect. In this paper, we propose a novel filter pruning method, namely A Unified-Model via Block Coordinate Descent for Learning the Importance of Filter (U-BCD). The importance of the filters in our U-BCD is learned by optimizing method. We can simultaneously learn the filter parameters and the importance of filters by block coordinate descent method. When applied to two image classification benchmarks, the effectiveness of our U-BCD is validated. Notably, on CIFAR-10, our U-BCD reduces more than 57% FLOPs on ResNet-110 with even 0.08% relative accuracy improvement, and also achieve state-of-the-art results on ILSVRC-2012.
引用
收藏
页码:192 / 200
页数:9
相关论文
共 40 条
  • [31] Increasingly Packing Multiple Facial-Informatics Modules in A Unified Deep-Learning Model via Lifelong Learning
    Hung, Steven C. Y.
    Lee, Jia-Hong
    Wan, Timmy S. T.
    Chen, Chein-Hung
    Chan, Yi-Ming
    Chen, Chu-Song
    ICMR'19: PROCEEDINGS OF THE 2019 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2019, : 339 - 343
  • [32] An automatic feedback model for learning programming via block-based programming platforms
    Cakiroglu, Unal
    Mumcu, Suheda
    COMPUTER APPLICATIONS IN ENGINEERING EDUCATION, 2023, 31 (05) : 1398 - 1411
  • [33] Large-Scale Adaptive Semi-Supervised Learning via Unified Inductive and Transductive Model
    Wang, De
    Nie, Feiping
    Huang, Heng
    PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, : 482 - 491
  • [34] Feature importance: Opening a soil-transmitted helminth machine learning model via SHAP
    Matias Scavuzzo, Carlos
    Manuel Scavuzzo, Juan
    Natalia Campero, Micaela
    Anegagrie, Melaku
    Amor Aramendia, Aranzazu
    Benito, Agustin
    Periago, Victoria
    INFECTIOUS DISEASE MODELLING, 2022, 7 (01) : 262 - 276
  • [35] Continual Learning for Multilingual Neural Machine Translation via Dual Importance-based Model Division
    Liu, Junpeng
    Huang, Kaiyu
    Yu, Hao
    Li, Jiuyi
    Su, Jinsong
    Huang, Degen
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 12011 - 12027
  • [36] Learning Multiple Adverse Weather Removal via Two-stage Knowledge Learning and Multi-contrastive Regularization: Toward a Unified Model
    Chen, Wei-Ting
    Huang, Zhi-Kai
    Tsai, Cheng-Che
    Yang, Hao-Hsiang
    Ding, Jian-Jiun
    Kuo, Sy-Yen
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 17632 - 17641
  • [37] Unified HumanRobotEnvironment Interaction Control in Contact-Rich Collaborative Manipulation Tasks via Model-Based Reinforcement Learning
    Liu, Xing
    Liu, Yu
    Liu, Zhengxiong
    Huang, Panfeng
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2023, 70 (11) : 11474 - 11482
  • [38] Hybrid Localization using Model- and Learning-Based Methods: Fusion of Monte Carlo and E2E Localizations via Importance Sampling
    Akai, Naoki
    Hirayama, Takatsugu
    Murase, Hiroshi
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 6469 - 6475
  • [39] The Importance of Close Follow-Up in Patients with Early-Grade Diabetic Retinopathy: A Taiwan Population-Based Study Grading via Deep Learning Model
    Lee, Chia-Cheng
    Hsing, Shi-Chue
    Lin, Yu-Ting
    Lin, Chin
    Chen, Jiann-Torng
    Chen, Yi-Hao
    Fang, Wen-Hui
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (18)
  • [40] Integration of machine learning and process-based model outputs via ensemble Kalman filter enhanced space-time modelling of soil organic carbon in a highly human impacted area
    Xie, Enze
    Chen, Jian
    Peng, Yuxuan
    Yan, Guojing
    Zhao, Yongcun
    SOIL USE AND MANAGEMENT, 2024, 40 (04)