Pruning Deep Neural Networks with l0-constrained Optimization

被引：1

作者：

Phan, Dzung T. ^{[1
]}

Nguyen, Lam M. ^{[1
]}

Nguyen, Nam H. ^{[1
]}

Kalagnanam, Jayant R. ^{[1
]}

机构：

[1] IBM Res, Thomas J Watson Res Ctr, Yorktown Hts, NY 10598 USA

来源：

20TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2020) | 2020年

关键词：

D O I：

10.1109/ICDM50108.2020.00152

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep neural networks (DNNs) give state-of-the-art accuracy in many tasks, but they can require large amounts of memory storage, energy consumption, and long inference times. Modern DNNs can have hundreds of million parameters, which make it difficult for DNNs to be deployed in some applications with low-resource environments. Pruning redundant connections without sacrificing accuracy is one of popular approaches to overcome these limitations. We propose two l(0)-constrained optimization models for pruning deep neural networks layer-by-layer. The first model is devoted to a general activation function, while the second one is specifically for a ReLU. We introduce an efficient cutting plane algorithm to solve the latter to optimality. Our experiments show that the proposed approach achieves competitive compression rates over several state-of-the-art baseline methods.

引用

页码：1214 / 1219

页数：6

共 50 条

[21] Constrained Optimization Based Low-Rank Approximation of Deep Neural Networks
Li, Chong
Shi, C. J. Richard
COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 746 - 761
[22] Anonymous Model Pruning for Compressing Deep Neural Networks
Zhang, Lechun
Chen, Guangyao
Shi, Yemin
Zhang, Quan
Tan, Mingkui
Wang, Yaowei
Tian, Yonghong
Huang, Tiejun
THIRD INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2020), 2020, : 161 - 164
[23] A New Pruning Method to Train Deep Neural Networks
Guo, Haonan
Ren, Xudie
Li, Shenghong
COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, 2018, 423 : 767 - 775
[24] Task dependent deep LDA pruning of neural networks
Tian, Qing
Arbel, Tal
Clark, James J.
COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 203
[25] Trained Rank Pruning for Efficient Deep Neural Networks
Xu, Yuhui
Li, Yuxi
Zhang, Shuai
Wen, Wei
Wang, Botao
Dai, Wenrui
Qi, Yingyong
Chen, Yiran
Lin, Weiyao
Xiong, Hongkai
FIFTH WORKSHOP ON ENERGY EFFICIENT MACHINE LEARNING AND COGNITIVE COMPUTING - NEURIPS EDITION (EMC2-NIPS 2019), 2019, : 14 - 17
[26] CUP: Cluster Pruning for Compressing Deep Neural Networks
Duggal, Rahul
Xiao, Cao
Vuduc, Richard
Duen Horng Chau
Sun, Jimeng
2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 5102 - 5106
[27] Pruning Deep Neural Networks by Optimal Brain Damage
Liu, Chao
Zhang, Zhiyong
Wang, Dong
15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1092 - 1095
[28] Structured Pruning for Deep Convolutional Neural Networks: A Survey
He, Yang
Xiao, Lingao
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (05) : 2900 - 2919
[29] Class-dependent Pruning of Deep Neural Networks
Entezari, Rahim
Saukh, Olga
2020 IEEE SECOND WORKSHOP ON MACHINE LEARNING ON EDGE IN SENSOR SYSTEMS (SENSYS-ML 2020), 2020, : 13 - 18
[30] Holistic Filter Pruning for Efficient Deep Neural Networks
Enderich, Lukas
Timm, Fabian
Burgard, Wolfram
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 2595 - 2604

← 1 2 3 4 5 →