Convolution Filter Compression via Sparse Linear Combinations of Quantized Basis

被引：0

作者：

Lan, Weichao ^{[1
]}

Cheung, Yiu-Ming ^{[1
]}

Lan, Liang ^{[2
]}

Jiang, Juyong ^{[3
]}

Hu, Zhikai ^{[1
]}

机构：

[1] Hong Kong Baptist Univ, Dept Comp Sci, Hong Kong, Peoples R China

[2] Hong Kong Baptist Univ, Dept Interact Media, Hong Kong, Peoples R China

[3] Hong Kong Univ Sci & Technol Guangzhou, Guangzhou 511458, Peoples R China

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2024年

关键词：

Convolution; Quantization (signal); Nonlinear filters; Maximum likelihood detection; Kernel; Filtering algorithms; Tensors; Filter decomposition; network compression; quantization; NETWORKS;

D O I：

10.1109/TNNLS.2024.3457943

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Convolutional neural networks (CNNs) have achieved significant performance on various real-life tasks. However, the large number of parameters in convolutional layers requires huge storage and computation resources, making it challenging to deploy CNNs on memory-constrained embedded devices. In this article, we propose a novel compression method that generates the convolution filters in each layer using a set of learnable low-dimensional quantized filter bases. The proposed method reconstructs the convolution filters by stacking the linear combinations of these filter bases. By using quantized values in weights, the compact filters can be represented using fewer bits so that the network can be highly compressed. Furthermore, we explore the sparsity of coefficients through $L_1$ -ball projection when conducting linear combination to further reduce the storage consumption and prevent overfitting. We also provide a detailed analysis of the compression performance of the proposed method. Evaluations of image classification and object detection tasks using various network structures demonstrate that the proposed method achieves a higher compression ratio with comparable accuracy compared with the existing state-of-the-art filter decomposition and network quantization methods.

引用

页数：14

共 50 条

[1] INPAINTING WITH SPARSE LINEAR COMBINATIONS OF EXEMPLARS
Wohlberg, Brendt
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 689 - 692
[2] A linear programming approach to sparse linear regression with quantized data
Cerone, V
Fosson, S. M.
Regruto, D.
2019 AMERICAN CONTROL CONFERENCE (ACC), 2019, : 2990 - 2995
[3] Functional linear regression for functional response via sparse basis selection
Han, Kyunghee
Shin, Hyejin
JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2015, 44 (03) : 376 - 389
[4] Functional linear regression for functional response via sparse basis selection
Kyunghee Han
Hyejin Shin
Journal of the Korean Statistical Society, 2015, 44 : 376 - 389
[5] APPROXIMATION BY LINEAR-COMBINATIONS OF POSITIVE CONVOLUTION INTEGRALS
VOGT, L
JOURNAL OF APPROXIMATION THEORY, 1989, 57 (02) : 178 - 201
[6] Lossy Compression via Sparse Linear Regression: Computationally Efficient Encoding and Decoding
Venkataramanan, Ramji
Sarkar, Tuhin
Tatikonda, Sekhar
2013 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY PROCEEDINGS (ISIT), 2013, : 1182 - +
[7] Lossy Compression via Sparse Linear Regression: Computationally Efficient Encoding and Decoding
Venkataramanan, Ramji
Sarkar, Tuhin
Tatikonda, Sekhar
IEEE TRANSACTIONS ON INFORMATION THEORY, 2014, 60 (06) : 3265 - 3278
[8] VIDEO SUPER-RESOLUTION VIA SPARSE COMBINATIONS OF KEY-FRAME PATCHES IN A COMPRESSION CONTEXT
Bevilacqua, Marco
Roumy, Aline
Guillemot, Christine
Morel, Marie-Line Alberi
2013 PICTURE CODING SYMPOSIUM (PCS), 2013, : 337 - 340
[9] Memory-Side Acceleration and Sparse Compression for Quantized Packed Convolutions
Weaver, Alex
Kavi, Krishna
Vasireddy, Pranathi
Mehta, Gayatri
2022 IEEE 34TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD 2022), 2022, : 81 - 90
[10] ECG compression retaining the best natural basis k-coefficients via sparse decomposition
Adamo, Alessandro
Grossi, Giuliano
Lanzarotti, Raffaella
Lin, Jianyi
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2015, 15 : 11 - 17

← 1 2 3 4 5 →