Accelerating Convolutional Neural Networks with Dominant Convolutional Kernel and Knowledge Pre-regression

Cited by: 13

Authors:

Wang, Zhenyang [1]

Deng, Zhidong [1]

Wang, Shiyao [1]

Affiliation:

[1] Tsinghua Univ, Dept Comp Sci, Tsinghua Natl Lab Informat Sci & Technol, State Key Lab Intelligent Technol & Syst, Beijing 100084, Peoples R China

Source:

COMPUTER VISION - ECCV 2016, PT VIII | 2016 / Vol. 9912

Keywords:

Dominant convolutional kernel; Knowledge pre-regression; Model compression; Knowledge distilling;

DOI:

10.1007/978-3-319-46484-8_32

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

To reduce the test-time cost of deep convolutional neural networks (CNNs), we propose a model compression method that combines a novel dominant convolutional kernel (DK) with a new training method called knowledge pre-regression (KP). In the combined model, DK²PNet, DK performs a low-rank decomposition of the convolutional kernels, while KP transfers knowledge from the intermediate hidden layers of a larger teacher network to its compressed student network using a cross-entropy loss function instead of the Euclidean distance used previously. Experiments on the CIFAR-10, CIFAR-100, MNIST, and SVHN benchmarks show that, compared with the latest results, DK²PNet achieves accuracy close to the state of the art while requiring dramatically fewer model parameters.
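The two ideas named in the abstract can be illustrated with a small, generic sketch. It is not the paper's DK or KP formulation: the rank-`rank` factorization of the flattened weight matrix and the softmax-based cross-entropy below are assumptions chosen for illustration, and all function names are hypothetical.

```python
import math


def conv_params(c_in, c_out, k):
    """Number of weights in a standard k x k convolution layer (bias ignored)."""
    return c_in * c_out * k * k


def low_rank_params(c_in, c_out, k, rank):
    """Weights after factoring the (c_out) x (c_in*k*k) weight matrix
    into a (c_out x rank) matrix times a (rank x c_in*k*k) matrix.
    Generic low-rank scheme, assumed here for illustration only."""
    return rank * (c_out + c_in * k * k)


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(teacher_logits, student_logits):
    """Cross-entropy between the teacher's and the student's softmax outputs."""
    p = softmax(teacher_logits)
    q = softmax(student_logits)
    return -sum(pi * math.log(qi) for pi, qi in zip(p, q))


def squared_euclidean(a, b):
    """Squared Euclidean distance, the kind of loss the abstract contrasts with."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


if __name__ == "__main__":
    # Parameter counts for a hypothetical 64 -> 128 channel, 3 x 3 layer.
    print(conv_params(64, 128, 3), low_rank_params(64, 128, 3, rank=16))

    # Hypothetical teacher and student activations for one sample.
    teacher = [2.0, 1.0, 0.1]
    student = [0.1, 1.0, 2.0]
    print(cross_entropy(teacher, student), squared_euclidean(teacher, student))
```

For a fixed teacher, the cross-entropy is smallest when the student's softmax output matches the teacher's, which is why it can serve as a matching loss in place of a Euclidean distance.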

Pages: 533-548

Page count: 16
