Accelerating Convolutional Neural Networks with Dominant Convolutional Kernel and Knowledge Pre-regression

Cited by: 13
Authors
Wang, Zhenyang [1 ]
Deng, Zhidong [1 ]
Wang, Shiyao [1 ]
Affiliations
[1] Tsinghua Univ, Dept Comp Sci, Tsinghua Natl Lab Informat Sci & Technol, State Key Lab Intelligent Technol & Syst, Beijing 100084, Peoples R China
Source
Keywords
Dominant convolutional kernel; Knowledge pre-regression; Model compression; Knowledge distilling
DOI
10.1007/978-3-319-46484-8_32
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
To accelerate test-time inference in deep convolutional neural networks (CNNs), we propose a model compression method that combines a novel dominant convolutional kernel (DK) with a new training method called knowledge pre-regression (KP). In the combined model, DK²PNet, DK performs a low-rank decomposition of the convolutional kernels, while KP transfers knowledge from the intermediate hidden layers of a larger teacher network to its compressed student network using a cross-entropy loss function instead of the Euclidean distance used in previous work. Experimental results on the CIFAR-10, CIFAR-100, MNIST, and SVHN benchmarks show that, compared with the latest results, DK²PNet performs best in terms of staying close to state-of-the-art accuracy while requiring dramatically fewer model parameters.
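The abstract contrasts a cross-entropy transfer loss with a Euclidean one. The sketch below illustrates only that general contrast on a single pair of output vectors; it is not the authors' KP procedure, whose details are not given in this record. The logits, the function names, and the choice to apply a softmax before the cross entropy are all assumptions made for illustration.

```python
import math

def softmax(logits):
    """Turn raw scores into a probability distribution."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(teacher_probs, student_probs, eps=1e-12):
    """H(p, q) = -sum_i p_i * log(q_i), with the teacher p as the target."""
    return -sum(p * math.log(q + eps)
                for p, q in zip(teacher_probs, student_probs))

def squared_euclidean(a, b):
    """Squared L2 distance between two vectors of equal length."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

# Hypothetical teacher and student scores for a 3-class problem.
teacher_logits = [2.0, 1.0, 0.1]
student_logits = [1.5, 1.2, 0.3]

p = softmax(teacher_logits)  # teacher distribution (soft target)
q = softmax(student_logits)  # student distribution

ce = cross_entropy(p, q)                                # cross-entropy transfer loss
l2 = squared_euclidean(teacher_logits, student_logits)  # Euclidean alternative

print(f"cross entropy: {ce:.4f}, squared Euclidean: {l2:.4f}")
```

The cross entropy compares the two vectors as probability distributions and is smallest when the student distribution matches the teacher's, whereas the squared Euclidean distance compares the raw values directly.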
Pages: 533-548
Number of pages: 16
Related papers
50 in total
  • [1] A Study on Accelerating Convolutional Neural Networks
    Lin, Hsien-I
    Cheng, Chung-Sheng
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING 2019 (ICCMSE-2019), 2019, 2186
  • [2] Kernel Pooling for Convolutional Neural Networks
    Cui, Yin
    Zhou, Feng
    Wang, Jiang
    Liu, Xiao
    Lin, Yuanqing
    Belongie, Serge
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3049 - 3058
  • [3] Kernel Graph Convolutional Neural Networks
    Nikolentzos, Giannis
    Meladianos, Polykarpos
    Tixier, Antoine Jean-Pierre
    Skianis, Konstantinos
    Vazirgiannis, Michalis
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT I, 2018, 11139 : 22 - 32
  • [4] Accelerating Convolutional Neural Networks in Frequency Domain via Kernel-sharing Approach
    Liu, Bosheng
    Liang, Hongyi
    Wu, Jigang
    Chen, Xiaoming
    Liu, Peng
    Han, Yinhe
    2023 28TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC, 2023, : 733 - 738
  • [5] Estimation of motion blur kernel parameters using regression convolutional neural networks
    Varela, Luis G.
    Boucheron, Laura E.
    Sandoval, Steven
    Voelz, David
    Siddik, Abu Bucker
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (02)
  • [6] On the regularization of convolutional kernel tensors in neural networks
    Guo, Pei-Chang
    Ye, Qiang
    LINEAR & MULTILINEAR ALGEBRA, 2022, 70 (12) : 2318 - 2330
  • [7] The Kernel Dynamics of Convolutional Neural Networks in Manifolds
    WU Wei
    JING Xiaoyuan
    DU Wencai
    Chinese Journal of Electronics, 2020, 29 (06) : 1185 - 1192
  • [8] Vector-kernel convolutional neural networks
    Ou, Jun
    Li, Yujian
    NEUROCOMPUTING, 2019, 330 : 253 - 258
  • [9] Accelerating cardiovascular model building with convolutional neural networks
    Maher, Gabriel
    Wilson, Nathan
    Marsden, Alison
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2019, 57 (10) : 2319 - 2335
  • [10] Accelerating TDECQ Assessments using Convolutional Neural Networks
    Varughese, Siddharth
    Garon, Daniel A.
    Melgar, Alirio
    Thomas, Varghese A.
    Zivny, Pavel
    Hazzard, Shane
    Ralph, Stephan E.
    2020 OPTICAL FIBER COMMUNICATIONS CONFERENCE AND EXPOSITION (OFC), 2020