Accelerating Convolutional Neural Networks with Dominant Convolutional Kernel and Knowledge Pre-regression

被引:13
|
作者
Wang, Zhenyang [1 ]
Deng, Zhidong [1 ]
Wang, Shiyao [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci, Tsinghua Natl Lab Informat Sci & Technol, State Key Lab Intelligent Technol & Syst, Beijing 100084, Peoples R China
来源
关键词
Dominant convolutional kernel; Knowledge pre-regression; Model compression; Knowledge distilling;
D O I
10.1007/978-3-319-46484-8_32
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Aiming at accelerating the test time of deep convolutional neural networks (CNNs), we propose a model compression method that contains a novel dominant kernel (DK) and a new training method called knowledge pre-regression (KP). In the combined model DK(2)PNet, DK is presented to significantly accomplish a low-rank decomposition of convolutional kernels, while KP is employed to transfer knowledge of intermediate hidden layers from a larger teacher network to its compressed student network on the basis of a cross entropy loss function instead of previous Euclidean distance. Compared to the latest results, the experimental results achieved on CIFAR-10, CIFAR-100, MNIST, and SVHN benchmarks show that our DK(2)PNet method has the best performance in the light of being close to the state of the art accuracy and requiring dramatically fewer number of model parameters.
引用
收藏
页码:533 / 548
页数:16
相关论文
共 50 条
  • [31] Accelerating Convolutional Neural Networks by Removing Interspatial and Interkernel Redundancies
    Zeng, Linghua
    Tian, Xinmei
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (02) : 452 - 464
  • [32] Soft Filter Pruning for Accelerating Deep Convolutional Neural Networks
    He, Yang
    Kang, Guoliang
    Dong, Xuanyi
    Fu, Yanwei
    Yang, Yi
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 2234 - 2240
  • [33] Accelerating Sparse Convolutional Neural Networks with Systolic Arrays on FPGA
    Nehete, Hemkant
    Verma, Gaurav
    Yadav, Shailendra
    Kaushik, Brajesh Kumar
    APPLICATIONS OF MACHINE LEARNING 2023, 2023, 12675
  • [34] Convolutional Neural Networks Optimized by Logistic Regression Model
    Yang, Bo
    Zhao, Zuopeng
    Xu, Xinzheng
    INTELLIGENT INFORMATION PROCESSING VIII, 2016, 486 : 91 - 96
  • [35] Kernel pooling feature representation of pre-trained convolutional neural networks for leaf recognition
    Feng, Shu
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (03) : 4255 - 4282
  • [36] Kernel pooling feature representation of pre-trained convolutional neural networks for leaf recognition
    Shu Feng
    Multimedia Tools and Applications, 2022, 81 : 4255 - 4282
  • [37] Diagonal-kernel convolutional neural networks for image classification
    Li, Guoqing
    Shen, Xuzhao
    Li, Jiaojie
    Wang, Jiuyang
    DIGITAL SIGNAL PROCESSING, 2021, 108
  • [38] Diagonal-kernel convolutional neural networks for image classification
    Li, Guoqing
    Shen, Xuzhao
    Li, Jiaojie
    Wang, Jiuyang
    Digital Signal Processing: A Review Journal, 2021, 108
  • [39] DK-CNNs: Dynamic kernel convolutional neural networks
    Liu, Jialin
    Chao, Fei
    Lin, Chih-Min
    Zhou, Changle
    Shang, Changjing
    NEUROCOMPUTING, 2021, 422 : 95 - 108
  • [40] Accelerating Deep Convolutional Neural on GPGPU
    Zurek, Dominik
    Pietron, Marcin
    Wiatr, Kazimierz
    INTELLIGENT COMPUTING, VOL 2, 2021, 284 : 712 - 724