Deep Learning Model Compression With Rank Reduction in Tensor Decomposition

被引:0
|
作者
Dai, Wei [1 ,2 ]
Fan, Jicong [1 ,3 ]
Miao, Yiming [1 ,2 ]
Hwang, Kai [1 ,2 ]
机构
[1] Chinese Univ Hong Kong, Sch Data Sci, Shenzhen 518172, Peoples R China
[2] Shenzhen Inst Artificial Intelligence & Robot Soc, Shenzhen 518129, Peoples R China
[3] Shenzhen Res Inst Big Data, Shenzhen 518172, Peoples R China
基金
中国国家自然科学基金;
关键词
Tensors; Training; Matrix decomposition; Image coding; Computational modeling; Adaptation models; Deep learning; Deep learning (DL); low-rank decomposition; model compression; rank reduction (RR); NEURAL-NETWORKS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large neural network models are hard to deploy on lightweight edge devices demanding large network bandwidth. In this article, we propose a novel deep learning (DL) model compression method. Specifically, we present a dual-model training strategy with an iterative and adaptive rank reduction (RR) in tensor decomposition. Our method regularizes the DL models while preserving model accuracy. With adaptive RR, the hyperparameter search space is significantly reduced. We provide a theoretical analysis of the convergence and complexity of the proposed method. Testing our method for the LeNet, VGG, ResNet, EfficientNet, and RevCol over MNIST, CIFAR-10/100, and ImageNet datasets, our method outperforms the baseline compression methods in both model compression and accuracy preservation. The experimental results validate our theoretical findings. For the VGG-16 on CIFAR-10 dataset, our compressed model has shown a 0.88% accuracy gain with 10.41 times storage reduction and 6.29 times speedup. For the ResNet-50 on ImageNet dataset, our compressed model results in 2.36 times storage reduction and 2.17 times speedup. In federated learning (FL) applications, our scheme reduces 13.96 times the communication overhead. In summary, our compressed DL method can improve the image understanding and pattern recognition processes significantly.
引用
收藏
页码:1315 / 1328
页数:14
相关论文
共 50 条
  • [1] Deep Learning Model Compression With Rank Reduction in Tensor Decomposition
    Dai, Wei
    Fan, Jicong
    Miao, Yiming
    Hwang, Kai
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 1315 - 1328
  • [2] Ensemble of Tensor Train Decomposition and Quantization Methods for Deep Learning Model Compression
    Ademola, Olutosin Ajibola
    Eduard, Petlenkov
    Mairo, Leier
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [3] QUANTIZATION AND APPLICATION OF LOW-RANK TENSOR DECOMPOSITION BASED ON THE DEEP LEARNING MODEL
    Zhao, Jia
    3C TIC, 2023, 12 (01): : 330 - 350
  • [4] Tensor Decomposition Learning for Compression of Multidimensional Signals
    Aidini, Anastasia
    Tsagkatakis, Grigorios
    Tsakalides, Panagiotis
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2021, 15 (03) : 476 - 490
  • [5] Deep Compression with Low Rank and Sparse Integrated Decomposition
    Huang, Junhao
    Sun, Weize
    Huang, Lei
    Chen, Shaowu
    PROCEEDINGS OF 2019 IEEE 7TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2019), 2019, : 289 - 292
  • [6] TDLC: Tensor decomposition-based direct learning-compression algorithm for DNN model compression
    Liu, Weirong
    Liu, Peidong
    Shi, Changhong
    Zhang, Zhiqiang
    Li, Zhijun
    Liu, Chaorong
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (13):
  • [7] A NOVEL RANK SELECTION SCHEME IN TENSOR RING DECOMPOSITION BASED ON REINFORCEMENT LEARNING FOR DEEP NEURAL NETWORKS
    Cheng, Zhiyu
    Li, Baopu
    Fan, Yanwen
    Bao, Yingze
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 3292 - 3296
  • [8] DeepTensor: Low-Rank Tensor Decomposition With Deep Network Priors
    Saragadam, Vishwanath
    Balestriero, Randall
    Veeraraghavan, Ashok
    Baraniuk, Richard G.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 10337 - 10348
  • [9] Fast CP-compression layer: Tensor CP-decomposition to compress layers in deep learning
    Ji, Yuwang
    Wang, Qiang
    IET IMAGE PROCESSING, 2022, 16 (09) : 2535 - 2543
  • [10] Tensor decomposition to compress convolutional layers in deep learning
    Wang, Yinan
    Guo, Weihong Grace
    Yue, Xiaowei
    IISE TRANSACTIONS, 2022, 54 (05) : 481 - 495