Compressing Deep Neural Networks for Recognizing Places

被引:0
|
作者
Saha, Soham [1 ]
Varma, Girish [1 ]
Jawahar, C. V. [1 ]
机构
[1] Int Inst Informat Technol, KCIS, CVIT, Hyderabad, India
关键词
Visual Place Recognition; Model Compression; Image Retrieval;
D O I
10.1109/ACPR.2017.154
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Visual place recognition on low memory devices such as mobile phones and robotics systems is a challenging problem. The state of the art models for this task uses deep learning architectures having close to 100 million parameters which takes over 400MB of memory. This makes these models infeasible to be deployed in low memory devices and gives rise to the need of compressing them. Hence we study the effectiveness of model compression techniques like trained quantization and pruning for reducing the number of parameters on one of the best performing image retrieval models called NetVLAD. We show that a compressed network can be created by starting with a model pre-trained for the task of visual place recognition and then fine-tuning it via trained pruning and quantization. The compressed model is able to produce the same mAP as the original uncompressed network. We achieve almost 50% parameter pruning with no loss in mAP and 70% pruning with close to 2% mAP reduction, while also performing 8-bit quantization. Furthermore, together with 5-bit quantization, we perform about 50% parameter reduction by pruning and get only about 3% reduction in mAP. The resulting compressed networks have sizes of around 30MB and 65MB which makes them easily usable in memory constrained devices.
引用
收藏
页码:352 / 357
页数:6
相关论文
共 50 条
  • [21] Compressing Neural Networks with the Hashing Trick
    Chen, Wenlin
    Wilson, James T.
    Tyree, Stephen
    Weinberger, Kilian Q.
    Chen, Yixin
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 2285 - 2294
  • [22] Recognizing irregular entities in biomedical text via deep neural networks
    Li, Fei
    Zhang, Meishan
    Tian, Bo
    Chen, Bo
    Fu, Guohong
    Ji, Donghong
    PATTERN RECOGNITION LETTERS, 2018, 105 : 105 - 113
  • [23] Compressing Low Precision Deep Neural Networks Using Sparsity-Induced Regularization in Ternary Networks
    Faraone, Julian
    Fraser, Nicholas
    Gambardella, Giulio
    Blott, Michaela
    Leong, Philip H. W.
    NEURAL INFORMATION PROCESSING (ICONIP 2017), PT II, 2017, 10635 : 393 - 404
  • [24] OPQ: Compressing Deep Neural Networks with One-shot Pruning-Quantization
    Hu, Peng
    Peng, Xi
    Zhu, Hongyuan
    Aly, Mohamed M. Sabry
    Lin, Jie
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 7780 - 7788
  • [25] COMPRESSING DEEP NEURAL NETWORKS USING TOEPLITZ MATRIX: ALGORITHM DESIGN AND FPGA IMPLEMENTATION
    Liao, Siyu
    Samiee, Ashkan
    Deng, Chunhua
    Bai, Yu
    Yuan, Bo
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1443 - 1447
  • [26] Techniques for Compressing Deep Convolutional Neural Network
    Chaman, Shilpa
    2020 INTERNATIONAL CONFERENCE ON COMPUTATIONAL PERFORMANCE EVALUATION (COMPE-2020), 2020, : 48 - 53
  • [27] Improving shallow neural network by compressing deep neural network
    Carvalho, Marcus
    Pratama, Mahardhika
    2018 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI), 2018, : 1382 - 1387
  • [28] Compressing Deep Networks by Neuron Agglomerative Clustering
    Wang, Li-Na
    Liu, Wenxue
    Liu, Xiang
    Zhong, Guoqiang
    Roy, Partha Pratim
    Dong, Junyu
    Huang, Kaizhu
    SENSORS, 2020, 20 (21) : 1 - 16
  • [29] Storing and Compressing Video into Neural Networks by Overfitting
    Egawa, Hiroki
    Shibata, Yuichiro
    COMPLEX, INTELLIGENT, AND SOFTWARE INTENSIVE SYSTEMS, 2019, 772 : 615 - 626
  • [30] Compressing neural networks via formal methods
    Ressi, Dalila
    Romanello, Riccardo
    Rossi, Sabina
    Piazza, Carla
    NEURAL NETWORKS, 2024, 178