Compressing Deep Neural Networks for Recognizing Places

被引:0
|
作者
Saha, Soham [1 ]
Varma, Girish [1 ]
Jawahar, C. V. [1 ]
机构
[1] Int Inst Informat Technol, KCIS, CVIT, Hyderabad, India
关键词
Visual Place Recognition; Model Compression; Image Retrieval;
D O I
10.1109/ACPR.2017.154
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Visual place recognition on low memory devices such as mobile phones and robotics systems is a challenging problem. The state of the art models for this task uses deep learning architectures having close to 100 million parameters which takes over 400MB of memory. This makes these models infeasible to be deployed in low memory devices and gives rise to the need of compressing them. Hence we study the effectiveness of model compression techniques like trained quantization and pruning for reducing the number of parameters on one of the best performing image retrieval models called NetVLAD. We show that a compressed network can be created by starting with a model pre-trained for the task of visual place recognition and then fine-tuning it via trained pruning and quantization. The compressed model is able to produce the same mAP as the original uncompressed network. We achieve almost 50% parameter pruning with no loss in mAP and 70% pruning with close to 2% mAP reduction, while also performing 8-bit quantization. Furthermore, together with 5-bit quantization, we perform about 50% parameter reduction by pruning and get only about 3% reduction in mAP. The resulting compressed networks have sizes of around 30MB and 65MB which makes them easily usable in memory constrained devices.
引用
收藏
页码:352 / 357
页数:6
相关论文
共 50 条
  • [31] Compressing Convolutional Neural Networks in the Frequency Domain
    Chen, Wenlin
    Wilson, James
    Tyree, Stephen
    Weinberger, Kilian Q.
    Chen, Yixin
    KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 1475 - 1484
  • [32] Building Emotional Machines: Recognizing Image Emotions Through Deep Neural Networks
    Kim, Hye-Rin
    Kim, Yeong-Seok
    Kim, Seon Joo
    Lee, In-Kwon
    IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (11) : 2980 - 2992
  • [33] Delta-DNN: Efficiently Compressing Deep Neural Networks via Exploiting Floats Similarity
    Hu, Zhenbo
    Zou, Xiangyu
    Xia, Wen
    Jin, Sian
    Tao, Dingwen
    Liu, Yang
    Zhang, Weizhe
    Zhang, Zheng
    PROCEEDINGS OF THE 49TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2020, 2020,
  • [34] BHNN: a Memory-Efficient Accelerator for Compressing Deep Neural Networks with Blocked Hashing Techniques
    Zhu, Jingyang
    Qian, Zhiliang
    Tsui, Chi-Ying
    2017 22ND ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2017, : 690 - 695
  • [35] Compressing Deep Convolutional Neural Networks by Stacking Low-dimensional Binary Convolution Filters
    Lan, Weichao
    Lan, Liang
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 8235 - 8242
  • [36] CIRCNN: Accelerating and Compressing Deep Neural Networks Using Block-Circulant Weight Matrices
    Ding, Caiwen
    Liao, Siyu
    Wang, Yanzhi
    Li, Zhe
    Liu, Ning
    Zhuo, Youwei
    Wang, Chao
    Qian, Xuehai
    Bai, Yu
    Yuan, Geng
    Ma, Xiaolong
    Zhang, Yipeng
    Tang, Jian
    Qiu, Qinru
    Lin, Xue
    Yuan, Bo
    50TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO), 2017, : 395 - 408
  • [37] Differential evolution based layer-wise weight pruning for compressing deep neural networks
    Wu, Tao
    Li, Xiaoyang
    Zhou, Deyun
    Li, Na
    Shi, Jiao
    Sensors (Switzerland), 2021, 21 (03): : 1 - 20
  • [38] Differential Evolution Based Layer-Wise Weight Pruning for Compressing Deep Neural Networks
    Wu, Tao
    Li, Xiaoyang
    Zhou, Deyun
    Li, Na
    Shi, Jiao
    SENSORS, 2021, 21 (03) : 1 - 20
  • [39] Spectral Pruning: Compressing Deep Neural Networks via Spectral Analysis and its Generalization Error
    Suzuki, Taiji
    Abe, Hiroshi
    Murata, Tomoya
    Horiuchi, Shingo
    Ito, Kotaro
    Wachi, Tokuma
    Hirai, So
    Yukishima, Masatoshi
    Nishimura, Tomoaki
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 2839 - 2846
  • [40] Dissociable Neural Systems for Recognizing Places and Navigating through Them
    Persichetti, Andrew S.
    Dilks, Daniel D.
    JOURNAL OF NEUROSCIENCE, 2018, 38 (48): : 10295 - 10304