Incremental Deep Learning Method for Object Detection Model Based on Knowledge Distillation

被引：0

作者：

Fang W. ^{[1
]}

Chen A. ^{[1
]}

Meng N. ^{[1
]}

Cheng H. ^{[1
]}

Wang Q. ^{[1
]}

机构：

[1] School of Computer and Info. Technol., Beijing Jiaotong Univ., Beijing

来源：

Gongcheng Kexue Yu Jishu/Advanced Engineering Sciences | 2022年 / 54卷 / 06期

关键词：

edge computing; incremental learning; knowledge distillation; model compression; object detection;

D O I：

10.15961/j.jsuese.202100925

中图分类号：

学科分类号：

摘要：

With the advent of the Internet of Everything era, the number of IoT devices with object detection capability has exploded. Accordingly, massive amounts of real-time data are generated at the edges of the network. Thus, edge computing has become an emerging computing paradigm that has the characteristics of low latency, low bandwidth and high security. While the traditional deep learning approaches usually assume that all data have been obtained before model training, a large number of new data and new categories are often gradually generated and obtained over time in real edge computing environment. In order to execute the object detection task efficiently on resource-constrained edge devices when the input data samples are accumulated and updated in batches, an incremental learning method based on knowledge distillation of multiple intermediate layers (ILMIL) was proposed in this paper. First, to preserve the obtained knowledge from existing data, a new metric called MFRRK was proposed for covering knowledge from multiple intermediate network layers. The discrepancy between the intermediate layers’ features of teacher model and student model were added in ILMIL to model training. Compared to the recent incremental learning methods based on knowledge distillation, the student model adapted ILMIL was able to alleviate forgetting by learning more knowledge of the old classes from the intermediate layers of the teacher model. Then, the incremental training for current model was conducted to avoid resource costs of using multiple independent models for training. To further reduce the model complexity, the model pruning technique can be used before knowledge distillation to compress the current model. Extensive experiments under different scenarios and conditions demonstrated that the proposed ILMIL method can effectively reduce the model calculation and storage costs, alleviate the catastrophic forgetting of existing knowledge, and maintain acceptable inference accuracy. © 2022 Editorial Department of Journal of Sichuan University. All rights reserved.

引用

页码：59 / 66

页数：7

共 25 条

[1] Keyan Cao, Yefan Liu, Gongjie Meng, Et al., An overview on edge computing research[J], IEEE Access, 8, pp. 85714-85728, (2020)
[2] Weisong Shi, Jie Cao, Quan Zhang, Et al., Edge computing: Vision and challenges[J], IEEE Internet of Things Journal, 3, 5, pp. 637-646, (2016)
[3] Viola P, Jones M., Rapid object detection using a boosted cascade of simple features, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.CVPR 2001, (2001)
[4] Girshick R, Donahue J, Darrell T, Et al., Rich feature hierarchies for accurate object detection and semantic segmentation [C], Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp. 580-587, (2014)
[5] Girshick R., Fast R-CNN[C], Proceedings of the 2015 IEEE International Conference on Computer Vision, pp. 1440-1448, (2015)
[6] Shaoqing Ren, Kaiming He, Girshick R, Et al., Faster R-CNN: Towards real-time object detection with region proposal networks[J], IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 6, pp. 1137-1149, (2017)
[7] Redmon J, Divvala S, Girshick R, Et al., You only look once: Unified,real-time object detection[C], Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, pp. 779-788, (2016)
[8] Liu W, Anguelov D, Erhan D, Et al., Ssd:Single shot multibox detector[M], European conference on computer vision, pp. 21-37, (2016)
[9] Kaiwen Duan, Bai Song, Xie Lingxi, Et al., CenterNet:Keypoint triplets for object detection[C], Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 6568-6577, (2019)
[10] Rebuffi S A, Kolesnikov A, Sperl G, Et al., iCaRL:Incremental classifier and representation learning[C], Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, pp. 5533-5542, (2017)

← 1 2 3 →