Human body detection algorithm in complex monitoring scenes

被引:0
|
作者
Zhang S. [1 ]
Li J. [1 ]
机构
[1] School of Mathematics and Statistics, Xidian University, Xi'an
来源
Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University | 2021年 / 48卷 / 05期
关键词
Deep neural network; Feature fusion; Feature map generation strategy; Human body detection; Video surveillance;
D O I
10.19665/j.issn1001-2400.2021.05.009
中图分类号
学科分类号
摘要
In the video surveillance scene, due to the influence of factors such as complex background, multi-posture and occlusion, existing human body detection algorithms have problems such as low accuracy and weak model generalization ability. In response to the above problems, we have designed a detection network feature fusion method and feature map generation strategy based on the feature image pyramid and multi-scale receptive field theory. By relying on the lightweight feature image pyramid technology and combining optimization methods such as data enhancement, anchor box matching strategy and occlusion loss function, we have further proposed a human body detection algorithm EFIPNet based on the deep neural network. Meantime, in order to fully verify the effectiveness of the EFIPNet algorithm, this paper establishes 4 diversified video surveillance scene data sets, which involves a total of 50 common human body postures. The validation of the algorithm shows that the human detection network we have designed can effectively improve the detection accuracy of the human body, and achieve accurate and real-time human body detection in complex monitoring scenarios. In addition, in order to verify the effectiveness of different modules in the EFIPNet algorithm, we have used the ablation research method to analyze the influence of the main modules in the network on the performance of the human body detection model. On the Person dataset, compared with the SSD detection algorithm, the EFIPNet algorithm improves the detection accuracy of human targets by 4.34% while maintaining the detection speed of 45 frames per second. © 2021, The Editorial Board of Journal of Xidian University. All right reserved.
引用
收藏
页码:68 / 77
页数:9
相关论文
共 16 条
  • [1] REN S, HE K, GIRSHICK R, Et al., FasterR-CNN: Towards Real-Time Object Detection with Region Proposal Networks, IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 6, pp. 1137-1149, (2017)
  • [2] LIU W, ANGUELOV D, ERHAN D, Et al., SSD: Single Shot Multibox Detector, European Conference on Computer Vision, pp. 21-37, (2016)
  • [3] ZHANG S, WEN L, BIAN X, Et al., Single-Shot Refinement Neural Network for Object Detec-tion, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4203-4212, (2018)
  • [4] ZHENGQ, LI Z, ZHANG Z, Et al., ThunderNet: Towards Real-Time Generic Object Detection on Mobile De-vices, Proceedings of the IEEE International Conference on Computer Vision, pp. 6717-6726, (2019)
  • [5] TAN M, PANG R, LE Q V., EfficientDet: Scalable and Efficient Object Detection, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 10781-10790, (2020)
  • [6] TIAN Y, PING L, WANG X, Et al., Deep Learning Strong Parts for Pedestrian Detection, IEEE International Conference on Computer Vision, pp. 1904-1912, (2016)
  • [7] MAO J, XIAO T, JIANG Y, Et al., WhatCan Help Pedestrian Detection, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6034-6043, (2017)
  • [8] WANG X, XIAO T, JIANG Y, Et al., Repulsion Loss: Detecting Pedestrians in a Crowd, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7774-7783, (2018)
  • [9] CHI C, ZHANG S, XING J, Et al., PedHunter: Occlusion Robust Pedestrian Detector in Crowded Scenes, Proceedings of the AAAI Conference on Artificial Intelligence, pp. 10639-10646, (2020)
  • [10] WANG Yudong, GUO Jichang, WANG Tianbao, Algorithm for Foggy-Image Pedestrian and Vehicle Detection, Journal of Xidian University, 47, 4, pp. 70-77, (2020)