Occluded Pedestrian Detection Algorithm Based on Improved YOLOv3

被引:13
|
作者
Li Xiang [1 ,2 ,3 ,4 ,5 ]
He Miao [1 ,2 ,3 ,4 ]
Luo Haibo [1 ,2 ,3 ,4 ]
机构
[1] Chinese Acad Sci, Key Lab Optoelect Informat Proc, Shenyang 110016, Liaoning, Peoples R China
[2] Chinese Acad Sci, Shenyang Inst Automat, Shenyang 110016, Liaoning, Peoples R China
[3] Chinese Acad Sci, Inst Robot, Shenyang 110169, Liaoning, Peoples R China
[4] Chinese Acad Sci, Inst Intelligent Mfg, Shenyang 110169, Liaoning, Peoples R China
[5] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
关键词
machine vision; object detection; neural network; pedestrian detection; attention mechanism;
D O I
10.3788/AOS202242.1415003
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
In crowded scenes, it is difficult for YOLOv3 to detect the objects that overlap each other heavily. Aiming at the reasons for the decline of YOLOv3 performance, three improvements are proposed. Firstly, a Tight Loss function is proposed, which optimizes the variance and mean of the coordinates of the prediction boxes to make the prediction boxes belonging to the same target more compact, thus reducing the false positive rate. Secondly, a high-resolution feature pyramid is proposed, in which the resolution of each pyramid feature is improved by upsampling, and shallow features are introduced to enhance the differences between adjacent sub-features, so as to generate distinguishing depth features for highly overlapped targets. Thirdly, a detection head based on spatial attention mechanism is proposed to reduce the number of redundant prediction boxes, so as to reduce the computational burden of the non- maximum suppression (NMS) process. The experimental results on the crowded dataset CrowdHuman show that the average accuracy and recall rate of YOLOv3 detection are improved by 2. 91 percentage points and 3. 20 percentage points, and the miss rate is reduced by 1. 24 percentage points by using the proposed algorithms under the condition of using the traditional NMS method, which demonstrates the effectiveness of the proposed algorithms in boosting the performance in occluded pedestrian detection.
引用
收藏
页数:10
相关论文
共 27 条
  • [11] Loshchilov I., 2017, Proc. 5th International Conf. on Learning Representations, P1
  • [12] An improved one-stage pedestrian detection method based on multi-scale attention feature extraction
    Ma, Jun
    Wan, Honglin
    Wang, Junxia
    Xia, Hao
    Bai, Chengjie
    [J]. JOURNAL OF REAL-TIME IMAGE PROCESSING, 2021, 18 (06) : 1965 - 1978
  • [13] SPATIOTEMPORAL SCENE INTERPRETATION OF SPACE VIDEOS VIA DEEP NEURAL NETWORK AND TRACKLET ANALYSIS
    Mou, Lichao
    Zhu, Xiao Xiang
    [J]. 2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, : 1823 - 1826
  • [14] Spatial Relationship Detection Method of Remote Sensing Objects
    Nong Yuanjun
    Wang Junjie
    Zhao Xuebing
    Zhang Junhang
    Geng Hui
    Xu Xiaodong
    [J]. ACTA OPTICA SINICA, 2021, 41 (16)
  • [15] Real-Time Object Detection in Remote Sensing Images Based on Embedded System
    Nong Yuanjun
    Wang Junjie
    [J]. ACTA OPTICA SINICA, 2021, 41 (10)
  • [16] Redmon J., 2018, YOLOv3: An Incremental Improvement
  • [17] IterDet: Iterative Scheme for Object Detection in Crowded Environments
    Rukhovich, Danila
    Sofiiuk, Konstantin
    Galeev, Danil
    Barinova, Olga
    Konushin, Anton
    [J]. STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2020, 2021, 12644 : 344 - 354
  • [18] Shao S., 2018, ARXIV PREPRINT ARXIV
  • [19] [沈峘 Shen Huan], 2010, [光学学报, Acta Optica Sinica], V30, P1076
  • [20] Traffic Light Detection Based on Optimized YOLOv3 Algorithm
    Sun Yingchun
    Pan Shuguo
    Zhao Tao
    Gao Wang
    Wei Jiansheng
    [J]. ACTA OPTICA SINICA, 2020, 40 (12)