Occluded Pedestrian Detection Algorithm Based on Improved YOLOv3

Cited by: 13
Authors
Li Xiang [1, 2, 3, 4, 5]
He Miao [1, 2, 3, 4]
Luo Haibo [1, 2, 3, 4]
Affiliations
[1] Chinese Acad Sci, Key Lab Optoelect Informat Proc, Shenyang 110016, Liaoning, Peoples R China
[2] Chinese Acad Sci, Shenyang Inst Automat, Shenyang 110016, Liaoning, Peoples R China
[3] Chinese Acad Sci, Inst Robot, Shenyang 110169, Liaoning, Peoples R China
[4] Chinese Acad Sci, Inst Intelligent Mfg, Shenyang 110169, Liaoning, Peoples R China
[5] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
Keywords
machine vision; object detection; neural network; pedestrian detection; attention mechanism
DOI
10.3788/AOS202242.1415003
Chinese Library Classification (CLC) number
O43 [Optics]
Discipline classification codes
070207; 0803
Abstract
In crowded scenes, YOLOv3 struggles to detect objects that heavily overlap one another. Three improvements are proposed to address the causes of this performance decline. First, a Tight Loss function is proposed that optimizes the variance and mean of the prediction box coordinates so that the prediction boxes belonging to the same target become more compact, which reduces the false positive rate. Second, a high-resolution feature pyramid is proposed in which the resolution of each pyramid feature is increased by upsampling and shallow features are introduced to enhance the differences between adjacent sub-features, so that discriminative deep features are generated for highly overlapped targets. Third, a detection head based on a spatial attention mechanism is proposed to reduce the number of redundant prediction boxes and thereby lower the computational burden of the non-maximum suppression (NMS) process. Experimental results on the crowded-scene dataset CrowdHuman show that, with the traditional NMS method, the proposed algorithms improve the average accuracy and recall rate of YOLOv3 by 2.91 and 3.20 percentage points, respectively, and reduce the miss rate by 1.24 percentage points, which demonstrates their effectiveness for occluded pedestrian detection.
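The abstract evaluates the method with the traditional NMS step, so a small illustration of that step may help show why heavily overlapped pedestrians are hard to keep. The following is a minimal sketch of generic greedy NMS in plain Python, not the authors' implementation; the function names (iou, greedy_nms), the box coordinates, the scores, and the thresholds are all illustrative assumptions.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def greedy_nms(boxes, scores, iou_threshold=0.5):
    """Return indices of the boxes kept by traditional greedy NMS."""
    # Visit boxes from the highest score to the lowest.
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    while order:
        best = order.pop(0)
        keep.append(best)
        # Discard every remaining box that overlaps the kept box too much.
        order = [i for i in order if iou(boxes[best], boxes[i]) <= iou_threshold]
    return keep


# Hypothetical data: two pedestrians standing close together,
# plus a loose duplicate prediction of the first one.
boxes = [
    (10, 10, 50, 110),  # pedestrian A
    (12, 12, 52, 112),  # duplicate prediction of A
    (25, 10, 65, 110),  # pedestrian B, partly occluded by A
]
scores = [0.95, 0.80, 0.90]

kept_loose = greedy_nms(boxes, scores, iou_threshold=0.5)   # keeps A and B
kept_strict = greedy_nms(boxes, scores, iou_threshold=0.4)  # keeps only A
```

In this toy case the duplicate of A is always suppressed, but pedestrian B survives only at the looser threshold because its overlap with A is about 0.45. A threshold low enough to remove loose duplicates can therefore also remove a real occluded pedestrian, which is the trade-off that tighter prediction boxes and fewer redundant boxes are meant to ease.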
Pages: 10