Occluded Pedestrian Detection Algorithm Based on Improved YOLOv3

被引：13

作者：

Li Xiang ^{[1
,2
,3
,4
,5
]}

He Miao ^{[1
,2
,3
,4
]}

Luo Haibo ^{[1
,2
,3
,4
]}

机构：

[1] Chinese Acad Sci, Key Lab Optoelect Informat Proc, Shenyang 110016, Liaoning, Peoples R China

[2] Chinese Acad Sci, Shenyang Inst Automat, Shenyang 110016, Liaoning, Peoples R China

[3] Chinese Acad Sci, Inst Robot, Shenyang 110169, Liaoning, Peoples R China

[4] Chinese Acad Sci, Inst Intelligent Mfg, Shenyang 110169, Liaoning, Peoples R China

[5] Univ Chinese Acad Sci, Beijing 100049, Peoples R China

来源：

ACTA OPTICA SINICA | 2022年 / 42卷 / 14期

关键词：

machine vision; object detection; neural network; pedestrian detection; attention mechanism;

D O I：

10.3788/AOS202242.1415003

中图分类号：

O43 [光学];

学科分类号：

070207 ; 0803 ;

摘要：

In crowded scenes, it is difficult for YOLOv3 to detect the objects that overlap each other heavily. Aiming at the reasons for the decline of YOLOv3 performance, three improvements are proposed. Firstly, a Tight Loss function is proposed, which optimizes the variance and mean of the coordinates of the prediction boxes to make the prediction boxes belonging to the same target more compact, thus reducing the false positive rate. Secondly, a high-resolution feature pyramid is proposed, in which the resolution of each pyramid feature is improved by upsampling, and shallow features are introduced to enhance the differences between adjacent sub-features, so as to generate distinguishing depth features for highly overlapped targets. Thirdly, a detection head based on spatial attention mechanism is proposed to reduce the number of redundant prediction boxes, so as to reduce the computational burden of the non- maximum suppression (NMS) process. The experimental results on the crowded dataset CrowdHuman show that the average accuracy and recall rate of YOLOv3 detection are improved by 2. 91 percentage points and 3. 20 percentage points, and the miss rate is reduced by 1. 24 percentage points by using the proposed algorithms under the condition of using the traditional NMS method, which demonstrates the effectiveness of the proposed algorithms in boosting the performance in occluded pedestrian detection.

引用

页数：10

共 27 条

[11] Loshchilov I., 2017, Proc. 5th International Conf. on Learning Representations, P1
[12] An improved one-stage pedestrian detection method based on multi-scale attention feature extraction
Ma, Jun
Wan, Honglin
Wang, Junxia
Xia, Hao
Bai, Chengjie
[J]. JOURNAL OF REAL-TIME IMAGE PROCESSING, 2021, 18 (06) : 1965 - 1978
[13] SPATIOTEMPORAL SCENE INTERPRETATION OF SPACE VIDEOS VIA DEEP NEURAL NETWORK AND TRACKLET ANALYSIS
Mou, Lichao
Zhu, Xiao Xiang
[J]. 2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, : 1823 - 1826
[14] Spatial Relationship Detection Method of Remote Sensing Objects
Nong Yuanjun
Wang Junjie
Zhao Xuebing
Zhang Junhang
Geng Hui
Xu Xiaodong
[J]. ACTA OPTICA SINICA, 2021, 41 (16)
[15] Real-Time Object Detection in Remote Sensing Images Based on Embedded System
Nong Yuanjun
Wang Junjie
[J]. ACTA OPTICA SINICA, 2021, 41 (10)
[16] Redmon J., 2018, YOLOv3: An Incremental Improvement
[17] IterDet: Iterative Scheme for Object Detection in Crowded Environments
Rukhovich, Danila
Sofiiuk, Konstantin
Galeev, Danil
Barinova, Olga
Konushin, Anton
[J]. STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2020, 2021, 12644 : 344 - 354
[18] Shao S., 2018, ARXIV PREPRINT ARXIV
[19] [沈峘 Shen Huan], 2010, [光学学报, Acta Optica Sinica], V30, P1076
[20] Traffic Light Detection Based on Optimized YOLOv3 Algorithm
Sun Yingchun
Pan Shuguo
Zhao Tao
Gao Wang
Wei Jiansheng
[J]. ACTA OPTICA SINICA, 2020, 40 (12)

← 1 2 3 →