Pedestrian multi-object tracking algorithm based on improved YOLOX and multi-level data association

被引:0
|
作者
Han K. [1 ]
Peng J. [1 ]
机构
[1] School of Traffic & Transportation Engineering, Central South University, Changsha
关键词
attention mechanism; computer vision; data association; multi-object tracking; object detection;
D O I
10.19713/j.cnki.43-1423/u.T20230333
中图分类号
学科分类号
摘要
Object tracking is a fundamental problem in the field of computer vision. Pedestrian multi-object tracking has broad application prospects in many fields such as intelligent surveillance and intelligent transportation. However, frequent occlusion and scale change exist in the actual tracking scene, which brings great challenges to the multi-object tracking algorithm. In order to further improve the tracking accuracy, on the basis of DeepSORT, a pedestrian multi-object tracking algorithm based on improved YOLOX and multi-level data association was proposed. For the detector, in order to enhance the feature expression ability of the network and improve the detection accuracy, the ECA channel attention module and the ASFF adaptive feature fusion module were introduced into the YOLOX skeleton network and the neck network respectively. For identification features, in order to reduce the number of false matches in the data association step and improve tracking efficiency, the lightweight OSNet re-identification network and NSA Kalman filter were used to obtain target features. For data association, in order to reduce the number of identity switching and avoid target loss, the detection and tracking were classified. The different similarity calculation methods were used to realize multi-level data association based on detection confidence and trajectory state. The experimental results show that compared with the algorithm that simply combines YOLOX and DeepSORT before improvement,the introduction of ECA module and ASFF module in YOLOX can reduce the number of false detections. The reduction can be up to 17% when using YOLOX-s model. The feature extraction method combining OSNet model and NSA Kalman filter can improve the tracking stability, the IDF1 index is increased by 0.77%, and the IDSW is reduced by 947. The multi-level data association algorithm based on detection confidence and trajectory state can significantly improve the tracking performance, and the MOTA index is increased by 3.36%. The results of MOTA on MOT17 and MOT20 test sets are 80.4% and 77.7%, and IDF1 are 78.4% and 76.7%. Compared with other advanced algorithms, the proposed pedestrian multi-object tracking method achieves a better balance between tracking accuracy and tracking speed, which can provide reference for online pedestrian multi-object tracking applications in industry. © 2024, Central South University Press. All rights reserved.
引用
收藏
页码:94 / 105
页数:11
相关论文
共 30 条
  • [21] LIANG Hong, WU Ting, ZHANG Qian, Et al., Non-maximum suppression performs later in multi-object tracking, Applied Sciences, 12, 7, (2022)
  • [22] SHAO Shuai, ZHAO Zijian, LI Boxun, Et al., CrowdHuman: a benchmark for detecting human in a crowd, (2018)
  • [23] ZHANG Shanshan, BENENSON R, SCHIELE B., CityPersons: a diverse dataset for pedestrian detection [C], 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 4457-4465, (2017)
  • [24] ESS A, LEIBE B, SCHINDLER K, Et al., A mobile vision system for robust multi-person tracking[C], 2008 IEEE Conference on Computer Vision and Pattern Recognition, pp. 1-8, (2008)
  • [25] MILAN A, LEAL-TAIXE L, REID I, Et al., MOT16: a benchmark for multi-object tracking, (2016)
  • [26] DENDORFER P, REZATOFIGHI H, MILAN A, Et al., MOT20: a benchmark for multi object tracking in crowded scenes, (2020)
  • [27] SUN Peize, CAO Jinkun, JIANG Yi, Et al., DanceTrack: multi-object tracking in uniform appearance and diverse motion[C], 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 20961-20970, (2022)
  • [28] EVANGELIDIS G D, PSARAKIS E Z., Parametric image alignment using enhanced correlation coefficient maximization[J], IEEE Transactions on Pattern Analysis and Machine Intelligence, 30, 10, (2008)
  • [29] CHU Peng, WANG Jiang, YOU Quanzeng, Et al., TransMOT: spatial-temporal graph transformer for multiple object tracking[C], 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), pp. 4859-4869, (2023)
  • [30] YANG Fan, CHANG Xin, SAKTI S, Et al., ReMOT: a model-agnostic refinement for multiple object tracking, Image and Vision Computing, 106, (2021)