CSMOT: Make One-Shot Multi-Object Tracking in Crowded Scenes Great Again

Cited by: 3
Authors
Hou, Haoxiong [1, 2]
Shen, Chao [1, 2]
Zhang, Ximing [1]
Gao, Wei [1]
Affiliations
[1] Chinese Acad Sci, Xian Inst Opt & Precis Mech, Xian 710119, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 101408, Peoples R China
Keywords
one-shot; multi-object tracking; re-ID; coordinate attention; angle-center loss; data association
DOI
10.3390/s23073782
Chinese Library Classification
O65 [Analytical Chemistry]
Discipline classification codes
070302; 081704
Abstract
Popular one-shot multi-object tracking (MOT) algorithms are currently dominated by the joint detection and embedding paradigm, which offers high inference speed and accuracy but unstable tracking performance in crowded scenes. The detection branch has difficulty obtaining accurate object positions, and the ambiguous appearance features extracted by the re-identification (re-ID) branch lead to identity switches. To address these problems, this paper proposes a more robust MOT algorithm, named CSMOT, based on FairMOT. First, on the basis of the encoder-decoder network, a coordinate attention module is designed to enhance information interaction between channels (horizontal and vertical coordinates), which improves object detection. Then, an angle-center loss that effectively maximizes intra-class similarity is proposed to optimize the re-ID branch, making the extracted re-ID features more discriminative. We further redesign the re-ID feature dimension to balance the detection and re-ID tasks. Finally, a simple and effective data association mechanism is introduced, which associates every detection, rather than only the high-score detections, during tracking. The experimental results show that our one-shot MOT algorithm achieves excellent tracking performance on multiple public datasets and can be applied effectively to crowded scenes. In particular, CSMOT decreases the number of ID switches by 11.8% and 33.8% on the MOT16 and MOT17 test datasets, respectively, compared to the baseline.
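The abstract's last step, associating every detection rather than only the high-score ones, can be illustrated with a minimal sketch. This record does not give the paper's actual matching procedure, so everything below is an assumption for illustration: the two-pass order (high-score detections first, then the rest), greedy IoU matching, the function names `iou`, `greedy_match`, and `associate_all`, and the thresholds `high_thresh` and `min_iou` are all hypothetical. A real tracker would typically also use motion prediction and re-ID embeddings.

```python
from typing import Dict, List, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def greedy_match(
    tracks: Dict[int, Box], dets: List[int], boxes: List[Box], min_iou: float
) -> Dict[int, int]:
    """Greedily pair track ids with detection indices, best IoU first."""
    pairs = sorted(
        ((iou(tbox, boxes[d]), tid, d) for tid, tbox in tracks.items() for d in dets),
        reverse=True,
    )
    matches: Dict[int, int] = {}
    used = set()
    for score, tid, d in pairs:
        if score < min_iou:
            break  # remaining pairs overlap even less
        if tid in matches or d in used:
            continue
        matches[tid] = d
        used.add(d)
    return matches


def associate_all(
    tracks: Dict[int, Box],
    boxes: List[Box],
    scores: List[float],
    high_thresh: float = 0.6,  # hypothetical score threshold
    min_iou: float = 0.3,      # hypothetical overlap threshold
) -> Dict[int, int]:
    """Two-pass association (an assumed scheme, not the paper's exact method)."""
    high = [i for i, s in enumerate(scores) if s >= high_thresh]
    low = [i for i, s in enumerate(scores) if s < high_thresh]
    # Pass 1: match tracks against high-score detections.
    matches = greedy_match(tracks, high, boxes, min_iou)
    # Pass 2: offer the low-score detections to the still-unmatched tracks.
    remaining = {tid: box for tid, box in tracks.items() if tid not in matches}
    matches.update(greedy_match(remaining, low, boxes, min_iou))
    return matches


tracks = {1: (0, 0, 10, 10), 2: (20, 20, 30, 30)}
boxes = [(1, 1, 11, 11), (21, 21, 31, 31)]
scores = [0.9, 0.2]  # the second detection is low-score, e.g. partly occluded
matches = associate_all(tracks, boxes, scores)
```

In this toy example a tracker that discards low-score detections would leave track 2 unmatched, whereas the second pass recovers it. That is the intuition behind using every detection in crowded scenes, where occlusion often lowers detection confidence.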
Pages: 17
Related papers
45 in total
  • [31] Vehicular/Non-Vehicular Multi-Class Multi-Object Tracking in Drone-Based Aerial Scenes
    Bisio, Igor
    Garibotto, Chiara
    Haleem, Halar
    Lavagetto, Fabio
    Sciarrone, Andrea
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (04) : 4961 - 4977
  • [32] OneShotDA: Online Multi-Object Tracker With One-Shot-Learning-Based Data Association
    Yoon, Kwangjin
    Gwak, Jeonghwan
    Song, Young-Min
    Yoon, Young-Chul
    Jeon, Moon-Gu
    IEEE ACCESS, 2020, 8 : 38060 - 38072
  • [33] Dynamic fry counting based on multi-object tracking and one-stage detection
    Zhang, Hanyu
    Li, Weiran
    Qi, Yanyu
    Liu, Haonan
    Li, Zhenbo
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2023, 209
  • [34] Continuous Copy-Paste for One-stage Multi-object Tracking and Segmentation
    Xu, Zhenbo
    Meng, Ajin
    Shi, Zhenbo
    Yang, Wei
    Chen, Zhi
    Huang, Liusheng
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021 : 15303 - 15312
  • [36] DIVOTrack: A Novel Dataset and Baseline Method for Cross-View Multi-Object Tracking in DIVerse Open Scenes
    Hao, Shengyu
    Liu, Peiyuan
    Zhan, Yibing
    Jin, Kaixun
    Liu, Zuozhu
    Song, Mingli
    Hwang, Jenq-Neng
    Wang, Gaoang
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (04) : 1075 - 1090
  • [37] DETECTION-IDENTIFICATION BALANCING MARGIN LOSS FOR ONE-STAGE MULTI-OBJECT TRACKING
    Lee, Heansung
    Cho, Suhwan
    Jang, Sungjun
    Lee, Jungho
    Woo, Sungmin
    Lee, Sangyoun
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022 : 3081 - 3085
  • [38] One-Shot Multiple Object Tracking in UAV Videos Using Task-Specific Fine-Grained Features
    Wu, Han
    Nie, Jiahao
    He, Zhiwei
    Zhu, Ziming
    Gao, Mingyu
    REMOTE SENSING, 2022, 14 (16)
  • [39] Accurate 3D Multi-Object Detection and Tracking on Vietnamese Street Scenes Based on Sparse Point Cloud Data
    Loc, Hoang Duy
    Son, Le Anh
    Nang, Ho Xuan
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025, 26 (01) : 92 - 101
  • [40] Research on Real-Time Visual SLAM Method Based on 3D Multi-Object Tracking in Dynamic Scenes
    Chen J.
    Che Y.
    Tian X.
    Lan F.
    Zhou Y.
    Qiche Gongcheng/Automotive Engineering, 2024, 46 (05) : 776 - 783