CSMOT: Make One-Shot Multi-Object Tracking in Crowded Scenes Great Again

Cited by: 3
Authors
Hou, Haoxiong [1, 2]
Shen, Chao [1, 2]
Zhang, Ximing [1]
Gao, Wei [1]
Affiliations
[1] Chinese Acad Sci, Xian Inst Opt & Precis Mech, Xian 710119, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 101408, Peoples R China
Keywords
one-shot; multi-object tracking; re-ID; coordinate attention; angle-center loss; data association
DOI
10.3390/s23073782
Chinese Library Classification
O65 [Analytical Chemistry]
Discipline classification codes
070302; 081704
Abstract
Current one-shot multi-object tracking (MOT) algorithms are dominated by the joint detection and embedding paradigm, which offers high inference speed and accuracy, but their tracking performance is unstable in crowded scenes. The detection branch has difficulty obtaining accurate object positions, and the ambiguous appearance features extracted by the re-identification (re-ID) branch lead to identity switches. To address these problems, this paper proposes a more robust MOT algorithm, named CSMOT, based on FairMOT. First, on top of the encoder-decoder network, a coordinate attention module is designed to enhance the information interaction between channels (horizontal and vertical coordinates), which improves object detection. Then, an angle-center loss that effectively maximizes intra-class similarity is proposed to optimize the re-ID branch, making the extracted re-ID features more discriminative. The re-ID feature dimension is further redesigned to balance the detection and re-ID tasks. Finally, a simple and effective data association mechanism is introduced, which associates every detection rather than only the high-score detections during tracking. Experimental results show that the algorithm achieves excellent tracking performance on multiple public datasets and can be applied effectively to crowded scenes. In particular, CSMOT reduces the number of ID switches by 11.8% and 33.8% on the MOT16 and MOT17 test sets, respectively, compared to the baseline.
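The data association idea in the abstract (using every detection, not only the high-score ones) can be illustrated with a minimal sketch. This is not the authors' implementation: the two-stage scheme, the greedy IoU matching, the function names (`iou`, `greedy_match`, `associate`), and the thresholds `high_thresh` and `min_iou` are all assumptions chosen for illustration, and the re-ID embedding distance that a FairMOT-style tracker would also use is omitted.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def greedy_match(tracks, track_ids, dets, det_ids, min_iou):
    """Pair tracks with detections greedily, highest IoU first."""
    pairs = sorted(
        ((iou(tracks[t], dets[d][0]), t, d) for t in track_ids for d in det_ids),
        reverse=True,
    )
    matches, used_tracks, used_dets = {}, set(), set()
    for score, t, d in pairs:
        if score < min_iou:
            break  # all remaining pairs overlap even less
        if t in used_tracks or d in used_dets:
            continue
        matches[t] = d
        used_tracks.add(t)
        used_dets.add(d)
    return matches


def associate(tracks, dets, high_thresh=0.5, min_iou=0.3):
    """Hypothetical two-stage association.

    tracks: dict mapping track id -> last known box
    dets:   list of (box, confidence score)
    Returns a dict mapping track id -> index into dets.
    """
    high = [i for i, (_, s) in enumerate(dets) if s >= high_thresh]
    low = [i for i, (_, s) in enumerate(dets) if s < high_thresh]

    # Stage 1: high-score detections are matched against all tracks.
    matches = greedy_match(tracks, list(tracks), dets, high, min_iou)

    # Stage 2: tracks still unmatched get a chance with low-score detections,
    # which in crowded scenes are often occluded objects rather than noise.
    remaining = [t for t in tracks if t not in matches]
    matches.update(greedy_match(tracks, remaining, dets, low, min_iou))
    return matches
```

In this sketch a partially occluded person detected with low confidence can still keep their track alive in the second stage instead of being discarded. A production tracker would more likely solve each stage as an optimal assignment problem and add motion prediction; the greedy matcher is used here only to keep the example dependency-free.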
Pages: 17