Robust Multi-Object Tracking with pseudo-information guided motion and enhanced semantic vision

被引:0
|
作者
Zhang, Yukuan [1 ,2 ]
Wang, Shengsheng [1 ,2 ]
Fu, Zihao [1 ,2 ]
Zhao, Limin [3 ]
Zhao, Jiarui [1 ,2 ]
机构
[1] Jilin Univ, Coll Comp Sci & Technol, Qianwei South Campus,2699 Qianjin St, Changchun 130012, Jilin Province, Peoples R China
[2] Jilin Univ, Key Lab Symbol Computat & Knowledge Engn, Minist Educ, Changchun 130012, Peoples R China
[3] Beihang Univ, Sch Space & Environm, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-Object Tracking; Pseudo information; Semantic information; Embedding clusters; Discriminative power; ONLINE;
D O I
10.1016/j.eswa.2025.126846
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The key to Multi-Object Tracking is to differentiate multiple instances in a video sequence and maintain their identity continuity. To achieve this goal, most methods model the motion or appearance cues of instances. However, when faced with complex scenarios like camera motion, occlusion, and crowding, trackers often lack discriminative capabilities. In this paper, we propose a robust tracker, named RccTrack, that combines motion cues guided by pseudo information and enhanced visual clues to overcome the aforementioned issues. Specifically, pseudo-observation information is constructed for guiding trajectory localization and generate interference-resistant trajectories. Pseudo-state information is constructed for guiding the calculation of inter- frame target motion directions. These pseudo-information is used to enhance the discriminative power of the motion cues. For visual cues, a semantic fusion network is designed to extract strong discriminative appearance information and store them in our hierarchical fusion embedding clusters, thus enhancing the discriminative power of the visual cues. In addition, we design the cascade matching method, which performs the association task based on the trajectory length information to distinguish confusing targets. In the matching stage, the two cues mentioned above are combined to enhance the discriminative power of the tracker. Experimental results demonstrate that RccTrack achieves state-of-the-art performance on MOT16, MOT17, MOT20, and DanceTrack benchmarks.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] UMTSS: a unifocal motion tracking surveillance system for multi-object tracking in videos
    Soma Hazra
    Shaurjya Mandal
    Banani Saha
    Sunirmal Khatua
    Multimedia Tools and Applications, 2023, 82 : 12401 - 12422
  • [42] UMTSS: a unifocal motion tracking surveillance system for multi-object tracking in videos
    Hazra, Soma
    Mandal, Shaurjya
    Saha, Banani
    Khatua, Sunirmal
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (08) : 12401 - 12422
  • [43] Semantically Enhanced Multi-Object Detection and Tracking for Autonomous Vehicles
    Wen, Tao
    Freris, Nikolaos M.
    IEEE TRANSACTIONS ON ROBOTICS, 2023, 39 (06) : 4600 - 4615
  • [44] A Comparative Study of BatchEnsemble for Multi-Object Tracking Approximations in Embedded Vision
    Nsinga, Robert
    Karungaru, Stephen
    Terada, Kenji
    FIFTEENTH INTERNATIONAL CONFERENCE ON QUALITY CONTROL BY ARTIFICIAL VISION, 2021, 11794
  • [45] Multi-Object Tracking with Interacting Vehicles and Road Map Information
    Danzer, Andreas
    Gies, Fabian
    Dietmayer, Klaus
    2018 21ST INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2018, : 589 - 595
  • [46] Embedded omni-vision navigator based on multi-object tracking
    Fu, Huazhu
    Cao, Zuoliang
    Cao, Xiaochun
    MACHINE VISION AND APPLICATIONS, 2011, 22 (02) : 349 - 358
  • [47] AIPT: Adaptive information perception for online multi-object tracking
    Zhang, Yukuan
    Xie, Housheng
    Jia, Yunhua
    Meng, Jingrui
    Sang, Meng
    Qiu, Junhui
    Zhao, Shan
    Yang, Yang
    KNOWLEDGE-BASED SYSTEMS, 2024, 285
  • [48] Learning Spatio-Temporal Information for Multi-Object Tracking
    Wei, Jian
    Yang, Mei
    Liu, Feng
    IEEE ACCESS, 2017, 5 : 3869 - 3877
  • [49] Extending IOU Based Multi-Object Tracking by Visual Information
    Bochinski, Erik
    Senst, Tobias
    Sikora, Thomas
    2018 15TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS), 2018, : 435 - 440
  • [50] Multi-object tracking with adaptive measurement noise and information fusion
    Huang, Xi
    Zhan, Yinwei
    IMAGE AND VISION COMPUTING, 2024, 144