Robust Multi-Object Tracking with pseudo-information guided motion and enhanced semantic vision

Authors
Zhang, Yukuan [1,2]
Wang, Shengsheng [1,2]
Fu, Zihao [1,2]
Zhao, Limin [3]
Zhao, Jiarui [1,2]
Affiliations
[1] Jilin Univ, Coll Comp Sci & Technol, Qianwei South Campus,2699 Qianjin St, Changchun 130012, Jilin Province, Peoples R China
[2] Jilin Univ, Key Lab Symbol Computat & Knowledge Engn, Minist Educ, Changchun 130012, Peoples R China
[3] Beihang Univ, Sch Space & Environm, Beijing, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Multi-Object Tracking; Pseudo information; Semantic information; Embedding clusters; Discriminative power; ONLINE;
DOI
10.1016/j.eswa.2025.126846
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The key to Multi-Object Tracking is to differentiate multiple instances in a video sequence and maintain their identity continuity. To achieve this goal, most methods model the motion or appearance cues of instances. However, when faced with complex scenarios such as camera motion, occlusion, and crowding, trackers often lack discriminative capability. In this paper, we propose a robust tracker, named RccTrack, that combines motion cues guided by pseudo-information with enhanced visual cues to overcome these issues. Specifically, pseudo-observation information is constructed to guide trajectory localization and generate interference-resistant trajectories, and pseudo-state information is constructed to guide the calculation of inter-frame target motion directions. This pseudo-information enhances the discriminative power of the motion cues. For visual cues, a semantic fusion network is designed to extract strongly discriminative appearance information and store it in our hierarchical fusion embedding clusters, thus enhancing the discriminative power of the visual cues. In addition, we design a cascade matching method, which performs the association task based on trajectory length information to distinguish confusing targets. In the matching stage, the two cues mentioned above are combined to enhance the discriminative power of the tracker. Experimental results demonstrate that RccTrack achieves state-of-the-art performance on the MOT16, MOT17, MOT20, and DanceTrack benchmarks.
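The abstract gives no formulas, so the following is only a rough sketch of the general idea of fusing a motion cue with an appearance cue and associating in a cascade ordered by trajectory length. It is not RccTrack's implementation. Everything in it is an assumption made for illustration: the IoU-based motion cost, the cosine-distance appearance cost, the weight `w_motion`, the thresholds `length_split` and `max_cost`, the two-tier split, the greedy assignment, and the dictionary layout of tracks and detections.

```python
from math import sqrt


def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def cosine_distance(u, v):
    """1 - cosine similarity; returns 1.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(u, v))
    nu = sqrt(sum(x * x for x in u))
    nv = sqrt(sum(y * y for y in v))
    if nu == 0 or nv == 0:
        return 1.0
    return 1.0 - dot / (nu * nv)


def fused_cost(track, det, w_motion=0.5):
    """Weighted sum of a motion cost and an appearance cost (assumed form)."""
    motion = 1.0 - iou(track["box"], det["box"])
    appearance = cosine_distance(track["emb"], det["emb"])
    return w_motion * motion + (1.0 - w_motion) * appearance


def cascade_match(tracks, detections, length_split=5, max_cost=0.7):
    """Match long trajectories first, then short ones (hypothetical tiers).

    Uses greedy lowest-cost-first assignment within each tier.
    Returns (matches, unmatched) where matches maps track index to
    detection index and unmatched is a sorted list of detection indices.
    """
    matches = {}
    free = set(range(len(detections)))
    long_ids = [i for i, t in enumerate(tracks) if t["length"] >= length_split]
    short_ids = [i for i, t in enumerate(tracks) if t["length"] < length_split]
    for tier in (long_ids, short_ids):
        pairs = sorted(
            (fused_cost(tracks[t], detections[d]), t, d)
            for t in tier
            for d in free
        )
        for cost, t, d in pairs:
            if cost > max_cost:
                break  # remaining pairs are even more expensive
            if t in matches or d not in free:
                continue
            matches[t] = d
            free.discard(d)
    return matches, sorted(free)


if __name__ == "__main__":
    tracks = [
        {"box": (0, 0, 10, 10), "emb": (1.0, 0.0), "length": 10},
        {"box": (20, 20, 30, 30), "emb": (0.0, 1.0), "length": 2},
    ]
    detections = [
        {"box": (21, 21, 31, 31), "emb": (0.0, 1.0)},
        {"box": (1, 1, 11, 11), "emb": (1.0, 0.0)},
        {"box": (100, 100, 110, 110), "emb": (1.0, 1.0)},
    ]
    print(cascade_match(tracks, detections))
```

In this toy setup the longer trajectory is associated before the shorter one, so an established track gets first choice of detections. A production tracker would more likely use an optimal assignment solver than the greedy loop shown here.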
Pages: 17