UTM: A Unified Multiple Object Tracking Model with Identity-Aware Feature Enhancement

被引:25
|
作者
You, Sisi [1 ]
Yao, Hantao [2 ]
Bao, Bing-Kun [1 ]
Xu, Changsheng [2 ,3 ]
机构
[1] Nanjing Univ Posts & Telecommun, Nanjing, Peoples R China
[2] Chinese Acad Sci CASIA, Inst Automat, State Key Lab Multimodal Artificial Intelligence, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Beijing, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
D O I
10.1109/CVPR52729.2023.02095
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, Multiple Object Tracking has achieved great success, which consists of object detection, feature embedding, and identity association. Existing methods apply the three-step or two-step paradigm to generate robust trajectories, where identity association is independent of other components. However, the independent identity association results in the identity-aware knowledge contained in the tracklet not be used to boost the detection and embedding modules. To overcome the limitations of existing methods, we introduce a novel Unified Tracking Model (UTM) to bridge those three components for generating a positive feedback loop with mutual benefits. The key insight of UTM is the Identity-Aware Feature Enhancement (IAFE), which is applied to bridge and benefit these three components by utilizing the identity-aware knowledge to boost detection and embedding. Formally, IAFE contains the Identity-Aware Boosting Attention (IABA) and the Identity-Aware Erasing Attention (IAEA), where IABA enhances the consistent regions between the current frame feature and identity-aware knowledge, and IAEA suppresses the distracted regions in the current frame feature. With better detections and embeddings, higher-quality tracklets can also be generated. Extensive experiments of public and private detections on three benchmarks demonstrate the robustness of UTM.
引用
收藏
页码:21876 / 21886
页数:11
相关论文
共 50 条
  • [1] MICap: A Unified Model for Identity-aware Movie Descriptions
    Raajesh, Haran
    Desanur, Naveen Reddy
    Khan, Zeeshan
    Tapaswi, Makarand
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 14011 - 14021
  • [2] The Solution Path Algorithm for Identity-Aware Multi-Object Tracking
    Yu, Shoou-I
    Meng, Deyu
    Zuo, Wangmeng
    Hauptmann, Alexander
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3871 - 3879
  • [3] Target Identity-aware Network Flow for Online Multiple Target Tracking
    Dehghan, Afshin
    Tian, Yicong
    Torre, Philip. H. S.
    Shah, Mubarak
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 1146 - 1154
  • [4] Enabling Identity-Aware Tracking via Fusion of Visual and Inertial Features
    Tsai, Richard Yi-Chia
    Ke, Hans Ting-Yuan
    Lin, Kate Ching-Ju
    Tseng, Yu-Chee
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 2260 - 2266
  • [5] Box-Aware Feature Enhancement for Single Object Tracking on Point Clouds
    Zheng, Chaoda
    Yan, Xu
    Gao, Jiantao
    Zhao, Weibing
    Zhang, Wei
    Li, Zhen
    Cui, Shuguang
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 13179 - 13188
  • [6] Deep Multi-View Correspondence for Identity-Aware Multi-Target Tracking
    Hanif, Adnan
    Bin Mansoor, Atif
    Imran, Ali Shariq
    2017 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING - TECHNIQUES AND APPLICATIONS (DICTA), 2017, : 497 - 504
  • [7] An Asymmetric Feature Enhancement Network for Multiple Object Tracking of Unmanned Aerial Vehicle
    Ma, Jianbo
    Liu, Dongxu
    Qin, Senlin
    Jia, Ge
    Zhang, Jianlin
    Xu, Zhiyong
    REMOTE SENSING, 2024, 16 (01)
  • [8] Multiple Feature Fusion for Object Tracking
    Zhou, Yu
    Rao, Cong
    Lu, Qin
    Bai, Xiang
    Liu, Wenyu
    INTELLIGENT SCIENCE AND INTELLIGENT DATA ENGINEERING, ISCIDE 2011, 2012, 7202 : 145 - 152
  • [9] FSNet: An Identity-Aware Generative Model for Image-Based Face Swapping
    Natsume, Ryota
    Yatagawa, Tatsuya
    Morishima, Shigeo
    COMPUTER VISION - ACCV 2018, PT VI, 2019, 11366 : 117 - 132
  • [10] Gaze Differences in Multiple Object Tracking Versus Multiple Identity Tracking
    Lukavsky, Jiri
    Dechterenko, Filip
    PERCEPTION, 2019, 48 : 107 - 108