Transformer for multiple object tracking: Exploring locality to vision

被引:7
|
作者
Wu, Shan [1 ]
Hadachi, Amnir [1 ]
Lu, Chaoru [2 ]
Vivet, Damien [3 ]
机构
[1] Univ Tartu, Inst Comp Sci, ITS Lab, Narva mnt 18, EE-51009 Tartu, Estonia
[2] Oslo Metropolitan Univ, Ctr Metropolitan Digitalizat & Smartizat MetSmart, Dept Built Environm, Pilestredet 46, N-0167 Oslo, Norway
[3] Univ Toulouse, ISAE SUPAERO, 10 Ave Edouard Belin, F-31400 Toulouse, France
关键词
Multi-object tracking; Transformer; Deep learning; Locality to vision;
D O I
10.1016/j.patrec.2023.04.016
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-object tracking (MOT) is a critical task in various domains, such as traffic analysis, surveillance, and autonomous vehicles. The joint-detection-and-tracking paradigm has been extensively researched, which is faster and more convenient for training and deploying over the classic tracking-by-detection paradigm while achieving state-of-the-art performance. This paper explores the possibilities of enhancing the MOT system by leveraging the prevailing convolutional neural network (CNN) and a novel vision transformer technique Locality. There are several deficiencies in the transformer adopted for computer vision tasks. While the transformers are good at modeling global information for a long embedding, the locality mech-anism, which learns the local features, is missing. This could lead to negligence of small objects, which may cause security issues. We combine the TransTrack MOT system with the locality mechanism in-spired by LocalViT and find that the locality-enhanced system outperforms the baseline TransTrack by 5.3% MOTA on the MOT17 dataset. (c) 2023 Elsevier B.V. All rights reserved.
引用
收藏
页码:70 / 76
页数:7
相关论文
共 50 条
  • [41] Multiple Planar Object Tracking
    Zhang, Zhicheng
    Liu, Shengzhe
    Yang, Jufeng
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 23403 - 23413
  • [42] Object Tracking Using Computer Vision: A Review
    Kadam, Pushkar
    Fang, Gu
    Zou, Ju Jia
    COMPUTERS, 2024, 13 (06)
  • [43] Vision Based Object Tracking & Firing System
    Powale, Amar G.
    Khade, Vishwajit D.
    Kamble, Akash L.
    Rajarapollu, Prachi Rohit
    2018 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION (ICCUBEA), 2018,
  • [44] A Fast Object Tracking Approach In Vision Application
    Zhang, Xiaojing
    Sha, Chenming
    Yue, Yajie
    APPLIED SCIENCE, MATERIALS SCIENCE AND INFORMATION TECHNOLOGIES IN INDUSTRY, 2014, 513-517 : 3265 - 3268
  • [45] Object Tracking and Positioning Based on Stereo Vision
    Zhou, Zhongwei
    Xu, Min
    Fu, Wei
    Zhao, Jizeng
    SENSORS, MEASUREMENT AND INTELLIGENT MATERIALS, PTS 1-4, 2013, 303-306 : 313 - +
  • [46] Visual Object Tracking in First Person Vision
    Matteo Dunnhofer
    Antonino Furnari
    Giovanni Maria Farinella
    Christian Micheloni
    International Journal of Computer Vision, 2023, 131 : 259 - 283
  • [47] Robot vision for autonomous object learning and tracking
    Sanfeliu, A
    PROGRESS IN PATTERN RECOGNITION, SPEECH AND IMAGE ANALYSIS, 2003, 2905 : 17 - 28
  • [48] Object tracking in a stereo and infrared vision system
    Colantonio, S.
    Benvenuti, M.
    Di Bono, M. G.
    Pieri, G.
    Salvetti, O.
    INFRARED PHYSICS & TECHNOLOGY, 2007, 49 (03) : 266 - 271
  • [49] Is First Person Vision Challenging for Object Tracking?
    Dunnhofer, Matteo
    Furnari, Antonino
    Farinella, Giovanni Maria
    Micheloni, Christian
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 2698 - 2710
  • [50] Dynamic Object Detection and Tracking in Vision SLAM
    Liu H.
    Niu L.
    Deng Y.
    Applied Mathematics and Nonlinear Sciences, 2024, 9 (01)