Temporal Attention for Robust Multiple Object Pose Tracking

被引:0
|
作者
Li, Zhongluo [1 ]
Yoshimoto, Junichiro [2 ]
Ikeda, Kazushi [1 ]
机构
[1] Nara Inst Sci & Technol, Nara 6300192, Japan
[2] Fujita Hlth Univ, Sch Med, Toyoake, Aichi 4701192, Japan
关键词
Pose Estimation; Vision Transformer; Temporal Information;
D O I
10.1007/978-981-99-8070-3_42
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Estimating the pose of multiple objects has improved substantially since deep learning became widely used. However, the performance deteriorates when the objects are highly similar in appearance or when occlusions are present. This issue is usually addressed by leveraging temporal information that takes previous frames as priors to improve the robustness of estimation. Existing methods are either computationally expensive by using multiple frames, or are inefficiently integrated with ad hoc procedures. In this paper, we perform computationally efficient object association between two consecutive frames via attention through a video sequence. Furthermore, instead of heatmap-based approaches, we adopt a coordinate classification strategy that excludes post-processing, where the network is built in an end-to-end fashion. Experiments on real data show that our approach achieves state-of-the-art results on Pose-Track datasets.
引用
收藏
页码:551 / 561
页数:11
相关论文
共 50 条
  • [21] Multiple Object Detection in 360° Videos for Robust Tracking
    Kumar, V. Vineeth
    Naik, Shanthika
    Sarvani, Polisetty L.
    Pattanshetti, Shreya M.
    Mudenagudi, Uma
    Maralappanavar, Meena
    Patil, Priyadarshini
    Tabib, Ramesh A.
    Vandrotti, Basavaraja S.
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2019, PT II, 2019, 11942 : 499 - 506
  • [22] Robust Object Tracking with Online Multiple Instance Learning
    Babenko, Boris
    Yang, Ming-Hsuan
    Belongie, Serge
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (08) : 1619 - 1632
  • [23] Robust Object Tracking based on Temporal and Spatial Deep Networks
    Teng, Zhu
    Xing, Junliang
    Wang, Qiang
    Lang, Congyan
    Feng, Songhe
    Jin, Yi
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1153 - 1162
  • [24] Deep Spatial and Temporal Network for Robust Visual Object Tracking
    Teng, Zhu
    Xing, Junliang
    Wang, Qiang
    Zhang, Baopeng
    Fan, Jianping
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 1762 - 1775
  • [25] Temporal Attention Control System for Multiple Objects Tracking
    Zang, Xuelian
    Brscic, Drazen
    Wang, Wei
    Kuehnlenz, Kolja
    PROCEEDINGS OF THE 31ST CHINESE CONTROL CONFERENCE, 2012, : 4836 - 4841
  • [26] Robust hierarchical multiple hypothesis tracker for multiple-object tracking
    Zulkifley, Mohd Asyraf
    Moran, Bill
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (16) : 12319 - 12331
  • [27] Structural pixel-wise target attention for robust object tracking
    Zhang, Huanlong
    Cheng, Liyun
    Zhang, Jianwei
    Huang, Wanwei
    Liu, Xiulei
    Yu, Junyang
    DIGITAL SIGNAL PROCESSING, 2021, 117
  • [28] Multiple object tracking: Anticipatory attention doesn't "bounce"
    Atsma, Jeroen
    Koning, Arno
    van Lier, Rob
    JOURNAL OF VISION, 2012, 12 (13):
  • [29] Multiple Object Tracking With Attention to Appearance, Structure, Motion and Size
    Karunasekera, Hasith
    Wang, Han
    Zhang, Handuo
    IEEE ACCESS, 2019, 7 : 104423 - 104434
  • [30] Multiple object tracking: effects of goal valence on allocation of attention
    Frielink-Loing, Andrea
    Koning, Arno
    van Lier, Rob
    PERCEPTION, 2022, 51 : 12 - 12