Temporal Attention for Robust Multiple Object Pose Tracking

被引:0
|
作者
Li, Zhongluo [1 ]
Yoshimoto, Junichiro [2 ]
Ikeda, Kazushi [1 ]
机构
[1] Nara Inst Sci & Technol, Nara 6300192, Japan
[2] Fujita Hlth Univ, Sch Med, Toyoake, Aichi 4701192, Japan
关键词
Pose Estimation; Vision Transformer; Temporal Information;
D O I
10.1007/978-981-99-8070-3_42
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Estimating the pose of multiple objects has improved substantially since deep learning became widely used. However, the performance deteriorates when the objects are highly similar in appearance or when occlusions are present. This issue is usually addressed by leveraging temporal information that takes previous frames as priors to improve the robustness of estimation. Existing methods are either computationally expensive by using multiple frames, or are inefficiently integrated with ad hoc procedures. In this paper, we perform computationally efficient object association between two consecutive frames via attention through a video sequence. Furthermore, instead of heatmap-based approaches, we adopt a coordinate classification strategy that excludes post-processing, where the network is built in an end-to-end fashion. Experiments on real data show that our approach achieves state-of-the-art results on Pose-Track datasets.
引用
收藏
页码:551 / 561
页数:11
相关论文
共 50 条
  • [1] Remapping attention in multiple object tracking
    Howe, Piers D. L.
    Drew, Trafton
    Pinto, Yair
    Horowitz, Todd S.
    VISION RESEARCH, 2011, 51 (05) : 489 - 495
  • [2] Modeling of Multiple Spatial-Temporal Relations for Robust Visual Object Tracking
    Wang, Shilei
    Wang, Zhenhua
    Sun, Qianqian
    Cheng, Gong
    Ning, Jifeng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 5073 - 5085
  • [3] Robust monocular object pose tracking for large pose shift using 2D tracking
    Qiufu Wang
    Jiexin Zhou
    Zhang Li
    Xiaoliang Sun
    Qifeng Yu
    Visual Intelligence, 1 (1):
  • [4] Robust Object Pose Tracking for Augmented Reality Guidance and Teleoperation
    Black, David
    Salcudean, Septimiu
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 15
  • [5] Tracking object poses in the context of robust body pose estimates
    Darby, John
    Li, Baihua
    Costen, Nicholas
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2014, 127 : 57 - 72
  • [6] Reallocating attention during multiple object tracking
    Justin M. Ericson
    James C. Christensen
    Attention, Perception, & Psychophysics, 2012, 74 : 831 - 840
  • [7] Reallocating attention during multiple object tracking
    Ericson, Justin M.
    Christensen, James C.
    ATTENTION PERCEPTION & PSYCHOPHYSICS, 2012, 74 (05) : 831 - 840
  • [8] Visual Attention Is Required for Multiple Object Tracking
    Tran, Annie
    Hoffman, James E.
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 2016, 42 (12) : 2103 - 2114
  • [9] A robust object tracking method under pose variation and partial occlusion
    Hotta, Kazuhiro
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2006, E89D (07): : 2132 - 2141
  • [10] Robust coverless video steganography based on pose estimation and object tracking
    Li, Nan
    Qin, Jiaohua
    Xiang, Xuyu
    Tan, Yun
    JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2024, 87