Towards Real-World Visual Tracking With Temporal Contexts

被引:23
|
作者
Cao, Ziang [1 ]
Huang, Ziyuan [2 ]
Pan, Liang [1 ]
Zhang, Shiwei [3 ]
Liu, Ziwei [1 ]
Fu, Changhong [4 ]
机构
[1] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
[2] Natl Univ Singapore, Dept Mech Engn, Singapore 119077, Singapore
[3] DAMO Acad, Alibaba Grp, Hangzhou 310052, Zhejiang, Peoples R China
[4] Tongji Univ, Sch Mech Engn, Shanghai 201804, Peoples R China
基金
上海市自然科学基金; 中国国家自然科学基金;
关键词
Latency-aware evaluations; real-world tests; temporal contexts; two-level framework; visual tracking; PLUS PLUS; NETWORK;
D O I
10.1109/TPAMI.2023.3307174
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Visual tracking has made significant improvements in the past few decades. Most existing state-of-the-art trackers 1) merely aim for performance in ideal conditions while overlooking the real-world conditions; 2) adopt the tracking-by-detection paradigm, neglecting rich temporal contexts; 3) only integrate the temporal information into the template, where temporal contexts among consecutive frames are far from being fully utilized. To handle those problems, we propose a two-level framework (TCTrack) that can exploit temporal contexts efficiently. Based on it, we propose a stronger version for real-world visual tracking, i.e., TCTrack++. It boils down to two levels: features and similarity maps. Specifically, for feature extraction, we propose an attention-based temporally adaptive convolution to enhance the spatial features using temporal information, which is achieved by dynamically calibrating the convolution weights. For similarity map refinement, we introduce an adaptive temporal transformer to encode the temporal knowledge efficiently and decode it for the accurate refinement of the similarity map. To further improve the performance, we additionally introduce a curriculum learning strategy. Also, we adopt online evaluation to measure performance in real-world conditions. Exhaustive experiments on 8 well-known benchmarks demonstrate the superiority of TCTrack++. Real-world tests directly verify that TCTrack++ can be readily used in real-world applications.
引用
收藏
页码:15834 / 15849
页数:16
相关论文
共 50 条
  • [21] Benchmarking framework for anomaly localization: Towards real-world deployment of automated visual inspection
    Gangopadhyay, Tryambak
    Hong, Sungmin
    Roy, Sujoy
    Shah, Yash
    Cheong, Lin Lee
    JOURNAL OF MANUFACTURING SYSTEMS, 2023, 69 : 64 - 75
  • [22] Real-world object categories and scene contexts conjointly structure statistical learning for the guidance of visual search
    Kershner, Ariel M.
    Hollingworth, Andrew
    ATTENTION PERCEPTION & PSYCHOPHYSICS, 2022, 84 (04) : 1304 - 1316
  • [23] Real-world object categories and scene contexts conjointly structure statistical learning for the guidance of visual search
    Ariel M. Kershner
    Andrew Hollingworth
    Attention, Perception, & Psychophysics, 2022, 84 : 1304 - 1316
  • [24] Temporal Paths in Real-World Sensor Networks
    Bollen, Erik
    Kuijpers, Bart
    Soliani, Valeria
    Vaisman, Alejandro
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2024, 13 (02)
  • [25] Modelling Visual Complexity of Real-World Scenes
    Nagle, Fintan S.
    Lavie, Nilli
    PERCEPTION, 2019, 48 : 77 - 77
  • [26] DETERMINANTS OF VISUAL ATTENTION IN REAL-WORLD SCENES
    LEWIS, MS
    PERCEPTUAL AND MOTOR SKILLS, 1975, 41 (02) : 411 - 416
  • [27] Canonical Visual Size for Real-World Objects
    Konkle, Talia
    Oliva, Aude
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 2011, 37 (01) : 23 - 37
  • [28] An Integrated Robust Approach for Fast Face Tracking in Noisy Real-World Videos with Visual Constraints
    Ranganatha, S.
    Gowramma, Y. P.
    2017 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2017, : 772 - 776
  • [29] An Integrated Robust Approach for Fast Face Tracking in Noisy Real-World Videos with Visual Constraints
    Ranganatha, S.
    Gowramma, Y. P.
    PROCEEDINGS OF 2017 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL (I2C2), 2017,
  • [30] Students’ Application of Concavity and Inflection Points to Real-World Contexts
    Steven R. Jones
    International Journal of Science and Mathematics Education, 2019, 17 : 523 - 544