Visual object tracking by using ranking loss and spatial-temporal features

被引:0
|
作者
Saribas, Hasan [2 ]
Cevikalp, Hakan [1 ]
Kahvecioglu, Sinem [3 ]
机构
[1] Eskisehir Osmagazi Univ, Machine Learning & Comp Vis Lab, Elect & Elect Engn, Eskisehir, Turkiye
[2] Huawei Turkey R&D Ctr, Istanbul, Turkiye
[3] Eskisehir Tech Univ, Fac Aeronaut & Astronaut, Dept Avion, Eskisehir, Turkiye
关键词
Object tracking; Ranking loss; Two-stream network; Temporal features;
D O I
10.1007/s00138-023-01381-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces a novel two-stream deep neural network tracker for robust object tracking. In the proposed network, we use both spatial and temporal features and employ a novel loss function called ranking loss. The class confidence scores coming from the two-stream (spatial and temporal) networks are fused at the end for final decision. Using ranking loss in the proposed tracker enforces the networks to learn giving higher scores to the candidate regions that frame the target object better. As a result, the tracker returns more precise bounding boxes framing the target object, and the risk of tracking error accumulation and drifts are largely mitigated when the proposed network architecture is used with a simple yet effective model update rule. We conducted extensive experiments on six different benchmarks, including OTB-2015, VOT-2017, TC-128, DTB70, NfS and UAV123. Our proposed tracker achieves the state-of-the-art results on the most of the tested challenging tracking datasets. Especially, our results on the OTB-2015, DTB70, NfS and TC-128 datasets are very promising.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Object Tracking over Multiple Uncalibrated Cameras Using Visual, Spatial and Temporal Similarities
    Wedge, Daniel
    Scott, Adele F.
    Ma, Zhonghua
    Vendrig, Jeroen
    ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, PT II, 2010, 6475 : 167 - 178
  • [42] Short Boundary Detection Using Spatial-Temporal Features
    Ali, Muhammad
    Adnan, Awais
    INFORMATION TECHNOLOGY: NEW GENERATIONS, 2016, 448 : 971 - 981
  • [43] Video classification using spatial-temporal features and PCA
    Xu, LQ
    Li, YM
    2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL III, PROCEEDINGS, 2003, : 485 - 488
  • [44] Online object tracking based on CNN with spatial-temporal saliency guided sampling
    Zhang, Peng
    Zhuo, Tao
    Huang, Wei
    Chen, Kangli
    Kankanhalli, Mohan
    NEUROCOMPUTING, 2017, 257 : 115 - 127
  • [45] Spatial-temporal single object tracking with three-way decision theory
    Wang, Ziye
    Miao, Duoqian
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2023, 154 : 38 - 47
  • [46] Multi-Object Tracking With Spatial-Temporal Topology-Based Detector
    You, Sisi
    Yao, Hantao
    Xu, Changsheng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (05) : 3023 - 3035
  • [47] Joint Spatial-Temporal Optimization for Stereo 3D Object Tracking
    Li, Peiliang
    Shi, Jieqi
    Shen, Shaojie
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 6876 - 6885
  • [48] Visual tracking based on a unified tracking-and-detection framework with spatial-temporal consistency filtering
    Fang, Yang
    Ka, Seunghyun
    Jo, Geun-Sik
    COMPUTERS & ELECTRICAL ENGINEERING, 2019, 80
  • [49] TSTrack: A Robust Object Tracking Framework Integrated Temporal and Spatial Features
    Mu, Qi
    Wang, Xueqian
    He, Zuohui
    Li, Zhanli
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT XII, 2025, 15042 : 344 - 360
  • [50] Online Multi-Object Tracking Using CNN-based Single Object Tracker with Spatial-Temporal Attention Mechanism
    Chu, Qi
    Ouyang, Wanli
    Li, Hongsheng
    Wang, Xiaogang
    Liu, Bin
    Yu, Nenghai
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4846 - 4855