UCT: Learning Unified Convolutional Networks for Real-time Visual Tracking

被引:68
|
作者
Zhu, Zheng [1 ,2 ]
Huang, Guan [3 ]
Zou, Wei [1 ,2 ]
Du, Dalong [3 ]
Huang, Chang [3 ]
机构
[1] Chinese Acad Sci, Inst Automat, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
[3] Horizon Robot Inc, Beijing, Peoples R China
基金
国家高技术研究发展计划(863计划); 中国国家自然科学基金;
关键词
OBJECT TRACKING;
D O I
10.1109/ICCVW.2017.231
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Convolutional neural networks (CNN) based tracking approaches have shown favorable performance in recent benchmarks. Nonetheless, the chosen CNN features are always pre-trained in different task and individual components in tracking systems are learned separately, thus the achieved tracking performance may be suboptimal. Besides, most of these trackers are not designed towards realtime applications because of their time-consuming feature extraction and complex optimization details. In this paper, we propose an end-to-end framework to learn the convolutional features and perform the tracking process simultaneously, namely, a unified convolutional tracker (UCT). Specifically, The UCT treats feature extractor and tracking process (ridge regression) both as convolution operation and trains them jointly, enabling learned CNN features are tightly coupled to tracking process. In online tracking, an efficient updating method is proposed by introducing peak-versus-noise ratio (PNR) criterion, and scale changes are handled efficiently by incorporating a scale branch into network. The proposed approach results in superior tracking performance, while maintaining real-time speed. The standard UCT and UCT-Lite can track generic objects at 41 FPS and 154 FPS without further optimization, respectively. Experiments are performed on four challenging benchmark tracking datasets: OTB2013, OTB2015, VOT2014 and VOT2015, and our method achieves state-ofthe-art results on these benchmarks compared with other real-time trackers.
引用
收藏
页码:1973 / 1982
页数:10
相关论文
共 50 条
  • [21] Real-Time Visual Tracking: Promoting the Robustness of Correlation Filter Learning
    Sui, Yao
    Zhang, Ziming
    Wang, Guanghui
    Tang, Yafei
    Zhang, Li
    COMPUTER VISION - ECCV 2016, PT VIII, 2016, 9912 : 662 - 678
  • [22] Simple Real-time Multi-face Tracking based on Convolutional Neural Networks
    Li, Xile
    Lang, Jochen
    2018 15TH CONFERENCE ON COMPUTER AND ROBOT VISION (CRV), 2018, : 337 - 344
  • [23] Visual Sensor Networks for Indoor Real-Time Surveillance and Tracking of Multiple Targets
    Giordano, Jacopo
    Lazzaretto, Margherita
    Michieletto, Giulia
    Cenedese, Angelo
    SENSORS, 2022, 22 (07)
  • [24] Real-time active visual tracking system
    Ribaric, S
    Adrinek, G
    Segvic, S
    MELECON 2004: PROCEEDINGS OF THE 12TH IEEE MEDITERRANEAN ELECTROTECHNICAL CONFERENCE, VOLS 1-3, 2004, : 231 - 234
  • [25] Robust Real-Time Tracking for Visual Surveillance
    David Thirde
    Mark Borg
    Josep Aguilera
    Horst Wildenauer
    James Ferryman
    Martin Kampel
    EURASIP Journal on Advances in Signal Processing, 2007
  • [26] Real-time visual tracking of complex structures
    Drummond, T
    Cipolla, R
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (07) : 932 - 946
  • [27] Robust real-time tracking for visual surveillance
    Thirde, David
    Borg, Mark
    Aguilera, Josep
    Wildenauer, Horst
    Ferryman, James
    Kampel, Martin
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2007, 2007 (1)
  • [28] ADAPTIVE BACKGROUND FOR REAL-TIME VISUAL TRACKING
    Li, He
    Yang, Daiqin
    Chen, Zhenzhong
    2016 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2016,
  • [29] Drowsiness detection in real-time via convolutional neural networks and transfer learning
    Salem, Dina
    Waleed, Mohamed
    Journal of Engineering and Applied Science, 2024, 71 (01):
  • [30] Real-Time Target Detection and Recognition with Deep Convolutional Networks for Intelligent Visual Surveillance
    Xu, Wen
    He, Jing
    Zhang, Hao Lan
    Mao, Bo
    Cao, Jie
    2016 IEEE/ACM 9TH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING (UCC), 2016, : 321 - 326