VITAL: VIsual Tracking via Adversarial Learning

被引:450
|
作者
Song, Yibing [1 ]
Ma, Chao [2 ]
Wu, Xiaohe [3 ]
Gong, Lijun [4 ]
Bao, Linchao [1 ]
Zuo, Wangmeng [3 ]
Shen, Chunhua [2 ]
Lau, Rynson W. H. [5 ]
Yang, Ming-Hsuan [6 ]
机构
[1] Tencent AI Lab, Shenzhen, Peoples R China
[2] Univ Adelaide, Adelaide, SA, Australia
[3] Harbin Inst Technol, Harbin, Heilongjiang, Peoples R China
[4] Tencent, Shenzhen, Peoples R China
[5] City Univ Hong Kong, Hong Kong, Peoples R China
[6] Univ Calif Merced, Merced, CA USA
关键词
D O I
10.1109/CVPR.2018.00937
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The tracking-by-detection framework consists of two stages, i.e., drawing samples around the target object in the first stage and classifying each sample as the target object or as background in the second stage. The performance of existing trackers using deep classification networks is limited by two aspects. First, the positive samples in each frame are highly spatially overlapped, and they fail to capture rich appearance variations. Second, there exists extreme class imbalance between positive and negative samples. This paper presents the VITAL algorithm to address these two problems via adversarial learning. To augment positive samples, we use a generative network to randomly generate masks, which are applied to adaptively dropout input features to capture a variety of appearance changes. With the use of adversarial learning, our network identifies the mask that maintains the most robust features of the target objects over a long temporal span. In addition, to handle the issue of class imbalance, we propose a high-order cost sensitive loss to decrease the effect of easy negative samples to facilitate training the classification network. Extensive experiments on benchmark datasets demonstrate that the proposed tracker performs favorably against state-of-the-art approaches.
引用
收藏
页码:8990 / 8999
页数:10
相关论文
共 50 条
  • [31] Online Decision Based Visual Tracking via Reinforcement Learning
    Song, Ke
    Zhang, Wei
    Song, Ran
    Li, Yibin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [32] Visual tracking via efficient kernel discriminant subspace learning
    Shen, CH
    van den Hengel, A
    Brooks, MJ
    2005 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), VOLS 1-5, 2005, : 1605 - 1608
  • [33] Visual tracking via sparse representation and online dictionary learning
    Wu, Zhenyang, 1600, Springer Verlag (8703):
  • [34] Robust Visual Tracking via Binocular Consistent Sparse Learning
    Ma, Ziang
    Xiang, Zhiyu
    NEURAL PROCESSING LETTERS, 2017, 46 (02) : 627 - 642
  • [35] Robust Visual Tracking via Weighted Extreme Learning Machine
    Cao, Yi
    Ji, Hongbing
    Zhang, Wenbo
    Yin, Pengfei
    PROCEEDINGS OF 2016 SIXTH INTERNATIONAL CONFERENCE ON INSTRUMENTATION & MEASUREMENT, COMPUTER, COMMUNICATION AND CONTROL (IMCCC 2016), 2016, : 888 - 891
  • [36] Generating visual representations for zero-shot learning via adversarial learning and variational autoencoders
    Gull, Muqaddas
    Arif, Omar
    INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 2023, 52 (05) : 636 - 651
  • [37] Learning attention for object tracking with adversarial learning network
    Xu Cheng
    Chen Song
    Yongxiang Gu
    Beijing Chen
    EURASIP Journal on Image and Video Processing, 2020
  • [38] Learning attention for object tracking with adversarial learning network
    Cheng, Xu
    Song, Chen
    Gu, Yongxiang
    Chen, Beijing
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2020, 2020 (01)
  • [39] Learning Context Restrained Correlation Tracking Filters via Adversarial Negative Instance Generation
    Huang, Bo
    Xu, Tingfa
    Li, Jianan
    Luo, Fei
    Qin, Qingwang
    Chen, Junjie
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (09) : 6132 - 6145
  • [40] Physical Adversarial Textures That Fool Visual Object Tracking
    Wiyatno, Rey Reza
    Xu, Anqi
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 4821 - 4830