ACSiamRPN: Adaptive Context Sampling for Visual Object Tracking

被引:4
|
作者
Qin, Xiaofei [1 ,2 ,3 ]
Zhang, Yipeng [4 ]
Chang, Hang [5 ]
Lu, Hao [6 ]
Zhang, Xuedian [1 ,2 ,3 ,7 ]
机构
[1] Univ Shanghai Sci & Technol, Sch Opt Elect & Comp Engn, Shanghai 200093, Peoples R China
[2] Shanghai Key Lab Contemporary Opt Syst, Shanghai 200093, Peoples R China
[3] Minist Educ, Key Lab Biomed Opt Technol & Devices, Shanghai 200093, Peoples R China
[4] Univ Shanghai Sci & Technol, Sch Mech Engn, Shanghai 200093, Peoples R China
[5] Lawrence Berkeley Natl Lab, Berkeley, CA 94720 USA
[6] Guangxi Yuchai Machinery Co Ltd, Nanning 530007, Peoples R China
[7] Tongji Univ, Shanghai Inst Intelligent Sci & Technol, Shanghai 200092, Peoples R China
关键词
visual object tracking; SiamRPN; global context; selective kernel convolution; SIAMESE NETWORKS;
D O I
10.3390/electronics9091528
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In visual object tracking fields, the Siamese network tracker, based on the region proposal network (SiamRPN), has achieved promising tracking effects, both in speed and accuracy. However, it did not consider the relationship and differences between the long-range context information of various objects. In this paper, we add a global context block (GC block), which is lightweight and can effectively model long-range dependency, to the Siamese network part of SiamRPN so that the object tracker can better understand the tracking scene. At the same time, we propose a novel convolution module, called a cropping-inside selective kernel block (CiSK block), based on selective kernel convolution (SK convolution, a module proposed in selective kernel networks) and use it in the region proposal network (RPN) part of SiamRPN, which can adaptively adjust the size of the receptive field for different types of objects. We make two improvements to SK convolution in the CiSK block. The first improvement is that in the fusion step of SK convolution, we use both global average pooling (GAP) and global maximum pooling (GMP) to enhance global information embedding. The second improvement is that after the selection step of SK convolution, we crop out the outermost pixels of features to reduce the impact of padding operations. The experiment results show that on the OTB100 benchmark, we achieved an accuracy of 0.857 and a success rate of 0.643. On the VOT2016 and VOT2019 benchmarks, we achieved expected average overlap (EAO) scores of 0.394 and 0.240, respectively.
引用
收藏
页码:1 / 13
页数:13
相关论文
共 50 条
  • [1] ADAPTIVE SAMPLING FOR BAYESIAN VISUAL TRACKING
    Kawamoto, Kazuhiko
    2008 WORLD AUTOMATION CONGRESS PROCEEDINGS, VOLS 1-3, 2008, : 203 - 208
  • [2] Adaptive Context-Aware Discriminative Correlation Filters for Robust Visual Object Tracking
    Xu, Tianyang
    Feng, Zhen-Hua
    Wu, Xiao-Jun
    Kittler, Josef
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 2514 - 2520
  • [3] An Adaptive Approach for Validation in Visual Object Tracking
    Shinora, Jasper Princy W.
    Agilandeeswari, L.
    Muralibabu, K.
    SECOND INTERNATIONAL SYMPOSIUM ON COMPUTER VISION AND THE INTERNET (VISIONNET'15), 2015, 58 : 478 - 485
  • [4] ADAPTIVE APPEARANCE LEARNING FOR VISUAL OBJECT TRACKING
    Khan, Zulfiqar Hasan
    Gu, Irene Yu-Hua
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 1413 - 1416
  • [5] ADSTrack: adaptive dynamic sampling for visual tracking
    Wang, Zhenhai
    Yuan, Lutao
    Ren, Ying
    Zhang, Sen
    Tian, Hongyu
    COMPLEX & INTELLIGENT SYSTEMS, 2025, 11 (01)
  • [6] Adaptive feature fusion for visual object tracking
    Zhao, Shaochuan
    Xu, Tianyang
    Wu, Xiao-Jun
    Zhu, Xue-Feng
    PATTERN RECOGNITION, 2021, 111
  • [7] Adaptive chaotic sampling particle filter to handle occlusion and fast motion in visual object tracking
    Firouznia, Marjan
    Koupaei, Javad Alikhani
    Faez, Karim
    Trunfio, Giuseppe A.
    Amindavar, Hamidreza
    DIGITAL SIGNAL PROCESSING, 2023, 134
  • [8] Adaptive particle sampling and adaptive appearance for multiple video object tracking
    Cheng, Hsu-Yung
    Hwang, Jenq-Neng
    SIGNAL PROCESSING, 2009, 89 (09) : 1844 - 1849
  • [9] Adaptive Hamiltonian MCMC sampling for robust visual tracking
    Wang, Fasheng
    Li, Xucheng
    Lu, Mingyu
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (11) : 13087 - 13106
  • [10] Adaptive Hamiltonian MCMC sampling for robust visual tracking
    Fasheng Wang
    Xucheng Li
    Mingyu Lu
    Multimedia Tools and Applications, 2017, 76 : 13087 - 13106