Adaptive Information Selection for Infrared Object Tracking with Variable Scale Correlation Filter

被引:1
|
作者
Sun Mengyu [1 ]
Wang Peng [2 ]
Xu Junqi [1 ]
Li Xiaoyan [2 ]
Gao Hui [2 ]
Di Ruohai [2 ]
机构
[1] Xian Technol Univ, Sch Optoelect Engn, Xian 710021, Peoples R China
[2] Xian Technol Univ, Elect Informat Engn, Xian 710021, Peoples R China
基金
中国国家自然科学基金;
关键词
Infrared night vision technology; Target tracking; Correlation filter; Infrared image processing; Variable scale filter; Sparse representation;
D O I
10.3788/gzxb20235212.1210003
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
While there is less information contained in the infrared image, it still has problems with image blur and noise. It caused many difficulties in infrared visual target tracking. The discriminative correlation filter is a reliable method for tracking objects that can be trained online on an embedded hardware platform. But the variable scale of the object cannot be efficiently solved. We proposed an adaptive information selection for infrared object tracking with a variable scale correlation filter to solve the problem mentioned earlier. The proposed algorithm is divided into three parts, one for feature extractor, one for location filter, and one for scale filter. Three features are extracted by the feature extractor to represent the object. The insensitive feature and the histogram of gradient are chosen as the based features. The insensitive feature represents the intensity of the object at its current position in the infrared image. The information extracted from the gradient histogram, which contains more information about the shape of the target, represents the regions with sudden changes in the infrared image such as the edge and corner of the target. To enhance the object's representation, we generate a new histogram of gradients based on the insensitive feature, which has various representations of the object. After that, the position filter will receive the sample frame feature information. We train three position filters depend on three types of feature, respectively. In the training phase of the position filter, temporal regularization, spatial regularization and spatial information selection are added during optimization. The temporal regularization is used to enforce correlation filter more similarity to the last coefficients, which makes correlation filter more fit on the variation of the object. The spatial regularization is limited the correlation filter concentrate on the region of the object when training. For reducing redundancy of the object's information and speeding up the training process, we set different thresholds for coefficients. When the coefficients are lower than the threshold, we set it to zero to reduce the number of coefficients. The spatial coefficient of each channel is treated individually. Then, the position filter will be convolved with the feature of the current frame. We compute three response maps according to the different features. The response map corresponding to various features will be weighted and sum together to find the max value position which is the estimated object position. The weights of the various features are set to the hyparameter. The weight of the new feature is set smaller than the other base features, because of the larger receptive field. For better representation of the object's scale change, we extract the variable scale sample from the estimated object position. The ratio of the height and width of the variable scale sample is different. We flatten the features of all variable scale samples and concatenate them together along with the spatial dimension as the scale training features. The corresponding scale factor is obtained by convolution according to the obtained scale filter coefficient, so as to determine the boundary ratio and obtain the boundary of the object. We choose LSOTB-TIR dataset and PTB-TIR dataset as the test dataset. The proposed algorithm can reach 34.85 frames per seconds on the central processing unit platform. On the LSOTB-TIR data set, the accuracy and success rate of the proposed algorithm reached 71.3% and 59. 4% on the LSOTB-TIR dataset, and 80.4% and 61.1% on the PTB-TIR dataset, respectively. We analysis each component in our algorithm and find the spatial selection contributes more for precision. We also visualize the result of our method, which demonstrates our method can track the object consistently and can adapt the change of boundary ratio.
引用
收藏
页数:13
相关论文
共 33 条
  • [1] Context-Aware Correlation Filter Learning Toward Peak Strength for Visual Tracking
    Bouraffa, Tayssir
    Yan, Liping
    Feng, Zihang
    Xiao, Bo
    Wu, Q. M. Jonathan
    Xia, Yuanqing
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (10) : 5105 - 5115
  • [2] HiFT: Hierarchical Feature Transformer for Aerial Tracking
    Cao, Ziang
    Fu, Changhong
    Ye, Junjie
    Li, Bowen
    Li, Yiming
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 15437 - 15446
  • [3] Histograms of oriented gradients for human detection
    Dalal, N
    Triggs, B
    [J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 886 - 893
  • [4] Danelljan M., 2014, BRIT MACHINE VISION, P1
  • [5] ECO: Efficient Convolution Operators for Tracking
    Danelljan, Martin
    Bhat, Goutam
    Khan, Fahad Shahbaz
    Felsberg, Michael
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6931 - 6939
  • [6] Learning Spatially Regularized Correlation Filters for Visual Tracking
    Danelljan, Martin
    Hager, Gustav
    Khan, Fahad Shahbaz
    Felsberg, Michael
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 4310 - 4318
  • [7] AiATrack: Attention in Attention for Transformer Visual Tracking
    Gao, Shenyuan
    Zhou, Chunluan
    Ma, Chao
    Wang, Xinggang
    Yuan, Junsong
    [J]. COMPUTER VISION, ECCV 2022, PT XXII, 2022, 13682 : 146 - 164
  • [8] Gundogdu Erhan, 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), P1, DOI 10.1109/CVPRW.2015.7301290
  • [9] Evaluation of Feature Channels for Correlation-Filter-Based Visual Object Tracking in Infrared Spectrum
    Gundogdu, Erhan
    Koc, Aykut
    Solmaz, Berkan
    Hammoud, Riad I.
    Alatan, A. Aydin
    [J]. PROCEEDINGS OF 29TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, (CVPRW 2016), 2016, : 290 - 298
  • [10] HUANG Yueping, 2022, Digital Signal Processing