Adaptive Information Selection for Infrared Object Tracking with Variable Scale Correlation Filter

Cited by: 1

Authors:

Sun Mengyu ^{[1]}

Wang Peng ^{[2]}

Xu Junqi ^{[1]}

Li Xiaoyan ^{[2]}

Gao Hui ^{[2]}

Di Ruohai ^{[2]}

Affiliations:

[1] Xian Technol Univ, Sch Optoelect Engn, Xian 710021, Peoples R China

[2] Xian Technol Univ, Elect Informat Engn, Xian 710021, Peoples R China

Source:

ACTA PHOTONICA SINICA | 2023 / Vol. 52 / No. 12

Funding:

National Natural Science Foundation of China;

Keywords:

Infrared night vision technology; Target tracking; Correlation filter; Infrared image processing; Variable scale filter; Sparse representation;

DOI:

10.3788/gzxb20235212.1210003

Chinese Library Classification number:

O43 [Optics];

Subject classification code:

070207 ; 0803 ;

Abstract:

Infrared images contain less information and also suffer from blur and noise, which makes infrared visual target tracking difficult. The discriminative correlation filter is a reliable tracking method that can be trained online on an embedded hardware platform, but it does not handle changes in object scale efficiently. To address this problem, we propose an adaptive information selection method for infrared object tracking with a variable scale correlation filter. The proposed algorithm consists of three parts: a feature extractor, a location filter, and a scale filter. The feature extractor extracts three features to represent the object. The insensitive feature and the histogram of gradients are chosen as the base features. The insensitive feature represents the intensity of the object at its current position in the infrared image. The histogram of gradients, which carries more information about the shape of the target, represents regions of sudden change in the infrared image, such as the edges and corners of the target. To enhance the object's representation, we generate a new histogram of gradients based on the insensitive feature, which gives a further representation of the object. The feature information of the sample frame is then passed to the position filter. We train three position filters, one for each type of feature. In the training phase of the position filter, temporal regularization, spatial regularization and spatial information selection are added to the optimization. The temporal regularization keeps the correlation filter close to its previous coefficients, which helps the filter follow variations of the object. The spatial regularization constrains the correlation filter to concentrate on the object region during training.
To reduce redundancy in the object's information and speed up training, we set different thresholds for the coefficients. Coefficients below the threshold are set to zero, which reduces the number of coefficients. The spatial coefficients of each channel are treated individually. The position filter is then convolved with the features of the current frame, and we compute three response maps, one for each feature. The response maps are weighted and summed, and the position of the maximum value is taken as the estimated object position. The weights of the features are hyperparameters. The weight of the new feature is set smaller than those of the base features because of its larger receptive field. To better represent changes in the object's scale, we extract variable scale samples at the estimated object position. The height-to-width ratio differs across the variable scale samples. We flatten the features of all variable scale samples and concatenate them along the spatial dimension to form the scale training features. The corresponding scale factor is obtained by convolution with the learned scale filter coefficients, which determines the boundary ratio and hence the boundary of the object. We use the LSOTB-TIR and PTB-TIR datasets for testing. The proposed algorithm reaches 34.85 frames per second on a central processing unit platform. Its accuracy and success rate reach 71.3% and 59.4% on the LSOTB-TIR dataset, and 80.4% and 61.1% on the PTB-TIR dataset, respectively. We analyze each component of our algorithm and find that spatial selection contributes the most to precision. We also visualize the results of our method, which show that it tracks the object consistently and adapts to changes in the boundary ratio.
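
The coefficient thresholding and the weighted fusion of response maps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the use of coefficient magnitude for the threshold test, the toy 2x2 response maps, and the weights 0.4 / 0.4 / 0.2 are all assumptions chosen for illustration. The abstract states only that the weights are hyperparameters and that the new feature receives a smaller weight than the base features.

```python
from typing import List, Tuple

Map = List[List[float]]  # one 2-D channel or response map, stored row by row


def sparsify_channel(coeffs: Map, threshold: float) -> Map:
    """Zero the coefficients of one channel that fall below its threshold.

    Assumption: the comparison uses the coefficient magnitude.
    """
    return [[c if abs(c) >= threshold else 0.0 for c in row] for row in coeffs]


def fuse_responses(responses: List[Map], weights: List[float]) -> Map:
    """Return the weighted sum of equally sized response maps."""
    if len(responses) != len(weights):
        raise ValueError("one weight is required per response map")
    rows, cols = len(responses[0]), len(responses[0][0])
    fused = [[0.0] * cols for _ in range(rows)]
    for resp, w in zip(responses, weights):
        for r in range(rows):
            for c in range(cols):
                fused[r][c] += w * resp[r][c]
    return fused


def peak_position(response: Map) -> Tuple[int, int]:
    """Return (row, col) of the first maximum value in a response map."""
    best_r, best_c = 0, 0
    for r, row in enumerate(response):
        for c, value in enumerate(row):
            if value > response[best_r][best_c]:
                best_r, best_c = r, c
    return best_r, best_c


# Toy response maps for the two base features and the new feature (hypothetical values).
r_insensitive = [[0.1, 0.9], [0.2, 0.3]]
r_hog = [[0.2, 0.1], [0.8, 0.3]]
r_new = [[0.0, 0.2], [0.1, 0.9]]

# Hypothetical weights: the new feature gets the smallest one.
fused = fuse_responses([r_insensitive, r_hog, r_new], [0.4, 0.4, 0.2])
estimated_position = peak_position(fused)
```

In this toy case the new feature alone would place the peak at a different cell, but its smaller weight lets the two base features dominate the fused map. A real tracker would compute the response maps in the Fourier domain over multi-channel features, which this sketch omits.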

References

Pages: 13

33 entries in total

[1] Context-Aware Correlation Filter Learning Toward Peak Strength for Visual Tracking
Bouraffa, Tayssir
Yan, Liping
Feng, Zihang
Xiao, Bo
Wu, Q. M. Jonathan
Xia, Yuanqing
[J]. IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (10) : 5105 - 5115
[2] HiFT: Hierarchical Feature Transformer for Aerial Tracking
Cao, Ziang
Fu, Changhong
Ye, Junjie
Li, Bowen
Li, Yiming
[J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 15437 - 15446
[3] Histograms of oriented gradients for human detection
Dalal, N
Triggs, B
[J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 886 - 893
[4] Danelljan M., 2014, BRIT MACHINE VISION, P1
[5] ECO: Efficient Convolution Operators for Tracking
Danelljan, Martin
Bhat, Goutam
Khan, Fahad Shahbaz
Felsberg, Michael
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6931 - 6939
[6] Learning Spatially Regularized Correlation Filters for Visual Tracking
Danelljan, Martin
Hager, Gustav
Khan, Fahad Shahbaz
Felsberg, Michael
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 4310 - 4318
[7] AiATrack: Attention in Attention for Transformer Visual Tracking
Gao, Shenyuan
Zhou, Chunluan
Ma, Chao
Wang, Xinggang
Yuan, Junsong
[J]. COMPUTER VISION, ECCV 2022, PT XXII, 2022, 13682 : 146 - 164
[8] Gundogdu Erhan, 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), P1, DOI 10.1109/CVPRW.2015.7301290
[9] Evaluation of Feature Channels for Correlation-Filter-Based Visual Object Tracking in Infrared Spectrum
Gundogdu, Erhan
Koc, Aykut
Solmaz, Berkan
Hammoud, Riad I.
Alatan, A. Aydin
[J]. PROCEEDINGS OF 29TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, (CVPRW 2016), 2016, : 290 - 298
[10] HUANG Yueping, 2022, Digital Signal Processing
