Learning Feature Restoration Transformer for Robust Dehazing Visual Object Tracking

被引：0

作者：

Xu, Tianyang ^{[1
]}

Pan, Yifan ^{[1
]}

Feng, Zhenhua ^{[2
,3
]}

Zhu, Xuefeng ^{[1
]}

Cheng, Chunyang ^{[1
]}

Wu, Xiao-Jun ^{[1
]}

Kittler, Josef ^{[2
,3
]}

机构：

[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Peoples R China

[2] Univ Surrey, Sch Comp Sci & Elect Engn, Guildford GU2 7XH, England

[3] Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford GU27XH, England

来源：

INTERNATIONAL JOURNAL OF COMPUTER VISION | 2024年 / 132卷 / 12期

基金：

英国工程与自然科学研究理事会; 中国国家自然科学基金;

关键词：

Visual object tracking; Dehazing system; Siamese tracker; Feature restoration;

D O I：

10.1007/s11263-024-02182-9

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In recent years, deep-learning-based visual object tracking has obtained promising results. However, a drastic performance drop is observed when transferring a pre-trained model to changing weather conditions, such as hazy imaging scenarios, where the data distribution differs from that of a natural training set. This problem challenges the open-world practical applications of accurate target tracking. In principle, visual tracking performance relies on the discriminative degree of features between the target and its surroundings, rather than the image-level visual quality. To this end, we design a feature restoration transformer that adaptively enhances the representation capability of the extracted visual features for robust tracking in both natural and hazy scenarios. Specifically, a feature restoration transformer is constructed with dedicated self-attention hierarchies for the refinement of potentially contaminated deep feature maps. We endow the feature extraction process with a refinement mechanism typically for hazy imaging scenarios, establishing a tracking system that is robust against foggy videos. In essence, the feature restoration transformer is jointly trained with a Siamese tracking transformer. Intuitively, the supervision for learning discriminative and salient features is facilitated by the entire restoration tracking system. The experimental results obtained on hazy imaging scenarios demonstrate the merits and superiority of the proposed restoration tracking system, with complementary restoration power to image-level dehazing. In addition, consistent advantages of our design can be observed when generalised to different video attributes, demonstrating its capacity to deal with open-world scenarios.

引用

页码：6021 / 6038

页数：18

共 50 条

[11] FEATURE FUSION FOR ROBUST OBJECT TRACKING
Islam, M. A.
Rasheduzzaman, M.
Elahi, M. M. Lutfe
Poon, Bruce
Amin, M. Ashraful
Yan, Hong
PROCEEDINGS OF 2015 INTERNATIONAL CONFERENCE ON WAVELET ANALYSIS AND PATTERN RECOGNITION (ICWAPR), 2015, : 138 - 145
[12] ONLINE LEARNING OF MULTI-FEATURE WEIGHTS FOR ROBUST OBJECT TRACKING
Zhou, Tao
Bhaskar, Harish
Xie, Kai
Yang, Jie
He, Xiangjian
Shi, Pengfei
2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 725 - 729
[13] Robust MIL-Based Feature Template Learning for Object Tracking
Lan, Xiangyuan
Yuen, Pong C.
Chellappa, Rama
THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 4118 - 4125
[14] Evaluating the Impact of Dehazing Algorithms on Visual Object Tracking Performance
Demir, Huseyin Seckin
Rajbharti, Noah
Sciarappo, Sloan
Christen, Jennifer Blain
Ozev, Sule
ADVANCES IN VISUAL COMPUTING, ISVC 2024, PT I, 2025, 15046 : 249 - 261
[15] Robust Visual Object Tracking Based on Feature Channel Weighting and Game Theory
Ma, Sugang
Zhao, Bo
Hou, Zhiqiang
Yu, Wangsheng
Pu, Lei
Zhang, Lei
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2023, 2023
[16] Multi-domain collaborative feature representation for robust visual object tracking
Jiqing Zhang
Kai Zhao
Bo Dong
Yingkai Fu
Yuxin Wang
Xin Yang
Baocai Yin
The Visual Computer, 2021, 37 : 2671 - 2683
[17] Multi-domain collaborative feature representation for robust visual object tracking
Zhang, Jiqing
Zhao, Kai
Dong, Bo
Fu, Yingkai
Wang, Yuxin
Yang, Xin
Yin, Baocai
VISUAL COMPUTER, 2021, 37 (9-11): : 2671 - 2683
[18] SCALE ROBUST ADAPTIVE FEATURE DENSITY APPROXIMATION FOR VISUAL OBJECT REPRESENTATION AND TRACKING
Liu, C. Y.
Yung, N. H. C.
Fang, R. G.
VISAPP 2009: PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 2, 2009, : 535 - +
[19] Mutual Learning and Feature Fusion Siamese Networks for Visual Object Tracking
Jiang, Min
Zhao, Yuyao
Kong, Jun
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (08) : 3154 - 3167
[20] Multiple templates transformer for visual object tracking
Pang, Haibo
Su, Jie
Ma, Rongqi
Li, Tingting
Liu, Chengming
KNOWLEDGE-BASED SYSTEMS, 2023, 280

← 1 2 3 4 5 →