Learning Feature Restoration Transformer for Robust Dehazing Visual Object Tracking

被引:0
|
作者
Xu, Tianyang [1 ]
Pan, Yifan [1 ]
Feng, Zhenhua [2 ,3 ]
Zhu, Xuefeng [1 ]
Cheng, Chunyang [1 ]
Wu, Xiao-Jun [1 ]
Kittler, Josef [2 ,3 ]
机构
[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Peoples R China
[2] Univ Surrey, Sch Comp Sci & Elect Engn, Guildford GU2 7XH, England
[3] Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford GU27XH, England
基金
英国工程与自然科学研究理事会; 中国国家自然科学基金;
关键词
Visual object tracking; Dehazing system; Siamese tracker; Feature restoration;
D O I
10.1007/s11263-024-02182-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, deep-learning-based visual object tracking has obtained promising results. However, a drastic performance drop is observed when transferring a pre-trained model to changing weather conditions, such as hazy imaging scenarios, where the data distribution differs from that of a natural training set. This problem challenges the open-world practical applications of accurate target tracking. In principle, visual tracking performance relies on the discriminative degree of features between the target and its surroundings, rather than the image-level visual quality. To this end, we design a feature restoration transformer that adaptively enhances the representation capability of the extracted visual features for robust tracking in both natural and hazy scenarios. Specifically, a feature restoration transformer is constructed with dedicated self-attention hierarchies for the refinement of potentially contaminated deep feature maps. We endow the feature extraction process with a refinement mechanism typically for hazy imaging scenarios, establishing a tracking system that is robust against foggy videos. In essence, the feature restoration transformer is jointly trained with a Siamese tracking transformer. Intuitively, the supervision for learning discriminative and salient features is facilitated by the entire restoration tracking system. The experimental results obtained on hazy imaging scenarios demonstrate the merits and superiority of the proposed restoration tracking system, with complementary restoration power to image-level dehazing. In addition, consistent advantages of our design can be observed when generalised to different video attributes, demonstrating its capacity to deal with open-world scenarios.
引用
收藏
页码:6021 / 6038
页数:18
相关论文
共 50 条
  • [11] FEATURE FUSION FOR ROBUST OBJECT TRACKING
    Islam, M. A.
    Rasheduzzaman, M.
    Elahi, M. M. Lutfe
    Poon, Bruce
    Amin, M. Ashraful
    Yan, Hong
    PROCEEDINGS OF 2015 INTERNATIONAL CONFERENCE ON WAVELET ANALYSIS AND PATTERN RECOGNITION (ICWAPR), 2015, : 138 - 145
  • [12] ONLINE LEARNING OF MULTI-FEATURE WEIGHTS FOR ROBUST OBJECT TRACKING
    Zhou, Tao
    Bhaskar, Harish
    Xie, Kai
    Yang, Jie
    He, Xiangjian
    Shi, Pengfei
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 725 - 729
  • [13] Robust MIL-Based Feature Template Learning for Object Tracking
    Lan, Xiangyuan
    Yuen, Pong C.
    Chellappa, Rama
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 4118 - 4125
  • [14] Evaluating the Impact of Dehazing Algorithms on Visual Object Tracking Performance
    Demir, Huseyin Seckin
    Rajbharti, Noah
    Sciarappo, Sloan
    Christen, Jennifer Blain
    Ozev, Sule
    ADVANCES IN VISUAL COMPUTING, ISVC 2024, PT I, 2025, 15046 : 249 - 261
  • [15] Robust Visual Object Tracking Based on Feature Channel Weighting and Game Theory
    Ma, Sugang
    Zhao, Bo
    Hou, Zhiqiang
    Yu, Wangsheng
    Pu, Lei
    Zhang, Lei
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2023, 2023
  • [16] Multi-domain collaborative feature representation for robust visual object tracking
    Jiqing Zhang
    Kai Zhao
    Bo Dong
    Yingkai Fu
    Yuxin Wang
    Xin Yang
    Baocai Yin
    The Visual Computer, 2021, 37 : 2671 - 2683
  • [17] Multi-domain collaborative feature representation for robust visual object tracking
    Zhang, Jiqing
    Zhao, Kai
    Dong, Bo
    Fu, Yingkai
    Wang, Yuxin
    Yang, Xin
    Yin, Baocai
    VISUAL COMPUTER, 2021, 37 (9-11): : 2671 - 2683
  • [18] SCALE ROBUST ADAPTIVE FEATURE DENSITY APPROXIMATION FOR VISUAL OBJECT REPRESENTATION AND TRACKING
    Liu, C. Y.
    Yung, N. H. C.
    Fang, R. G.
    VISAPP 2009: PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 2, 2009, : 535 - +
  • [19] Mutual Learning and Feature Fusion Siamese Networks for Visual Object Tracking
    Jiang, Min
    Zhao, Yuyao
    Kong, Jun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (08) : 3154 - 3167
  • [20] Multiple templates transformer for visual object tracking
    Pang, Haibo
    Su, Jie
    Ma, Rongqi
    Li, Tingting
    Liu, Chengming
    KNOWLEDGE-BASED SYSTEMS, 2023, 280