Memory Propagation-based Target-aware Segmentation Tracker with Adaptive Mask-attention Decision Network

被引:0
|
作者
Zhang, Huanlong [1 ]
Fu, Weiqiang [1 ]
Zhou, Bin [1 ]
Zhou, Keyan [1 ]
Yang, Xiangbo [1 ]
Liu, Shanfeng [2 ]
机构
[1] Zhengzhou Univ Light Ind, Coll Elect & Informat Engn, Zhengzhou 450000, Peoples R China
[2] State Grid Henan Elect Power Res Inst, Zhengzhou 450000, Peoples R China
来源
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS | 2024年 / 18卷 / 09期
基金
中国国家自然科学基金;
关键词
Object Segmentation and Tracking; Siamese Network; Mask Propagation; Mask evaluation; Attention Mechanism; OBJECT TRACKING;
D O I
10.3837/tiis.2024.09.008
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Siamese-based segmentation and tracking algorithms improve accuracy and stability for video object segmentation and tracking tasks simultaneously. Although effective, variability in target appearance and background clutter can still affect segmentation accuracy and further influence the performance of tracking. In this paper, we present a memory propagation-based target- aware and mask-attention decision network for robust object segmentation and tracking. Firstly, a mask propagation-based attention module (MPAM) is constructed to explore the inherent correlation among image frames, which can mine mask information of the historical frames. By retrieving a memory bank (MB) that stores features and binary masks of historical frames, target attention maps are generated to highlight the target region on backbone features, thus suppressing the adverse effects of background clutter. Secondly, an attention refinement pathway (ARP) is designed to further refine the segmentation profile in the process of mask generation. A lightweight attention mechanism is introduced to calculate the weight of lowlevel features, paying more attention to low-level features sensitive to edge detail so as to obtain segmentation results. Finally, a mask fusion mechanism (MFM) is proposed to enhance the accuracy of the mask. By utilizing a mask quality assessment decision network, the corresponding quality scores of the "initial mask" and the "previous mask" can be obtained adaptively, thus achieving the assignment of weights and the fusion of masks. Therefore, the final mask enjoys higher accuracy and stability. Experimental results on multiple benchmarks demonstrate that our algorithm performs outstanding performance in a variety of challenging tracking tasks.
引用
收藏
页码:2605 / 2625
页数:21
相关论文
共 2 条
  • [1] Lightweight Target-Aware Attention Learning Network-Based Target Tracking Method
    Zhao, Yanchun
    Zhang, Jiapeng
    Duan, Rui
    Li, Fusheng
    Zhang, Huanlong
    MATHEMATICS, 2022, 10 (13)
  • [2] Decision Transformer based Target-aware Feature Selection Network for Behavior-oriented Recommendation
    Li, Tianyang
    Yan, Hongbin
    Xu, Hanwen
    PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 2430 - 2435