Holistic Prototype Attention Network for Few-Shot Video Object Segmentation

Cited by: 8
Authors
Tang, Yin [1]
Chen, Tao [1]
Jiang, Xiruo [1]
Yao, Yazhou [1]
Xie, Guo-Sen [1]
Shen, Heng-Tao [2]
Affiliations
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Prototypes; Task analysis; Object segmentation; Semantic segmentation; Semantics; Feature extraction; Annotations; Few-shot video object segmentation; video object segmentation; few-shot semantic segmentation;
DOI
10.1109/TCSVT.2023.3296629
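Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code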
0808 ; 0809 ;
Abstract
Few-shot video object segmentation (FSVOS) aims to segment dynamic objects of unseen classes by resorting to a small set of support images that contain pixel-level object annotations. Existing methods have demonstrated that the domain agent-based attention mechanism is effective in FSVOS by learning the correlation between support images and query frames. However, the agent frame contains redundant pixel information and background noise, resulting in inferior segmentation performance. Moreover, existing methods tend to ignore inter-frame correlations in query videos. To alleviate the above dilemma, we propose a holistic prototype attention network (HPAN) for advancing FSVOS. Specifically, HPAN introduces a prototype graph attention module (PGAM) and a bidirectional prototype attention module (BPAM), transferring informative knowledge from seen to unseen classes. PGAM generates local prototypes from all foreground features and then utilizes their internal correlations to enhance the representation of the holistic prototypes. BPAM exploits the holistic information from support images and video frames by fusing co-attention and self-attention to achieve support-query semantic consistency and inner-frame temporal consistency. Extensive experiments on YouTube-FSVOS have been provided to demonstrate the effectiveness and superiority of our proposed HPAN method. Our source code and models are available anonymously at https://github.com/NUST-Machine-Intelligence-Laboratory/HPAN.
Pages: 6699-6709
Page count: 11
Related papers
50 in total
  • [31] Adaptive Prototype Learning and Allocation for Few-Shot Segmentation
    Li, Gen
    Jampani, Varun
    Sevilla-Lara, Laura
    Sun, Deqing
    Kim, Jonghyun
    Kim, Joongkyu
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8330 - 8339
  • [32] Variational Prototype Inference for Few-Shot Semantic Segmentation
    Wang, Haochen
    Yang, Yandan
    Cao, Xianbin
    Zhen, Xiantong
    Snoek, Cees
    Shao, Ling
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 525 - 534
  • [33] Few-shot defect segmentation based on cross-modal attention aggregation and adaptive prototype generation network
    Liu, Shi-Tong
    Zhang, Yun-Zhou
    Shan, De-Xing
    Jin, Yang
    Ning, Jian
    Kongzhi yu Juece/Control and Decision, 2024, 39 (11): : 3655 - 3663
  • [34] Contrastive prototype network with prototype augmentation for few-shot classification
    Jiang, Mengjuan
    Fan, Jiaqing
    He, Jiangzhen
    Du, Weidong
    Wang, Yansong
    Li, Fanzhang
    INFORMATION SCIENCES, 2025, 686
  • [35] Multiscale Attention-Based Prototypical Network For Few-Shot Semantic Segmentation
    Zhang, Yifei
    Sidibe, Desire
    Morel, Olivier
    Meriaudeau, Fabrice
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7372 - 7378
  • [36] ARNET: ATTENTION-BASED REFINEMENT NETWORK FOR FEW-SHOT SEMANTIC SEGMENTATION
    Li, Rusheng
    Liu, Hanhui
    Zhu, Yuesheng
    Bai, Zhiqiang
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2238 - 2242
  • [37] Center Heatmap Attention for Few-Shot Object Detection
    Li, Fanglin
    Yuan, Jie
    Yi, Fengshu
    Cai, Xiaomin
    Gao, Hao
    INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND ROBOTICS 2021, 2021, 11884
  • [38] Object-Aware Attention in Few-Shot Learning
    Shen, Yeqing
    Mo, Lisha
    Ma, Huimin
    Hu, Tianyu
    Dong, Yuhan
    IMAGE AND GRAPHICS TECHNOLOGIES AND APPLICATIONS, IGTA 2021, 2021, 1480 : 95 - 108
  • [39] Few-Shot Air Object Detection Network
    Cai, Wei
    Wang, Xin
    Jiang, Xinhao
    Yang, Zhiyong
    Di, Xingyu
    Gao, Weijie
    ELECTRONICS, 2023, 12 (19)
  • [40] GRAPH AFFINITY NETWORK FOR FEW-SHOT SEGMENTATION
    Luo, Xiaoliu
    Zhang, Taiping
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 609 - 613