Exploring the Better Correlation for Few-Shot Video Object Segmentation

被引:0
|
作者
Luo, Naisong [1 ]
Wang, Yuan [1 ]
Sun, Rui [1 ]
Xiong, Guoxin [1 ]
Zhang, Tianzhu [1 ,2 ]
Wu, Feng [1 ,2 ]
机构
[1] Univ Sci & Technol China, Sch Informat Sci, Hefei 230027, Peoples R China
[2] Deep Space Explorat Lab, Hefei 230088, Peoples R China
基金
中国国家自然科学基金;
关键词
Few-shot video object segmentation; video object segmentation; few-shot learning;
D O I
10.1109/TCSVT.2024.3491214
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Few-shot video object segmentation (FSVOS) aims to achieve accurate segmentation of novel objects in given video sequences, where the target objects are specified by limited annotated images as support. Most previous top-performing methods adopt the support-query semantic correlation learning paradigm or the intra-query temporal correlation learning paradigm. Nevertheless, they either fail to model temporal consistency across frames, resulting in inconsecutive segmentation, or lose diverse support object information, leading to incomplete segmentation. Therefore, we argue that it is more desirable to achieve both correlations in a collaborative manner. In this work, we delve into the issues present in the combination of few-shot image segmentation methods and video object segmentation methods and propose a dedicated Collaborative Correlation Network (CoCoNet) to address these problems, including a pixel correlation calibration module and a temporal correlation mining module. The proposed CoCoNet enjoys several merits. First, the pixel correlation calibration module aims to mitigate the noise issue in support-query correlation by integrating the affinity learning strategy and the prototype learning strategy. Specifically, we employ Optimal Transport to enrich pixel correlation with contextual information, thereby reducing intra-class differences between support and query. Second, the temporal correlation mining module is responsible for alleviating the issue of uncertainty in the initial frame and establishing reliable guidance for subsequent frames of the query video. With the collaboration of these two modules, our CoCoNet can effectively establish support-query and temporal correlation simultaneously and achieve accurate FSVOS. Extensive experimental results on two challenging benchmarks demonstrate that our method performs favorably against state-of-the-art FSVOS methods.
引用
收藏
页码:2133 / 2146
页数:14
相关论文
共 50 条
  • [31] Interactive Few-Shot Learning: Limited Supervision, Better Medical Image Segmentation
    Feng, Ruiwei
    Zheng, Xiangshang
    Gao, Tianxiang
    Chen, Jintai
    Wang, Wenzhe
    Chen, Danny Z.
    Wu, Jian
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021, 40 (10) : 2575 - 2588
  • [32] Exploring Quantization in Few-Shot Learning
    Wang, Meiqi
    Xue, Ruixin
    Lin, Jun
    Wang, Zhongfeng
    2020 18TH IEEE INTERNATIONAL NEW CIRCUITS AND SYSTEMS CONFERENCE (NEWCAS'20), 2020, : 279 - 282
  • [33] A comparative attention framework for better few-shot object detection on aerial images
    Le Jeune, Pierre
    Bahaduri, Bissmella
    Mokraoui, Anissa
    PATTERN RECOGNITION, 2025, 161
  • [34] Weakly-supervised Object Representation Learning for Few-shot Semantic Segmentation
    Ying, Xiaowen
    Li, Xin
    Chuah, Mooi Choo
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 1496 - 1505
  • [35] Few-shot Segmentation and Semantic Segmentation for Underwater Imagery
    Kabir, Imran
    Shaurya, Shubham
    Maigur, Vijayalaxmi
    Thakurdesai, Nikhil
    Latnekar, Mahesh
    Raunak, Mayank
    Crandall, David
    Reza, Md Alimoor
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 11451 - 11457
  • [36] Robust Temporally-Coherent Strategy for Few-shot Video Instance Segmentation
    Wang, Qiuyue
    Zhang, Songyang
    He, Xuming
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 251 - 255
  • [37] A few-shot semantic segmentation method based on adaptively mining correlation network
    Huang, Zhifu
    Jiang, Bin
    Liu, Yu
    ROBOTICA, 2023, 41 (06) : 1828 - 1836
  • [38] Few-Shot Aerial Image Semantic Segmentation Leveraging Pyramid Correlation Fusion
    Ao, Wei
    Zheng, Shunyi
    Meng, Yan
    Gao, Zhi
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61 : 1 - 12
  • [39] Few-Shot Air Object Detection Network
    Cai, Wei
    Wang, Xin
    Jiang, Xinhao
    Yang, Zhiyong
    Di, Xingyu
    Gao, Weijie
    ELECTRONICS, 2023, 12 (19)
  • [40] Few-Shot Learning for Road Object Detection
    Majee, Anay
    Agrawal, Kshitij
    Subramanian, Anbumani
    AAAI WORKSHOP ON META-LEARNING AND METADL CHALLENGE, VOL 140, 2021, 140 : 115 - 126