Exploring the Better Correlation for Few-Shot Video Object Segmentation

Authors
Luo, Naisong [1]
Wang, Yuan [1]
Sun, Rui [1]
Xiong, Guoxin [1]
Zhang, Tianzhu [1, 2]
Wu, Feng [1, 2]
Affiliations
[1] Univ Sci & Technol China, Sch Informat Sci, Hefei 230027, Peoples R China
[2] Deep Space Explorat Lab, Hefei 230088, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Few-shot video object segmentation; video object segmentation; few-shot learning
DOI
10.1109/TCSVT.2024.3491214
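Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Subject classification codes
0808; 0809
Abstract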
Few-shot video object segmentation (FSVOS) aims to accurately segment novel objects in given video sequences, where the target objects are specified by a limited number of annotated support images. Most previous top-performing methods adopt either the support-query semantic correlation learning paradigm or the intra-query temporal correlation learning paradigm. The former fails to model temporal consistency across frames, resulting in temporally inconsistent segmentation; the latter loses diverse support object information, leading to incomplete segmentation. We therefore argue that it is more desirable to learn both correlations collaboratively. In this work, we examine the issues that arise when few-shot image segmentation methods are combined with video object segmentation methods, and propose a dedicated Collaborative Correlation Network (CoCoNet) to address them. CoCoNet consists of a pixel correlation calibration module and a temporal correlation mining module, and it has several merits. First, the pixel correlation calibration module mitigates noise in the support-query correlation by integrating the affinity learning strategy with the prototype learning strategy. Specifically, we employ Optimal Transport to enrich the pixel correlation with contextual information, thereby reducing intra-class differences between support and query. Second, the temporal correlation mining module alleviates uncertainty in the initial frame and establishes reliable guidance for subsequent frames of the query video. With these two modules working together, CoCoNet effectively establishes support-query and temporal correlations simultaneously and achieves accurate FSVOS. Extensive experimental results on two challenging benchmarks demonstrate that our method performs favorably against state-of-the-art FSVOS methods.
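The abstract does not spell out how Optimal Transport is applied to the pixel correlation, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes cosine distance between feature vectors as the transport cost, uniform marginals, and plain Sinkhorn iterations; the function names (`cosine_similarity`, `sinkhorn`, `transport_correlation`) and the parameters `epsilon` and `n_iters` are hypothetical and chosen for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two feature vectors given as lists."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b + 1e-8)


def sinkhorn(cost, epsilon=0.1, n_iters=200):
    """Entropy-regularised optimal transport with uniform marginals.

    cost is an n x m matrix (list of lists). The returned plan has rows
    summing to about 1/n and columns summing to 1/m.
    """
    n, m = len(cost), len(cost[0])
    # Gibbs kernel: a low cost gives a high affinity.
    kernel = [[math.exp(-c / epsilon) for c in row] for row in cost]
    row_mass, col_mass = 1.0 / n, 1.0 / m
    u = [1.0] * n
    v = [1.0] * m
    for _ in range(n_iters):
        # Alternately rescale rows and columns to match the marginals.
        u = [row_mass / sum(kernel[i][j] * v[j] for j in range(m))
             for i in range(n)]
        v = [col_mass / sum(kernel[i][j] * u[i] for i in range(n))
             for j in range(m)]
    return [[u[i] * kernel[i][j] * v[j] for j in range(m)] for i in range(n)]


def transport_correlation(query_feats, support_feats, epsilon=0.1):
    """Assumed setup: cost = 1 - cosine similarity for each pixel pair."""
    cost = [[1.0 - cosine_similarity(q, s) for s in support_feats]
            for q in query_feats]
    return sinkhorn(cost, epsilon)


# Toy example: three query "pixels" and two support "pixels" in 2-D.
query = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
support = [[1.0, 0.1], [0.1, 1.0]]
plan = transport_correlation(query, support)
```

Unlike a raw similarity matrix, each entry of the transport plan depends on all other pixel pairs through the marginal constraints, which is one plausible reading of "enriching pixel correlation with contextual information". In the toy example, the first query pixel sends most of its mass to the first support pixel, and the ambiguous third pixel splits its mass between both.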
Pages: 2133-2146
Number of pages: 14