Temporal Transductive Inference for Few-Shot Video Object Segmentation

被引:0
|
作者
Siam, Mennatullah [1 ]
机构
[1] Univ British Columbia, Comp Sci, Vancouver, BC, Canada
关键词
Few-shot learning; Transductive inference; Video object segmentation;
D O I
10.1007/s11263-025-02390-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-shot video object segmentation (FS-VOS) aims at segmenting video frames using a few labelled examples of classes not seen during initial training. In this paper, we present a simple but effective temporal transductive inference (TTI) approach that leverages temporal consistency in the unlabelled video frames during few-shot inference without episodic training. Key to our approach is the use of a video-level temporal constraint that augments frame-level constraints. The objective of the video-level constraint is to learn consistent linear classifiers for novel classes across the image sequence. It acts as a spatiotemporal regularizer during the transductive inference to increase temporal coherence and reduce overfitting on the few-shot support set. Empirically, our approach outperforms state-of-the-art meta-learning approaches in terms of mean intersection over union on YouTube-VIS by 2.5%. In addition, we introduce an improved benchmark dataset that is exhaustively labelled (i.e., all object occurrences are labelled, unlike the currently available). Our empirical results and temporal consistency analysis confirm the added benefits of the proposed spatiotemporal regularizer to improve temporal coherence. Our code and benchmark dataset is publicly available at, https://github.com/MSiam/tti_fsvos/.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Transductive distribution calibration for few-shot learning
    Li, Gang
    Zheng, Changwen
    Su, Bing
    Neurocomputing, 2022, 500 : 604 - 615
  • [22] Transductive Information Maximization For Few-Shot Learning
    Boudiaf, Malik
    Masud, Ziko Imtiaz
    Rony, Jerome
    Dolz, Jose
    Piantanida, Pablo
    Ben Ayed, Ismail
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [23] Realistic Evaluation of Transductive Few-Shot Learning
    Veilleux, Olivier
    Boudiaf, Malik
    Piantanida, Pablo
    Ben Ayed, Ismail
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [24] Transductive Few-Shot Classification on the Oblique Manifold
    Qi, Guodong
    Yu, Huimin
    Lu, Zhaohui
    Li, Shuzhao
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 8392 - 8402
  • [25] TRANSDUCTIVE PROTOTYPICAL NETWORK FOR FEW-SHOT CLASSIFICATION
    Liu, Xinyue
    Liu, Pengxin
    Zong, Linlin
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1671 - 1675
  • [26] Transductive distribution calibration for few-shot learning
    Li, Gang
    Zheng, Changwen
    Su, Bing
    NEUROCOMPUTING, 2022, 500 : 604 - 615
  • [27] Temporal Speciation Network for Few-Shot Object Detection
    Zhao, Xiaowei
    Liu, Xianglong
    Ma, Yuqing
    Bai, Shihao
    Shen, Yifan
    Hao, Zeyu
    Liu, Aishan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 8267 - 8278
  • [28] Adaptive dynamic inference for few-shot left atrium segmentation
    Chen, Jun
    Li, Xuejiao
    Zhang, Heye
    Cho, Yongwon
    Hwang, Sung Ho
    Gao, Zhifan
    Yang, Guang
    MEDICAL IMAGE ANALYSIS, 2024, 98
  • [29] Towards Practical Few-Shot Query Sets: Transductive Minimum Description Length Inference
    Martin, Segolene
    Boudiaf, Malik
    Chouzenoux, Emilie
    Pesquet, Jean-Christophe
    Ben Ayed, Ismail
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [30] CobNet: Cross Attention on Object and Background for Few-Shot Segmentation
    Guan, Haoyan
    Michael, Spratling
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 39 - 45