Temporal Transductive Inference for Few-Shot Video Object Segmentation

被引:0
|
作者
Siam, Mennatullah [1 ]
机构
[1] Univ British Columbia, Comp Sci, Vancouver, BC, Canada
关键词
Few-shot learning; Transductive inference; Video object segmentation;
D O I
10.1007/s11263-025-02390-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-shot video object segmentation (FS-VOS) aims at segmenting video frames using a few labelled examples of classes not seen during initial training. In this paper, we present a simple but effective temporal transductive inference (TTI) approach that leverages temporal consistency in the unlabelled video frames during few-shot inference without episodic training. Key to our approach is the use of a video-level temporal constraint that augments frame-level constraints. The objective of the video-level constraint is to learn consistent linear classifiers for novel classes across the image sequence. It acts as a spatiotemporal regularizer during the transductive inference to increase temporal coherence and reduce overfitting on the few-shot support set. Empirically, our approach outperforms state-of-the-art meta-learning approaches in terms of mean intersection over union on YouTube-VIS by 2.5%. In addition, we introduce an improved benchmark dataset that is exhaustively labelled (i.e., all object occurrences are labelled, unlike the currently available). Our empirical results and temporal consistency analysis confirm the added benefits of the proposed spatiotemporal regularizer to improve temporal coherence. Our code and benchmark dataset is publicly available at, https://github.com/MSiam/tti_fsvos/.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Reshaping Bioacoustics Event Detection: Leveraging Few-Shot Learning (FSL) with Transductive Inference and Data Augmentation
    Ijaz, Nouman
    Banoori, Farhad
    Koo, Insoo
    BIOENGINEERING-BASEL, 2024, 11 (07):
  • [42] ADAPTIVE ANCHOR LABEL PROPAGATION FOR TRANSDUCTIVE FEW-SHOT LEARNING
    Lazarou, Michalis
    Avrithis, Yannis
    Ren, Guangyu
    Stathaki, Tania
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 331 - 335
  • [43] Feature Transductive Distribution Optimization for Few-Shot Image Classification
    Liu, Qing
    Tang, Xianlun
    Wang, Ying
    Li, Xingchen
    Jiang, Xinyan
    Li, Weisheng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (03) : 2230 - 2243
  • [44] Transductive Graph-Attention Network for Few-shot Classification
    Pan, Lili
    Liu, Weifeng
    2022 16TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP2022), VOL 1, 2022, : 190 - 195
  • [45] Transductive clustering optimization learning for few-shot image classification
    Wang, Yi
    Bian, Xiong
    Zhu, Songhao
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (04)
  • [46] Few-shot Video-to-Video Synthesis
    Wang, Ting-Chun
    Liu, Ming-Yu
    Tao, Andrew
    Liu, Guilin
    Kautz, Jan
    Catanzaro, Bryan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [47] Transductive Relation-Propagation Network for Few-shot Learning
    Ma, Yuqing
    Bai, Shihao
    An, Shan
    Liu, Wei
    Liu, Aishan
    Zhen, Xiantong
    Liu, Xianglong
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 804 - 810
  • [48] Capturing the few-shot class distribution: Transductive distribution optimization
    Liu, Xinyue
    Liu, Ligang
    Liu, Han
    Zhang, Xiaotong
    PATTERN RECOGNITION, 2023, 138
  • [49] A robust transductive distribution calibration method for few-shot learning
    Li, Jingcong
    Ye, Chunjin
    Wang, Fei
    Pan, Jiahui
    PATTERN RECOGNITION, 2025, 163
  • [50] Few-Shot Action Recognition with A Transductive Maximum Margin Classifier
    Pan, Fei
    Guo, Jie
    Guo, Yanwen
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,