Spatial constraint for efficient semi-supervised video object segmentation

Citations: 1
Authors
Chen, Yadang [1, 2]
Ji, Chuanjun [1, 2]
Yang, Zhi-Xin [3, 4]
Wu, Enhua [5]
Affiliations
[1] Nanjing Univ Informat Sci & Technol, Engn Res Ctr Digital Forens, Minist Educ, Nanjing 210044, Peoples R China
[2] Nanjing Univ Informat Sci & Technol, Sch Comp & Software, Nanjing 210044, Peoples R China
[3] Univ Macau, State Key Lab Internet Things Smart City, Macau 999078, Peoples R China
[4] Univ Macau, Dept Electromech Engn, Macau 999078, Peoples R China
[5] Univ Chinese Acad Sci, Inst Software, State Key Lab Comp Sci, Beijing 100190, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Video object segmentation; Memory-based methods; Redundant information; Semantically similar objects;
DOI
10.1016/j.cviu.2023.103843
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Semi-supervised video object segmentation is the process of tracking and segmenting objects in a video sequence based on annotated masks for one or more frames. Recently, memory-based methods have attracted a significant amount of attention due to their strong performance. Having too much redundant information stored in memory, however, makes such methods inefficient and inaccurate. Moreover, a global matching strategy is usually used for memory reading, so these methods are susceptible to interference from semantically similar objects and are prone to incorrect segmentation. We propose a spatial constraint network to overcome these problems. In particular, we introduce a time-varying sensor and a dynamic feature memory to adaptively store pixel information to facilitate the modeling of the target object, which greatly reduces information redundancy in the memory without missing critical information. Furthermore, we propose an efficient memory reader that is less computationally intensive and has a smaller footprint. More importantly, we introduce a spatial constraint module to learn spatial consistency to obtain more precise segmentation; the target and distractors can be identified by the learned spatial response. The experimental results indicate that our method is competitive with state-of-the-art methods on several benchmark datasets. Our method also achieves an approximately 30 FPS inference speed, which is close to the requirement for real-time systems.
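The interference problem described in the abstract can be illustrated with a toy example. The sketch below is not the authors' network: it implements a generic global-matching memory read (softmax attention over all memory pixels) and then reweights it with a hand-set spatial prior. The function names, the toy keys and values, and the `spatial_prior` parameter are all assumptions made for illustration; in the paper the spatial response is learned, not supplied by hand.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def read_memory(query_key, memory_keys, memory_values, spatial_prior=None):
    """Read a value for one query pixel from a key/value memory.

    Without spatial_prior this is plain global matching: every memory
    pixel competes based on key similarity alone. With spatial_prior
    (hypothetical, one non-negative weight per memory pixel) the
    attention weights are rescaled and renormalized, so memory pixels
    at implausible locations contribute less.
    """
    weights = softmax([dot(query_key, k) for k in memory_keys])
    if spatial_prior is not None:
        weights = [w * p for w, p in zip(weights, spatial_prior)]
        total = sum(weights)
        if total == 0:
            raise ValueError("spatial_prior removes every memory pixel")
        weights = [w / total for w in weights]
    dim = len(memory_values[0])
    return [sum(w * v[d] for w, v in zip(weights, memory_values))
            for d in range(dim)]


# Two memory pixels with identical keys: the target and a look-alike distractor.
memory_keys = [[1.0, 0.0], [1.0, 0.0]]
memory_values = [[1.0], [0.0]]  # 1.0 = target mask, 0.0 = not target
query_key = [1.0, 0.0]

global_read = read_memory(query_key, memory_keys, memory_values)
constrained_read = read_memory(query_key, memory_keys, memory_values,
                               spatial_prior=[1.0, 0.0])
```

With identical keys, global matching cannot separate the target from the distractor and returns an ambiguous blend of their values, whereas the spatially reweighted read recovers the target's value. This is only meant to show why appearance-only matching is vulnerable to semantically similar objects.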
Pages: 10