Spatial constraint for efficient semi-supervised video object segmentation

Cited by: 1

Authors:

Chen, Yadang [1,2]

Ji, Chuanjun [1,2]

Yang, Zhi-Xin [3,4]

Wu, Enhua [5]

Affiliations:

[1] Nanjing Univ Informat Sci & Technol, Engn Res Ctr Digital Forens, Minist Educ, Nanjing 210044, Peoples R China

[2] Nanjing Univ Informat Sci & Technol, Sch Comp & Software, Nanjing 210044, Peoples R China

[3] Univ Macau, State Key Lab Internet Things Smart City, Macau 999078, Peoples R China

[4] Univ Macau, Dept Electromech Engn, Macau 999078, Peoples R China

[5] Univ Chinese Acad Sci, Inst Software, State Key Lab Comp Sci, Beijing 100190, Peoples R China

Source:

COMPUTER VISION AND IMAGE UNDERSTANDING | 2023 / Vol. 237

Funding:

National Natural Science Foundation of China;

Keywords:

Video object segmentation; Memory-based methods; Redundant information; Semantically similar objects;

DOI:

10.1016/j.cviu.2023.103843

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Semi-supervised video object segmentation is the process of tracking and segmenting objects in a video sequence based on annotated masks for one or more frames. Recently, memory-based methods have attracted a significant amount of attention due to their strong performance. Having too much redundant information stored in memory, however, makes such methods inefficient and inaccurate. Moreover, a global matching strategy is usually used for memory reading, so these methods are susceptible to interference from semantically similar objects and are prone to incorrect segmentation. We propose a spatial constraint network to overcome these problems. In particular, we introduce a time-varying sensor and a dynamic feature memory to adaptively store pixel information to facilitate the modeling of the target object, which greatly reduces information redundancy in the memory without missing critical information. Furthermore, we propose an efficient memory reader that is less computationally intensive and has a smaller footprint. More importantly, we introduce a spatial constraint module to learn spatial consistency to obtain more precise segmentation; the target and distractors can be identified by the learned spatial response. The experimental results indicate that our method is competitive with state-of-the-art methods on several benchmark datasets. Our method also achieves an approximately 30 FPS inference speed, which is close to the requirement for real-time systems.
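The global matching strategy that the abstract identifies as a weakness of memory-based methods can be illustrated with a minimal sketch. This is not the authors' network: it is a generic softmax memory read over all stored pixels, written in plain Python, and the function name `global_memory_read` and the toy keys and values are assumptions made for illustration only. It shows why two memory pixels with similar appearance receive the same weight when no spatial information is used.

```python
import math

def global_memory_read(query_keys, memory_keys, memory_values):
    """Hypothetical helper: read memory for each query pixel by all-pairs matching.

    query_keys:    list of key vectors, one per query-frame pixel
    memory_keys:   list of key vectors, one per stored memory pixel
    memory_values: list of value vectors aligned with memory_keys
    Returns one value vector per query pixel.
    """
    readout = []
    dim = len(memory_values[0])
    for q in query_keys:
        # Dot-product affinity between this query pixel and every memory pixel.
        scores = [sum(a * b for a, b in zip(q, k)) for k in memory_keys]
        # Numerically stable softmax over all memory locations (no spatial prior).
        peak = max(scores)
        exps = [math.exp(s - peak) for s in scores]
        total = sum(exps)
        weights = [e / total for e in exps]
        # Weighted sum of the memory values.
        readout.append([
            sum(w * v[d] for w, v in zip(weights, memory_values))
            for d in range(dim)
        ])
    return readout

# Toy case: target and distractor share the same key (similar appearance).
similar_keys = [[1.0, 0.0], [1.0, 0.0]]
labels = [[1.0], [0.0]]  # 1.0 = target pixel, 0.0 = distractor pixel
ambiguous = global_memory_read([[1.0, 0.0]], similar_keys, labels)

# Toy case: target and distractor have clearly different keys.
distinct_keys = [[5.0, 0.0], [0.0, 5.0]]
confident = global_memory_read([[1.0, 0.0]], distinct_keys, labels)
```

In the first toy case the readout is an even mix of target and distractor, which is the kind of ambiguity a spatial constraint is meant to resolve; in the second the readout is dominated by the target.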


Pages: 10
