Fast pixel-matching for video object segmentation

被引：8

作者：

Yu, Siyue ^{[1
]}

Xiao, Jimin ^{[1
]}

Zhang, Bingfeng ^{[1
]}

Lim, Eng Gee ^{[1
]}

Zhao, Yao ^{[2
]}

机构：

[1] Xian Jiaotong Liverpool Univ, Suzhou, Jiangsu, Peoples R China

[2] Beijing Jiaotong Univ, Beijing, Peoples R China

来源：

SIGNAL PROCESSING-IMAGE COMMUNICATION | 2021年 / 98卷

基金：

中国国家自然科学基金;

关键词：

Non-local pixel matching; Mask-propagation; Encoder-decoder;

D O I：

10.1016/j.image.2021.116373

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Video object segmentation, aiming to segment the foreground objects given the annotation of the first frame, has been attracting increasing attentions. Many state-of-the-art approaches have achieved great performance by relying on online model updating or mask-propagation techniques. However, most online models require high computational cost due to model fine-tuning during inference. Most mask-propagation based models are faster but with relatively low performance due to failure to adapt to object appearance variation. In this paper, we are aiming to design a new model to make a good balance between speed and performance. We propose a model, called NPMCA-net, which directly localizes foreground objects based on mask-propagation and non-local technique by matching pixels in reference and target frames. Since we bring in information of both first and previous frames, our network is robust to large object appearance variation, and can better adapt to occlusions. Extensive experiments show that our approach can achieve a new state-of-the-art performance with a fast speed at the same time (86.5% IoU on DAVIS-2016 and 72.2% IoU on DAVIS-2017, with speed of 0.11s per frame) under the same level comparison. Source code is available at https://github.com/siyueyu/NPMCA-net.

引用

页数：10

共 50 条

[31] Lightweight video object segmentation: Integrating online knowledge distillation for fast segmentation
Hou, Zhiqiang
Wang, Chenxu
Ma, Sugang
Dong, Jiale
Wang, Yunchen
Yu, Wangsheng
Yang, Xiaobao
KNOWLEDGE-BASED SYSTEMS, 2025, 308
[32] Fast Video Object Segmentation Using Markov Random Field
Mak, Chun-Man
Cham, Wai-Kuen
2008 IEEE 10TH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, VOLS 1 AND 2, 2008, : 347 - 352
[33] FAST TEXTURE SEGMENTATION FOR OBJECT-ORIENTED VIDEO CODING
LAVAGETTO, F
COCURULLO, F
EUROPEAN TRANSACTIONS ON TELECOMMUNICATIONS, 1995, 6 (03): : 241 - 253
[34] Fast Appearance Modeling for Automatic Primary Video Object Segmentation
Yang, Jiong
Price, Brian
Shen, Xiaohui
Lin, Zhe
Yuan, Junsong
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (02) : 503 - 515
[35] RANet: Ranking Attention Network for Fast Video Object Segmentation
Wang, Ziqin
Xu, Jun
Liu, Li
Zhu, Fan
Shao, Ling
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3977 - 3986
[36] Fast Video Object Segmentation via Dynamic Targeting Network
Zhang, Lu
Lin, Zhe
Zhang, Jianming
Lu, Huchuan
He, You
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 5581 - 5590
[37] Fast Interactive Video Object Segmentation with Graph Neural Networks
Varga, Viktor
Lorincz, Andras
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[38] Fast texture segmentation for object-oriented video coding
Lavagetto, Fabio
Cocurullo, Fabio
European transactions on telecommunications and related technologies, 1995, 6 (03): : 241 - 253
[39] DMVOS: Discriminative Matching for real-time Video Object Segmentation
Wen, Peisong
Yang, Ruolin
Xu, Qianqian
Qian, Chen
Huang, Qingming
Cong, Runming
Si, Jianlou
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 2048 - 2056
[40] COMatchNet: Co-Attention Matching Network for Video Object Segmentation
Huang, Lufei
Sun, Fengming
Yuan, Xia
PATTERN RECOGNITION, ACPR 2021, PT I, 2022, 13188 : 271 - 284

← 1 2 3 4 5 →