Fast pixel-matching for video object segmentation

被引:8
|
作者
Yu, Siyue [1 ]
Xiao, Jimin [1 ]
Zhang, Bingfeng [1 ]
Lim, Eng Gee [1 ]
Zhao, Yao [2 ]
机构
[1] Xian Jiaotong Liverpool Univ, Suzhou, Jiangsu, Peoples R China
[2] Beijing Jiaotong Univ, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Non-local pixel matching; Mask-propagation; Encoder-decoder;
D O I
10.1016/j.image.2021.116373
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Video object segmentation, aiming to segment the foreground objects given the annotation of the first frame, has been attracting increasing attentions. Many state-of-the-art approaches have achieved great performance by relying on online model updating or mask-propagation techniques. However, most online models require high computational cost due to model fine-tuning during inference. Most mask-propagation based models are faster but with relatively low performance due to failure to adapt to object appearance variation. In this paper, we are aiming to design a new model to make a good balance between speed and performance. We propose a model, called NPMCA-net, which directly localizes foreground objects based on mask-propagation and non-local technique by matching pixels in reference and target frames. Since we bring in information of both first and previous frames, our network is robust to large object appearance variation, and can better adapt to occlusions. Extensive experiments show that our approach can achieve a new state-of-the-art performance with a fast speed at the same time (86.5% IoU on DAVIS-2016 and 72.2% IoU on DAVIS-2017, with speed of 0.11s per frame) under the same level comparison. Source code is available at https://github.com/siyueyu/NPMCA-net.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Lightweight video object segmentation: Integrating online knowledge distillation for fast segmentation
    Hou, Zhiqiang
    Wang, Chenxu
    Ma, Sugang
    Dong, Jiale
    Wang, Yunchen
    Yu, Wangsheng
    Yang, Xiaobao
    KNOWLEDGE-BASED SYSTEMS, 2025, 308
  • [32] Fast Video Object Segmentation Using Markov Random Field
    Mak, Chun-Man
    Cham, Wai-Kuen
    2008 IEEE 10TH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, VOLS 1 AND 2, 2008, : 347 - 352
  • [33] FAST TEXTURE SEGMENTATION FOR OBJECT-ORIENTED VIDEO CODING
    LAVAGETTO, F
    COCURULLO, F
    EUROPEAN TRANSACTIONS ON TELECOMMUNICATIONS, 1995, 6 (03): : 241 - 253
  • [34] Fast Appearance Modeling for Automatic Primary Video Object Segmentation
    Yang, Jiong
    Price, Brian
    Shen, Xiaohui
    Lin, Zhe
    Yuan, Junsong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (02) : 503 - 515
  • [35] RANet: Ranking Attention Network for Fast Video Object Segmentation
    Wang, Ziqin
    Xu, Jun
    Liu, Li
    Zhu, Fan
    Shao, Ling
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3977 - 3986
  • [36] Fast Video Object Segmentation via Dynamic Targeting Network
    Zhang, Lu
    Lin, Zhe
    Zhang, Jianming
    Lu, Huchuan
    He, You
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 5581 - 5590
  • [37] Fast Interactive Video Object Segmentation with Graph Neural Networks
    Varga, Viktor
    Lorincz, Andras
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [38] Fast texture segmentation for object-oriented video coding
    Lavagetto, Fabio
    Cocurullo, Fabio
    European transactions on telecommunications and related technologies, 1995, 6 (03): : 241 - 253
  • [39] DMVOS: Discriminative Matching for real-time Video Object Segmentation
    Wen, Peisong
    Yang, Ruolin
    Xu, Qianqian
    Qian, Chen
    Huang, Qingming
    Cong, Runming
    Si, Jianlou
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 2048 - 2056
  • [40] COMatchNet: Co-Attention Matching Network for Video Object Segmentation
    Huang, Lufei
    Sun, Fengming
    Yuan, Xia
    PATTERN RECOGNITION, ACPR 2021, PT I, 2022, 13188 : 271 - 284