SWEM: Towards Real-Time Video Object Segmentation with Sequential Weighted Expectation-Maximization

被引:22
|
作者
Lin, Zhihui [1 ,2 ]
Yang, Tianyu [2 ]
Li, Maomao [2 ]
Wang, Ziyu [3 ]
Yuan, Chun [4 ]
Jiang, Wenhao [3 ]
Liu, Wei [3 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
[2] Tencent AI Lab, Shenzhen, Peoples R China
[3] Tencent Data Platform, Shenzhen, Peoples R China
[4] Tsinghua Shenzhen Int Grad Sch, Peng Cheng Lab, Shenzhen, Peoples R China
关键词
D O I
10.1109/CVPR52688.2022.00142
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Matching-based methods, especially those based on space-time memory, are significantly ahead of other solutions in semi-supervised video object segmentation (VOS). However, continuously growing and redundant template features lead to an inefficient inference. To alleviate this, we propose a novel Sequential Weighted Expectation-Maximization (SWEM) network to greatly reduce the redundancy of memory features. Different from the previous methods which only detect feature redundancy between frames, SWEM merges both intra-frame and inter-frame similar features by leveraging the sequential weighted EM algorithm. Further, adaptive weights for frame features endow SWEM with the flexibility to represent hard samples, improving the discrimination of templates. Besides, the proposed method maintains a fixed number of template features in memory, which ensures the stable inference complexity of the VOS system. Extensive experiments on commonly used DAVIS and YouTube-VOS datasets verify the high efficiency (36 FPS) and high performance (84.3% JSzT on DAVIS 2017 validation dataset) of SWEM.
引用
收藏
页码:1352 / 1362
页数:11
相关论文
共 50 条
  • [1] Real-time and light-weighted unsupervised video object segmentation network
    Zhao, Zongji
    Zhao, Sanyuan
    Shen, Jianbing
    PATTERN RECOGNITION, 2021, 120
  • [2] SwiftNet: Real-time Video Object Segmentation
    Wang, Haochen
    Jiang, Xiaolong
    Ren, Haibing
    Hu, Yao
    Bai, Song
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 1296 - 1305
  • [3] Automatic image segmentation for concealed object detection using the expectation-maximization algorithm
    Lee, Dong-Su
    Yeom, Seokwon
    Son, Jung-Young
    Kim, Shin-Hwan
    OPTICS EXPRESS, 2010, 18 (10): : 10659 - 10667
  • [4] Synergy between Object Recognition and Image Segmentation Using the Expectation-Maximization Algorithm
    Kokkinos, Iasonas
    Maragos, Petros
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (08) : 1486 - 1501
  • [5] A real-time expectation-maximization algorithm for acquiring multiplanar maps of indoor environments with mobile robots
    Thrun, S
    Martin, C
    Liu, YF
    Hähnel, D
    Emery-Montemerlo, R
    Chakrabarti, D
    Burgard, W
    IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, 2004, 20 (03): : 433 - 442
  • [6] Real-time video object segmentation for MPEG encoded video sequences
    Porikli, F
    REAL-TIME IMAGING VIII, 2004, 5297 : 195 - 203
  • [7] Expectation-Maximization Aided Modified Weighted Sequential Energy Detector for Distributed Cooperative Spectrum Sensing
    Rashid, Mohammed
    Nanzer, Jeffrey A.
    IEEE ACCESS, 2025, 13 : 24880 - 24893
  • [8] DMVOS: Discriminative Matching for real-time Video Object Segmentation
    Wen, Peisong
    Yang, Ruolin
    Xu, Qianqian
    Qian, Chen
    Huang, Qingming
    Cong, Runming
    Si, Jianlou
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 2048 - 2056
  • [9] Real-time video object segmentation using HSV space
    Li, N
    Bu, JJ
    Chen, C
    2002 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL II, PROCEEDINGS, 2002, : 85 - 88
  • [10] Real-time and multi-video-object segmentation for compressed video sequences
    Fu Wenxiu
    Wang Bin
    Liu Ming
    ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 3, PROCEEDINGS, 2007, : 747 - +