SWEM: Towards Real-Time Video Object Segmentation with Sequential Weighted Expectation-Maximization

被引:22
|
作者
Lin, Zhihui [1 ,2 ]
Yang, Tianyu [2 ]
Li, Maomao [2 ]
Wang, Ziyu [3 ]
Yuan, Chun [4 ]
Jiang, Wenhao [3 ]
Liu, Wei [3 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
[2] Tencent AI Lab, Shenzhen, Peoples R China
[3] Tencent Data Platform, Shenzhen, Peoples R China
[4] Tsinghua Shenzhen Int Grad Sch, Peng Cheng Lab, Shenzhen, Peoples R China
关键词
D O I
10.1109/CVPR52688.2022.00142
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Matching-based methods, especially those based on space-time memory, are significantly ahead of other solutions in semi-supervised video object segmentation (VOS). However, continuously growing and redundant template features lead to an inefficient inference. To alleviate this, we propose a novel Sequential Weighted Expectation-Maximization (SWEM) network to greatly reduce the redundancy of memory features. Different from the previous methods which only detect feature redundancy between frames, SWEM merges both intra-frame and inter-frame similar features by leveraging the sequential weighted EM algorithm. Further, adaptive weights for frame features endow SWEM with the flexibility to represent hard samples, improving the discrimination of templates. Besides, the proposed method maintains a fixed number of template features in memory, which ensures the stable inference complexity of the VOS system. Extensive experiments on commonly used DAVIS and YouTube-VOS datasets verify the high efficiency (36 FPS) and high performance (84.3% JSzT on DAVIS 2017 validation dataset) of SWEM.
引用
收藏
页码:1352 / 1362
页数:11
相关论文
共 50 条
  • [21] CNN Implementation of a Moving Object Segmentation Approach for Real-Time Video Surveillance
    Rodriguez-Fernandez, D.
    Vilarino, D. L.
    Pardo, X. M.
    2008 11TH INTERNATIONAL WORKSHOP ON CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS, 2008, : 129 - 134
  • [22] Focal-plane moving object segmentation for real-time video surveillance
    Lopez Vilarino, David
    Dudek, Piotr
    Cabello Ferrer, Diego
    PROCEEDINGS OF 2008 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-10, 2008, : 1600 - +
  • [23] Real-time object segmentation and coding for selective-quality video communications
    Challapali, K
    Brodsky, T
    Lin, YT
    Yan, Y
    Chen, RY
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2004, 14 (06) : 813 - 824
  • [24] Towards Real-Time Segmentation on the Edge
    Li, Yanyu
    Yang, Changdi
    Zhao, Pu
    Yuan, Geng
    Niu, Wei
    Guan, Jiexiong
    Tang, Hao
    Qin, Minghai
    Jin, Qing
    Ren, Bin
    Lin, Xue
    Wang, Yanzhi
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 1468 - 1476
  • [25] Real Time Video Object Segmentation in Compressed Domain
    Tan, Zhentao
    Liu, Bin
    Chu, Qi
    Zhong, Hangshi
    Wu, Yue
    Li, Weihai
    Yu, Nenghai
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (01) : 175 - 188
  • [26] Real-Time Tracking Combined with Object Segmentation
    Wang, Hongzhi
    Sang, Nong
    Yan, Yi
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 4098 - 4103
  • [27] Real-Time Object-Based Video Segmentation Using Colour Segmentation and Connected Component Labeling
    Jau, U. L.
    Teh, C. S.
    VISUAL INFORMATICS: BRIDGING RESEARCH AND PRACTICE, 2009, 5857 : 110 - 121
  • [28] Real-time object segmentation based on GPU
    Lee, Sun-Ju
    Jeong, Chang-Sung
    2006 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY, PTS 1 AND 2, PROCEEDINGS, 2006, : 739 - 742
  • [29] Real-Time Moving Object Segmentation and Classification From HEVC Compressed Surveillance Video
    Zhao, Liang
    He, Zhihai
    Cao, Wenming
    Zhao, Debin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (06) : 1346 - 1357
  • [30] MobileVOS: Real-Time Video Object Segmentation Contrastive Learning meets Knowledge Distillation
    Miles, Roy
    Yucel, Mehmet Kerim
    Manganelli, Bruno
    Saa-Garriga, Albert
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 10480 - 10490