Recurrent Dynamic Embedding for Video Object Segmentation

Cited by: 37
Authors
Li, Mingxing [1, 3]
Hu, Li [2]
Xiong, Zhiwei [1]
Zhang, Bang [2]
Pan, Pan [2]
Liu, Dong [1]
Affiliations
[1] Univ Sci & Technol China, Hefei, Peoples R China
[2] Alibaba Grp, Alibaba DAMO Acad, Hangzhou, Peoples R China
[3] Alibaba, Hangzhou, Peoples R China
Funding
National Key Research and Development Program of China; National Natural Science Foundation of China
Keywords
DOI
10.1109/CVPR52688.2022.00139
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification code
081104; 0812; 0835; 1405
Abstract
Space-time memory (STM) based video object segmentation (VOS) networks usually keep enlarging the memory bank every several frames, which yields excellent performance. However, 1) the hardware cannot withstand the ever-increasing memory requirements as the video length increases, and 2) storing lots of information inevitably introduces lots of noise, which is not conducive to reading the most important information from the memory bank. In this paper, we propose a Recurrent Dynamic Embedding (RDE) to build a memory bank of constant size. Specifically, we explicitly generate and update RDE by the proposed Spatio-temporal Aggregation Module (SAM), which exploits the cue of historical information. To avoid error accumulation owing to the recurrent usage of SAM, we propose an unbiased guidance loss during the training stage, which makes SAM more robust in long videos. Moreover, the predicted masks in the memory bank are inaccurate due to the inaccurate network inference, which affects the segmentation of the query frame. To address this problem, we design a novel self-correction strategy so that the network can repair the embeddings of masks with different qualities in the memory bank. Extensive experiments show our method achieves the best tradeoff between performance and speed.
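The memory-growth problem named in the abstract can be illustrated with a toy sketch. This is not the paper's RDE or SAM: the class names, the `stride` parameter, and the fixed blend weight `gate` (standing in for a learned aggregation module) are all hypothetical, chosen only to contrast a bank that grows with video length against a state of constant size.

```python
from typing import List, Tuple

Vector = List[float]


class GrowingMemoryBank:
    """STM-style toy bank: stores one more embedding every `stride` frames."""

    def __init__(self, stride: int = 5) -> None:
        self.stride = stride
        self.entries: List[Vector] = []

    def update(self, frame_index: int, embedding: Vector) -> None:
        # The bank grows without bound as the video gets longer.
        if frame_index % self.stride == 0:
            self.entries.append(list(embedding))


class ConstantSizeMemory:
    """Toy recurrent state: one embedding, overwritten in place each frame."""

    def __init__(self, dim: int, gate: float = 0.5) -> None:
        self.state: Vector = [0.0] * dim
        # Hypothetical fixed blend weight; a real model would learn this step.
        self.gate = gate
        self.initialized = False

    def update(self, embedding: Vector) -> None:
        if not self.initialized:
            self.state = list(embedding)
            self.initialized = True
            return
        g = self.gate
        # Blend the previous state with the new embedding; size is unchanged.
        self.state = [(1 - g) * s + g * e for s, e in zip(self.state, embedding)]


def run(num_frames: int, dim: int = 4) -> Tuple[int, int]:
    """Return (entries stored by the growing bank, size of the constant state)."""
    growing = GrowingMemoryBank(stride=5)
    constant = ConstantSizeMemory(dim=dim)
    for t in range(num_frames):
        emb = [float(t)] * dim  # placeholder per-frame embedding
        growing.update(t, emb)
        constant.update(emb)
    return len(growing.entries), len(constant.state)
```

With these toy settings, `run(100)` returns `(20, 4)` and `run(1000)` returns `(200, 4)`: the growing bank scales with the number of frames, while the recurrent state stays at `dim` values.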
Pages: 1322-1331
Page count: 10