Recurrent Dynamic Embedding for Video Object Segmentation

被引:37
|
作者
Li, Mingxing [1 ,3 ]
Hu, Li [2 ]
Xiong, Zhiwei [1 ]
Zhang, Bang [2 ]
Pan, Pan [2 ]
Liu, Dong [1 ]
机构
[1] Univ Sci & Technol China, Hefei, Peoples R China
[2] Alibaba Grp, Alibaba DAMO Acad, Hangzhou, Peoples R China
[3] Alibaba, Hangzhou, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
D O I
10.1109/CVPR52688.2022.00139
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Space-time memory (STM) based video object segmentation (VOS) networks usually keep increasing memory bank every several frames, which shows excellent performance. However; 1) the hardware cannot withstand the ever-increasing memory requirements as the video length increases. 2) Storing lots of information inevitably introduces lots of noise, which is not conducive to reading the most important information from the memory bank In this paper, we propose a Recurrent Dynamic Embedding (RDE) to build a memory bank of constant size. Specifically, we explicitly generate and update RDE by the proposed Spatio-temporal Aggregation Module (SAM), which exploits the cue of historical information. To avoid error accumulation owing to the recurrent usage of SAM, we propose an unbiased guidance loss during the training stage, which makes SAM more robust in long videos. Moreover, the predicted masks in the memory bank are inaccurate due to the inaccurate network inference, which affects the segmentation of the query frame. To address this problem, we design a novel self-correction strategy so that the network can repair the embeddings of masks with different qualities in the memory bank Extensive experiments show our method achieves the best tradeoff between performance and speed.
引用
收藏
页码:1322 / 1331
页数:10
相关论文
共 50 条
  • [21] Learning Quality-aware Dynamic Memory for Video Object Segmentation
    Liu, Yong
    Yu, Ran
    Yin, Fei
    Zhao, Xinyuan
    Zhao, Wei
    Xia, Weihao
    Yang, Yujiu
    COMPUTER VISION, ECCV 2022, PT XXIX, 2022, 13689 : 468 - 486
  • [22] Joint Video Object Discovery and Segmentation by Coupled Dynamic Markov Networks
    Liu, Ziyi
    Wang, Le
    Hua, Gang
    Zhang, Qilin
    Niu, Zhenxing
    Wu, Ying
    Zheng, Nanning
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (12) : 5840 - 5853
  • [23] Video object segmentation based on dynamic perception update and feature fusion
    Hou, Zhiqiang
    Li, Fucheng
    Dong, Jiale
    Dai, Nan
    Ma, Sugang
    Fan, Jiulun
    IMAGE AND VISION COMPUTING, 2024, 150
  • [24] Video Object of Interest Segmentation
    Zhou, Siyuan
    Zhan, Chunru
    Wang, Biao
    Ge, Tiezheng
    Jiang, Yuning
    Niu, Li
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 3805 - 3813
  • [25] An Overview of Video Object Segmentation
    Zhu, Shiping
    Guo, Zhichao
    2012 INTERNATIONAL CONFERENCE ON INDUSTRIAL CONTROL AND ELECTRONICS ENGINEERING (ICICEE), 2012, : 1019 - 1021
  • [26] Gamifying Video Object Segmentation
    Spampinato, Concetto
    Palazzo, Simone
    Giordano, Daniela
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (10) : 1942 - 1958
  • [27] On guiding video object segmentation
    Ortego, Diego
    McGuinness, Kevin
    SanMiguel, Juan C.
    Arazo, Eric
    Martinez, Jose M.
    O'Connor, Noel E.
    2019 INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2019,
  • [28] Video object clustering segmentation
    Lin, Q
    Zhang, X
    2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 2840 - 2843
  • [29] Object segmentation for video coding
    Chen, LH
    Chen, JR
    Liao, HY
    15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, PROCEEDINGS: IMAGE, SPEECH AND SIGNAL PROCESSING, 2000, : 383 - 386
  • [30] Hierarchical Video Object Segmentation
    Xing, Junliang
    Ai, Haizhou
    Lao, Shihong
    2011 FIRST ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2011, : 67 - 71