Efficient Regional Memory Network for Video Object Segmentation

被引:103
|
作者
Xie, Haozhe [1 ,2 ]
Yao, Hongxun [1 ]
Zhou, Shangchen [3 ]
Zhang, Shengping [1 ,4 ]
Sun, Wenxiu [2 ,5 ]
机构
[1] Harbin Inst Technol, Harbin, Peoples R China
[2] SenseTime Res & Tetras AI, Hong Kong, Peoples R China
[3] Nanyang Technol Univ, S Lab, Singapore, Singapore
[4] Peng Cheng Lab, Shenzhen, Peoples R China
[5] Shanghai AI Lab, Shanghai, Peoples R China
来源
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR46437.2021.00134
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, several Space-Time Memory based networks have shown that the object cues (e.g. video frames as well as the segmented object masks) from the past frames are useful for segmenting objects in the current frame. However, these methods exploit the information from the memory by global-to-global matching between the current and past frames, which lead to mismatching to similar objects and high computational complexity. To address these problems, we propose a novel local-to-local matching solution for semi-supervised VOS, namely Regional Memory Network (RMNet). In RMNet, the precise regional memory is constructed by memorizing local regions where the target objects appear in the past frames. For the current query frame, the query regions are tracked and predicted based on the optical flow estimated from the previous frame. The proposed local-to-local matching effectively alleviates the ambiguity of similar objects in both memory and query frames, which allows the information to be passed from the regional memory to the query region efficiently and effectively. Experimental results indicate that the proposed RMNet performs favorably against state-of-the-art methods on the DAVIS and YouTube-VOS datasets.
引用
收藏
页码:1286 / 1295
页数:10
相关论文
共 50 条
  • [11] Unsupervised Video Object Segmentation via Prototype Memory Network
    Lee, Minhyeok
    Cho, Suhwan
    Lee, Seunghoon
    Park, Chaewon
    Lee, Sangyoun
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 5913 - 5923
  • [12] Adaptive Sparse Memory Networks for Efficient and Robust Video Object Segmentation
    Dang, Jisheng
    Zheng, Huicheng
    Xu, Xiaohao
    Wang, Longguang
    Hu, Qingyong
    Guo, Yulan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (02) : 3820 - 3833
  • [13] Video Object Segmentation Using Kernelized Memory Network With Multiple Kernels
    Seong, Hongje
    Hyun, Junhyuk
    Kim, Euntai
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (02) : 2595 - 2612
  • [14] Video Object Segmentation using Point-based Memory Network
    Gao, Mingqi
    Han, Jungong
    Zheng, Feng
    Yu, James J. Q.
    Montana, Giovanni
    PATTERN RECOGNITION, 2023, 134
  • [15] An efficient video object segmentation scheme
    Ong, EP
    Tye, BJ
    Lin, WS
    Etoh, M
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 3361 - 3364
  • [16] Fast Real-Time Video Object Segmentation with a Tangled Memory Network
    Mei, Jianbiao
    Wang, Mengmeng
    Yang, Yu
    Li, Yanjun
    Liu, Yong
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2023, 14 (03)
  • [17] Alignment Before Aggregation: Trajectory Memory Retrieval Network for Video Object Segmentation
    Sun, Rui
    Wang, Yuan
    Mai, Huayu
    Zhang, Tianzhu
    Wu, Feng
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 1218 - 1228
  • [18] Learning Video Object Segmentation with Visual Memory
    Tokmakov, Pavel
    Inria, Karteek Alahari
    Schmid, Cordelia
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4491 - 4500
  • [19] Adaptive Memory Management for Video Object Segmentation
    Pourganjalikhan, Ali
    Poullis, Charalambos
    2022 19TH CONFERENCE ON ROBOTS AND VISION (CRV 2022), 2022, : 75 - 82
  • [20] Temporo-Spatial Parallel Sparse Memory Networks for Efficient Video Object Segmentation
    Dang, Jisheng
    Zheng, Huicheng
    Wang, Bimei
    Wang, Longguang
    Guo, Yulan
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 17291 - 17304