Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation

被引:0
|
作者
Cheng, Ho Kei [1 ]
Tai, Yu-Wing [2 ]
Tang, Chi-Keung [3 ]
机构
[1] Univ Illinois, Urbana, IL 61801 USA
[2] Kuaishou Technol, Beijing, Peoples R China
[3] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a simple yet effective approach to modeling space-time correspondences in the context of video object segmentation. Unlike most existing approaches, we establish correspondences directly between frames without re-encoding the mask features for every object, leading to a highly efficient and robust framework. With the correspondences, every node in the current query frame is inferred by aggregating features from the past in an associative fashion. We cast the aggregation process as a voting problem and find that the existing inner-product affinity leads to poor use of memory with a small (fixed) subset of memory nodes dominating the votes, regardless of the query. In light of this phenomenon, we propose using the negative squared Euclidean distance instead to compute the affinities. We validate that every memory node now has a chance to contribute, and experimentally show that such diversified voting is beneficial to both memory efficiency and inference accuracy. The synergy of correspondence networks and diversified voting works exceedingly well, achieves new state-of-the-art results on both DAVIS and YouTubeVOS datasets while running significantly faster at 20+ FPS for multiple objects without bells and whistles.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Video Object Segmentation using Space-Time Memory Networks
    Oh, Seoung Wug
    Lee, Joon-Young
    Xu, Ning
    Kim, Seon Joo
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9225 - 9234
  • [2] Space-Time Memory Networks for Video Object Segmentation With User Guidance
    Oh, Seoung Wug
    Lee, Joon-Young
    Xu, Ning
    Kim, Seon Joo
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (01) : 442 - 455
  • [3] Space-time Reinforcement Network for Video Object Segmentation
    School of Computer Science, Nanjing University of Information Science and Technology, Nanjing, China
    不详
    不详
    arXiv,
  • [4] Memory Aggregation Networks for Efficient Interactive Video Object Segmentation
    Miao, Jiaxu
    Wei, Yunchao
    Yang, Yi
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 10363 - 10372
  • [5] Boosting Video Object Segmentation via Space-time Correspondence Learning
    Zhang, Yurong
    Li, Liulei
    Wang, Wenguan
    Xie, Rong
    Song, Li
    Zhang, Wenjun
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 2246 - 2256
  • [6] Adaptive Sparse Memory Networks for Efficient and Robust Video Object Segmentation
    Dang, Jisheng
    Zheng, Huicheng
    Xu, Xiaohao
    Wang, Longguang
    Hu, Qingyong
    Guo, Yulan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (02) : 3820 - 3833
  • [7] Multi-Object Tracking and Segmentation with a Space-Time Memory Network
    Miah, Mehdi
    Bilodeau, Guillaume-Alexandre
    Saunier, Nicolas
    2023 20TH CONFERENCE ON ROBOTS AND VISION, CRV, 2023, : 184 - 193
  • [8] Efficient Regional Memory Network for Video Object Segmentation
    Xie, Haozhe
    Yao, Hongxun
    Zhou, Shangchen
    Zhang, Shengping
    Sun, Wenxiu
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 1286 - 1295
  • [9] Robust and Efficient Memory Network for Video Object Segmentation
    Chen, Yadang
    Zhang, Dingwei
    Yang, Zhi-Xin
    Wu, Enhua
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1769 - 1774
  • [10] Temporo-Spatial Parallel Sparse Memory Networks for Efficient Video Object Segmentation
    Dang, Jisheng
    Zheng, Huicheng
    Wang, Bimei
    Wang, Longguang
    Guo, Yulan
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 17291 - 17304