Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation

被引：0

作者：

Cheng, Ho Kei ^{[1
]}

Tai, Yu-Wing ^{[2
]}

Tang, Chi-Keung ^{[3
]}

机构：

[1] Univ Illinois, Urbana, IL 61801 USA

[2] Kuaishou Technol, Beijing, Peoples R China

[3] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021) | 2021年 / 34卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper presents a simple yet effective approach to modeling space-time correspondences in the context of video object segmentation. Unlike most existing approaches, we establish correspondences directly between frames without re-encoding the mask features for every object, leading to a highly efficient and robust framework. With the correspondences, every node in the current query frame is inferred by aggregating features from the past in an associative fashion. We cast the aggregation process as a voting problem and find that the existing inner-product affinity leads to poor use of memory with a small (fixed) subset of memory nodes dominating the votes, regardless of the query. In light of this phenomenon, we propose using the negative squared Euclidean distance instead to compute the affinities. We validate that every memory node now has a chance to contribute, and experimentally show that such diversified voting is beneficial to both memory efficiency and inference accuracy. The synergy of correspondence networks and diversified voting works exceedingly well, achieves new state-of-the-art results on both DAVIS and YouTubeVOS datasets while running significantly faster at 20+ FPS for multiple objects without bells and whistles.

引用

页数：14

共 50 条

[1] Video Object Segmentation using Space-Time Memory Networks
Oh, Seoung Wug
Lee, Joon-Young
Xu, Ning
Kim, Seon Joo
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9225 - 9234
[2] Space-Time Memory Networks for Video Object Segmentation With User Guidance
Oh, Seoung Wug
Lee, Joon-Young
Xu, Ning
Kim, Seon Joo
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (01) : 442 - 455
[3] Space-time Reinforcement Network for Video Object Segmentation
School of Computer Science, Nanjing University of Information Science and Technology, Nanjing, China
不详
不详
arXiv,
[4] Memory Aggregation Networks for Efficient Interactive Video Object Segmentation
Miao, Jiaxu
Wei, Yunchao
Yang, Yi
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 10363 - 10372
[5] Boosting Video Object Segmentation via Space-time Correspondence Learning
Zhang, Yurong
Li, Liulei
Wang, Wenguan
Xie, Rong
Song, Li
Zhang, Wenjun
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 2246 - 2256
[6] Adaptive Sparse Memory Networks for Efficient and Robust Video Object Segmentation
Dang, Jisheng
Zheng, Huicheng
Xu, Xiaohao
Wang, Longguang
Hu, Qingyong
Guo, Yulan
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (02) : 3820 - 3833
[7] Multi-Object Tracking and Segmentation with a Space-Time Memory Network
Miah, Mehdi
Bilodeau, Guillaume-Alexandre
Saunier, Nicolas
2023 20TH CONFERENCE ON ROBOTS AND VISION, CRV, 2023, : 184 - 193
[8] Efficient Regional Memory Network for Video Object Segmentation
Xie, Haozhe
Yao, Hongxun
Zhou, Shangchen
Zhang, Shengping
Sun, Wenxiu
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 1286 - 1295
[9] Robust and Efficient Memory Network for Video Object Segmentation
Chen, Yadang
Zhang, Dingwei
Yang, Zhi-Xin
Wu, Enhua
2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1769 - 1774
[10] Temporo-Spatial Parallel Sparse Memory Networks for Efficient Video Object Segmentation
Dang, Jisheng
Zheng, Huicheng
Wang, Bimei
Wang, Longguang
Guo, Yulan
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 17291 - 17304

← 1 2 3 4 5 →