Segmenting Moving Objects via an Object-Centric Layered Representation

被引：0

作者：

Xie, Junyu ^{[1
]}

Xie, Weidi ^{[1
,2
]}

Zisserman, Andrew ^{[1
]}

机构：

[1] Univ Oxford, Dept Engn Sci, Visual Geometry Grp, Oxford, England

[2] Shanghai Jiao Tong Univ, Coop Medianet Innovat Ctr, Shanghai, Peoples R China

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022) | 2022年

基金：

英国工程与自然科学研究理事会;

关键词：

SEGMENTATION;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The objective of this paper is a model that is able to discover, track and segment multiple moving objects in a video. We make four contributions: First, we introduce an object-centric segmentation model with a depth-ordered layer representation. This is implemented using a variant of the transformer architecture that ingests optical flow, where each query vector specifies an object and its layer for the entire video. The model can effectively discover multiple moving objects and handle mutual occlusions; Second, we introduce a scalable pipeline for generating multi-object synthetic training data via layer compositions, that is used to train the proposed model, significantly reducing the requirements for labour-intensive annotations, and supporting Sim2Real generalisation; Third, we conduct thorough ablation studies, showing that the model is able to learn object permanence and temporal shape consistency, and is able to predict amodal segmentation masks; Fourth, we evaluate our model, trained only on synthetic data, on standard video segmentation benchmarks, DAVIS, MoCA, SegTrack, FBMS-59, and achieve state-of-the-art performance among existing methods that do not rely on any manual annotations. With test-time adaptation, we observe further performance boosts.

引用

页数：14

共 50 条

[41] Generalization and Robustness Implications in Object-Centric Learning
Dittadi, Andrea
Papa, Samuele
De Vita, Michele
Scholkopf, Bernhard
Winther, Ole
Locatello, Francesco
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
[42] Precision and Fitness in Object-Centric Process Mining
Adams, Jan Niklas
van der Aalst, Wil M. P.
2021 3RD INTERNATIONAL CONFERENCE ON PROCESS MINING (ICPM 2021), 2021, : 128 - 135
[43] An Object-Centric Paradigm for Robot Programming by Demonstration
Huang, Di-Wei
Katz, Garrett E.
Langsfeld, Joshua D.
Oh, Hyuk
Gentili, Rodolphe J.
Reggia, James A.
FOUNDATIONS OF AUGMENTED COGNITION, AC 2015, 2015, 9183 : 745 - 756
[44] SSVEP stimuli design for object-centric BCI
Gergondet, Pierre
Kheddar, Abderrahmane
BRAIN-COMPUTER INTERFACES, 2015, 2 (01) : 11 - 28
[45] Object-Centric Diffusion for Efficient Video Editing
Kahatapitiya, Kumara
Karjauv, Adil
Abati, Davide
Porikli, Fatih
Asano, Yuki M.
Habibian, Amirhossein
COMPUTER VISION-ECCV 2024, PT LVII, 2025, 15115 : 91 - 108
[46] Object-Centric Spatial Pooling for Image Classification
Russakovsky, Olga
Lin, Yuanqing
Yu, Kai
Li Fei-Fei
COMPUTER VISION - ECCV 2012, PT II, 2012, 7573 : 1 - 15
[47] DJXPerf: Identifying Memory Inefficiencies via Object-Centric Profiling for Java']Java
Li, Bolun
Su, Pengfei
Chabbi, Milind
Jiao, Shuyin
Liu, Xu
PROCEEDINGS OF THE 21ST ACM/IEEE INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION, CGO 2023, 2023, : 81 - 94
[48] Sparse 3D Reconstruction via Object-Centric Ray Sampling
Cerkezi, Llukman
Favaro, Paolo
2024 INTERNATIONAL CONFERENCE IN 3D VISION, 3DV 2024, 2024, : 432 - 441
[49] Scaling Object-centric Robotic Manipulation with Multimodal Object Identification
Mitash, Chaitanya
Hussein, Mostafa
Vanbaar, Jeroen
Terhuja, Vikedo
Katyal, Kapil
2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2024, 2024, : 1913 - 1920
[50] Operational process monitoring: An object-centric approach
Park, Gyunam
van der Aalst, Wil M. P.
COMPUTERS IN INDUSTRY, 2025, 164

← 1 2 3 4 5 →