Segmenting Moving Objects via an Object-Centric Layered Representation

Authors
Xie, Junyu [1]
Xie, Weidi [1,2]
Zisserman, Andrew [1]
Affiliations
[1] Univ Oxford, Dept Engn Sci, Visual Geometry Grp, Oxford, England
[2] Shanghai Jiao Tong Univ, Coop Medianet Innovat Ctr, Shanghai, Peoples R China
Funding
Engineering and Physical Sciences Research Council (UK);
Keywords
SEGMENTATION;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
The objective of this paper is a model that is able to discover, track and segment multiple moving objects in a video. We make four contributions: First, we introduce an object-centric segmentation model with a depth-ordered layer representation. This is implemented using a variant of the transformer architecture that ingests optical flow, where each query vector specifies an object and its layer for the entire video. The model can effectively discover multiple moving objects and handle mutual occlusions; Second, we introduce a scalable pipeline for generating multi-object synthetic training data via layer compositions, that is used to train the proposed model, significantly reducing the requirements for labour-intensive annotations, and supporting Sim2Real generalisation; Third, we conduct thorough ablation studies, showing that the model is able to learn object permanence and temporal shape consistency, and is able to predict amodal segmentation masks; Fourth, we evaluate our model, trained only on synthetic data, on standard video segmentation benchmarks, DAVIS, MoCA, SegTrack, FBMS-59, and achieve state-of-the-art performance among existing methods that do not rely on any manual annotations. With test-time adaptation, we observe further performance boosts.
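The depth-ordered layer idea mentioned in the abstract can be illustrated with a small sketch. This is not the authors' model or data pipeline; it only shows, under stated assumptions, how full-shape (amodal) masks stacked in a known depth order yield the visible (modal) masks once occlusion is applied. The function name `compose_modal_masks`, the list-of-lists mask representation, and the front-to-back ordering convention are all hypothetical choices made for illustration.

```python
from typing import List

# Hypothetical representation: a binary mask as rows of 0/1 values.
Mask = List[List[int]]


def compose_modal_masks(amodal_masks: List[Mask]) -> List[Mask]:
    """Turn depth-ordered amodal masks into visible (modal) masks.

    Assumption: amodal_masks[0] is the front-most layer. Each later
    layer is hidden wherever an earlier layer already covers a pixel.
    """
    if not amodal_masks:
        return []
    height, width = len(amodal_masks[0]), len(amodal_masks[0][0])
    occupied = [[0] * width for _ in range(height)]  # pixels claimed so far
    modal_masks = []
    for mask in amodal_masks:
        visible = [[0] * width for _ in range(height)]
        for y in range(height):
            for x in range(width):
                if mask[y][x] and not occupied[y][x]:
                    visible[y][x] = 1
                    occupied[y][x] = 1
        modal_masks.append(visible)
    return modal_masks


# Two overlapping objects: the front one occludes part of the back one.
front = [[1, 1, 0, 0],
         [1, 1, 0, 0]]
back = [[0, 1, 1, 1],
        [0, 1, 1, 1]]
modal_front, modal_back = compose_modal_masks([front, back])
```

In this toy case the front object stays fully visible, while the back object loses the column it shares with the front one. Its amodal mask still records the full shape, which is the distinction between amodal and modal segmentation that the abstract refers to.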
Pages: 14