Segmenting Moving Objects via an Object-Centric Layered Representation

被引:0
|
作者
Xie, Junyu [1 ]
Xie, Weidi [1 ,2 ]
Zisserman, Andrew [1 ]
机构
[1] Univ Oxford, Dept Engn Sci, Visual Geometry Grp, Oxford, England
[2] Shanghai Jiao Tong Univ, Coop Medianet Innovat Ctr, Shanghai, Peoples R China
基金
英国工程与自然科学研究理事会;
关键词
SEGMENTATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The objective of this paper is a model that is able to discover, track and segment multiple moving objects in a video. We make four contributions: First, we introduce an object-centric segmentation model with a depth-ordered layer representation. This is implemented using a variant of the transformer architecture that ingests optical flow, where each query vector specifies an object and its layer for the entire video. The model can effectively discover multiple moving objects and handle mutual occlusions; Second, we introduce a scalable pipeline for generating multi-object synthetic training data via layer compositions, that is used to train the proposed model, significantly reducing the requirements for labour-intensive annotations, and supporting Sim2Real generalisation; Third, we conduct thorough ablation studies, showing that the model is able to learn object permanence and temporal shape consistency, and is able to predict amodal segmentation masks; Fourth, we evaluate our model, trained only on synthetic data, on standard video segmentation benchmarks, DAVIS, MoCA, SegTrack, FBMS-59, and achieve state-of-the-art performance among existing methods that do not rely on any manual annotations. With test-time adaptation, we observe further performance boosts.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Learning global object-centric representations via disentangled slot attention
    Chen, Tonglin
    Huang, Yinxuan
    Shen, Zhimeng
    Huang, Jinghao
    Li, Bin
    Xue, Xiangyang
    MACHINE LEARNING, 2025, 114 (02)
  • [22] Vision-Based Object-Centric Safety Assessment Using Fuzzy Inference: Monitoring Struck-By Accidents with Moving Objects
    Kim, Hongjo
    Kim, Kinam
    Kim, Hyoungkwan
    JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2016, 30 (04)
  • [23] OPerA: Object-Centric Performance Analysis
    Park, Gyunam
    Adams, Jan Niklas
    van der Aalst, Wil M. P.
    CONCEPTUAL MODELING (ER 2022), 2022, 13607 : 281 - 292
  • [24] OCπ: Object-Centric Process Insights
    Adams, Jan Niklas
    van der Aalst, Wil M. P.
    APPLICATION AND THEORY OF PETRI NETS AND CONCURRENCY (PETRI NETS 2022), 2022, 13288 : 139 - 150
  • [25] Object-Centric Unsupervised Image Captioning
    Meng, Zihang
    Yang, David
    Cao, Xuefei
    Shah, Ashish
    Lim, Ser-Nam
    COMPUTER VISION, ECCV 2022, PT XXXVI, 2022, 13696 : 219 - 235
  • [26] Discovering Object-centric Petri Nets
    van der Aalst, Wil M. P.
    Berti, Alessandro
    FUNDAMENTA INFORMATICAE, 2020, 175 (1-4) : 1 - 40
  • [27] Enhanced Object Representation on Moving Objects Classification
    Yu, Tin Tin
    Win, Zin Mar
    2019 INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION TECHNOLOGIES (ICAIT), 2019, : 177 - 182
  • [28] Discovery of Object-Centric Declarative Models
    Christfort, Axel K. F.
    Rivkin, Audrey
    Fahland, Dirk
    Hildebrandt, Thomas T.
    Slaats, Tijs
    2024 6TH INTERNATIONAL CONFERENCE ON PROCESS MINING, ICPM, 2024, : 137 - 144
  • [29] Permission Analysis for Object-Centric Processes
    Breitmayer, Marius
    Arnold, Lisa
    Reichert, Manfred
    INTELLIGENT INFORMATION SYSTEMS, CAISE FORUM 2024, 2024, 520 : 11 - 19
  • [30] Object-centric process predictive analytics
    Galanti, Riccardo
    De Leoni, Massimiliano
    Navarin, Nicola
    Marazzi, Alan
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 213