Segmenting Moving Objects via an Object-Centric Layered Representation

Cited by: 0
Authors
Xie, Junyu [1]
Xie, Weidi [1, 2]
Zisserman, Andrew [1]
Affiliations
[1] University of Oxford, Department of Engineering Science, Visual Geometry Group, Oxford, England
[2] Shanghai Jiao Tong University, Cooperative Medianet Innovation Center, Shanghai, People's Republic of China
Funding
UK Engineering and Physical Sciences Research Council (EPSRC)
Keywords
SEGMENTATION
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The objective of this paper is a model that is able to discover, track and segment multiple moving objects in a video. We make four contributions. First, we introduce an object-centric segmentation model with a depth-ordered layer representation. This is implemented using a variant of the transformer architecture that ingests optical flow, where each query vector specifies an object and its layer for the entire video. The model can effectively discover multiple moving objects and handle mutual occlusions. Second, we introduce a scalable pipeline for generating multi-object synthetic training data via layer compositions, which is used to train the proposed model, significantly reducing the requirement for labour-intensive annotations and supporting Sim2Real generalisation. Third, we conduct thorough ablation studies, showing that the model is able to learn object permanence and temporal shape consistency, and is able to predict amodal segmentation masks. Fourth, we evaluate our model, trained only on synthetic data, on standard video segmentation benchmarks (DAVIS, MoCA, SegTrack, FBMS-59) and achieve state-of-the-art performance among existing methods that do not rely on any manual annotations. With test-time adaptation, we observe further performance boosts.
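The depth-ordered layer idea in the abstract can be illustrated with a minimal sketch. It is not the paper's model or code: it only assumes that each object has a full-extent (amodal) binary mask and that layers are listed from front to back, and it derives the visible (modal) masks by letting nearer layers occlude farther ones. The function name `modal_from_amodal` and the list-of-lists mask format are hypothetical choices made for this illustration.

```python
def modal_from_amodal(amodal_layers):
    """Derive visible (modal) masks from depth-ordered amodal masks.

    amodal_layers: list of 2D grids (lists of lists of 0/1), ordered from
    the front-most layer (index 0) to the back-most layer. This ordering
    is an assumption of the sketch, not something taken from the paper.
    """
    if not amodal_layers:
        return []
    height = len(amodal_layers[0])
    width = len(amodal_layers[0][0])
    # Pixels already claimed by a layer in front of the current one.
    occupied = [[0] * width for _ in range(height)]
    modal_layers = []
    for amodal in amodal_layers:
        modal = [[0] * width for _ in range(height)]
        for y in range(height):
            for x in range(width):
                # A pixel is visible only if no nearer layer covers it.
                if amodal[y][x] and not occupied[y][x]:
                    modal[y][x] = 1
        # After this layer is resolved, its full extent occludes what is behind.
        for y in range(height):
            for x in range(width):
                if amodal[y][x]:
                    occupied[y][x] = 1
        modal_layers.append(modal)
    return modal_layers


# Two overlapping objects: the front one hides part of the back one.
front = [[1, 1, 0],
         [0, 0, 0]]
back = [[0, 1, 1],
        [0, 0, 1]]
visible = modal_from_amodal([front, back])
```

In this toy case the front object keeps its whole mask, while the back object loses the single pixel where the two overlap. The same front-to-back compositing could, under similar assumptions, be used to paste object sprites into synthetic frames.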
Pages: 14