SELF-SUPERVISION BY PREDICTION FOR OBJECT DISCOVERY IN VIDEOS

被引:1
|
作者
Besbinar, Beril [1 ]
Frossard, Pascal [1 ]
机构
[1] Ecole Polytech Fed Lausanne EPFL, Signal Proc Lab LTS4, Lausanne, Switzerland
关键词
Self-supervision; video prediction; object representation; unsupervised scene decomposition;
D O I
10.1109/ICIP42928.2021.9506062
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite their irresistible success, deep learning algorithms still heavily rely on annotated data, and unsupervised settings pose many challenges, such as finding the right inductive bias in diverse scenarios. In this paper, we propose an object-centric model for image sequence representation that uses the prediction task for self-supervision. By disentangling object representation and motion dynamics, our novel compositional structure explicitly handles occlusion and inpaints inferred objects and background for the composition of the predicted frame. Using auxiliary losses to promote spatially and temporally consistent object representations, we train our self-supervised framework without the help of any annotation or pretrained network. Initial experiments confirm that our new pipeline is a promising step towards object-centric video prediction.
引用
收藏
页码:1509 / 1513
页数:5
相关论文
共 50 条
  • [1] Link Prediction with Contextualized Self-Supervision
    Zhang, Daokun
    Yin, Jie
    Yu, Philip S. S.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (07) : 7138 - 7151
  • [2] Cyclical Self-Supervision for Semi-Supervised Ejection Fraction Prediction From Echocardiogram Videos
    Dai, Weihang
    Li, Xiaomeng
    Ding, Xinpeng
    Cheng, Kwang-Ting
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (05) : 1446 - 1461
  • [3] Instance-Aware Multi-Object Self-Supervision for Monocular Depth Prediction
    Boulahbal, Houssem Eddine
    Voicila, Adrian
    Comport, Andrew, I
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) : 10962 - 10968
  • [4] THE FEASIBILITY OF SELF-SUPERVISION
    Hudelson, Earl
    JOURNAL OF EDUCATIONAL RESEARCH, 1952, 45 (05): : 335 - 347
  • [5] Labelling unlabelled videos from scratch with multi-modal self-supervision
    Asano, Yuki M.
    Patrick, Mandela
    Rupprecht, Christian
    Vedaldi, Andrea
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [6] Equivariant Spatio-temporal Self-supervision for LiDAR Object Detection
    Hegde, Deepti
    Lohit, Suhas
    Peng, Kuan-Chuan
    Jones, Michael J.
    Patel, Vishal M.
    COMPUTER VISION - ECCV 2024, PT XXVI, 2025, 15084 : 475 - 491
  • [7] Explainable Action Prediction through Self-Supervision on Scene Graphs
    Kochakarn, Pawit
    Martini, Daniele De
    Omeiza, Daniel
    Kunze, Lars
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 1479 - 1485
  • [8] Self-supervision, surveillance and transgression
    Simon, Gail
    JOURNAL OF FAMILY THERAPY, 2010, 32 (03) : 308 - 325
  • [9] Anomalies, representations, and self-supervision
    Dillon, Barry M.
    Favaro, Luigi
    Feiden, Friedrich
    Modak, Tanmoy
    Plehn, Tilman
    SCIPOST PHYSICS CORE, 2024, 7 (03):
  • [10] Symmetries, safety, and self-supervision
    Dillon, Barry M.
    Kasieczka, Gregor
    Olischlaeger, Hans
    Plehn, Tilman
    Sorrenson, Peter
    Vogel, Lorenz
    SCIPOST PHYSICS, 2022, 12 (06):