SELF-SUPERVISION BY PREDICTION FOR OBJECT DISCOVERY IN VIDEOS

被引：1

作者：

Besbinar, Beril ^{[1
]}

Frossard, Pascal ^{[1
]}

机构：

[1] Ecole Polytech Fed Lausanne EPFL, Signal Proc Lab LTS4, Lausanne, Switzerland

来源：

2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2021年

关键词：

Self-supervision; video prediction; object representation; unsupervised scene decomposition;

D O I：

10.1109/ICIP42928.2021.9506062

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Despite their irresistible success, deep learning algorithms still heavily rely on annotated data, and unsupervised settings pose many challenges, such as finding the right inductive bias in diverse scenarios. In this paper, we propose an object-centric model for image sequence representation that uses the prediction task for self-supervision. By disentangling object representation and motion dynamics, our novel compositional structure explicitly handles occlusion and inpaints inferred objects and background for the composition of the predicted frame. Using auxiliary losses to promote spatially and temporally consistent object representations, we train our self-supervised framework without the help of any annotation or pretrained network. Initial experiments confirm that our new pipeline is a promising step towards object-centric video prediction.

引用

页码：1509 / 1513

页数：5

共 50 条

[1] Link Prediction with Contextualized Self-Supervision
Zhang, Daokun
Yin, Jie
Yu, Philip S. S.
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (07) : 7138 - 7151
[2] Cyclical Self-Supervision for Semi-Supervised Ejection Fraction Prediction From Echocardiogram Videos
Dai, Weihang
Li, Xiaomeng
Ding, Xinpeng
Cheng, Kwang-Ting
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (05) : 1446 - 1461
[3] Instance-Aware Multi-Object Self-Supervision for Monocular Depth Prediction
Boulahbal, Houssem Eddine
Voicila, Adrian
Comport, Andrew, I
IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) : 10962 - 10968
[4] THE FEASIBILITY OF SELF-SUPERVISION
Hudelson, Earl
JOURNAL OF EDUCATIONAL RESEARCH, 1952, 45 (05): : 335 - 347
[5] Labelling unlabelled videos from scratch with multi-modal self-supervision
Asano, Yuki M.
Patrick, Mandela
Rupprecht, Christian
Vedaldi, Andrea
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[6] Equivariant Spatio-temporal Self-supervision for LiDAR Object Detection
Hegde, Deepti
Lohit, Suhas
Peng, Kuan-Chuan
Jones, Michael J.
Patel, Vishal M.
COMPUTER VISION - ECCV 2024, PT XXVI, 2025, 15084 : 475 - 491
[7] Explainable Action Prediction through Self-Supervision on Scene Graphs
Kochakarn, Pawit
Martini, Daniele De
Omeiza, Daniel
Kunze, Lars
2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 1479 - 1485
[8] Self-supervision, surveillance and transgression
Simon, Gail
JOURNAL OF FAMILY THERAPY, 2010, 32 (03) : 308 - 325
[9] Anomalies, representations, and self-supervision
Dillon, Barry M.
Favaro, Luigi
Feiden, Friedrich
Modak, Tanmoy
Plehn, Tilman
SCIPOST PHYSICS CORE, 2024, 7 (03):
[10] Symmetries, safety, and self-supervision
Dillon, Barry M.
Kasieczka, Gregor
Olischlaeger, Hans
Plehn, Tilman
Sorrenson, Peter
Vogel, Lorenz
SCIPOST PHYSICS, 2022, 12 (06):

← 1 2 3 4 5 →