SELF-SUPERVISION BY PREDICTION FOR OBJECT DISCOVERY IN VIDEOS

Cited by: 1
Authors
Besbinar, Beril [1]
Frossard, Pascal [1]
Affiliations
[1] Ecole Polytech Fed Lausanne EPFL, Signal Proc Lab LTS4, Lausanne, Switzerland
Keywords
Self-supervision; video prediction; object representation; unsupervised scene decomposition
DOI
10.1109/ICIP42928.2021.9506062
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Despite their irresistible success, deep learning algorithms still heavily rely on annotated data, and unsupervised settings pose many challenges, such as finding the right inductive bias in diverse scenarios. In this paper, we propose an object-centric model for image sequence representation that uses the prediction task for self-supervision. By disentangling object representation and motion dynamics, our novel compositional structure explicitly handles occlusion and inpaints inferred objects and background for the composition of the predicted frame. Using auxiliary losses to promote spatially and temporally consistent object representations, we train our self-supervised framework without the help of any annotation or pretrained network. Initial experiments confirm that our new pipeline is a promising step towards object-centric video prediction.
Pages: 1509-1513
Page count: 5