SELF-SUPERVISION BY PREDICTION FOR OBJECT DISCOVERY IN VIDEOS

Cited by: 1
Authors
Besbinar, Beril [1]
Frossard, Pascal [1]
Affiliations
[1] Ecole Polytech Fed Lausanne EPFL, Signal Proc Lab LTS4, Lausanne, Switzerland
Keywords
Self-supervision; video prediction; object representation; unsupervised scene decomposition
DOI
10.1109/ICIP42928.2021.9506062
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Despite their irresistible success, deep learning algorithms still heavily rely on annotated data, and unsupervised settings pose many challenges, such as finding the right inductive bias in diverse scenarios. In this paper, we propose an object-centric model for image sequence representation that uses the prediction task for self-supervision. By disentangling object representation and motion dynamics, our novel compositional structure explicitly handles occlusion and inpaints inferred objects and background for the composition of the predicted frame. Using auxiliary losses to promote spatially and temporally consistent object representations, we train our self-supervised framework without the help of any annotation or pretrained network. Initial experiments confirm that our new pipeline is a promising step towards object-centric video prediction.
Pages: 1509-1513
Number of pages: 5