Multi-motion and Appearance Self-Supervised Moving Object Detection

Cited by: 2
Authors
Yang, Fan [1, 2]
Karanam, Srikrishna [1]
Zheng, Meng [1]
Chen, Terrence [1]
Ling, Haibin [3]
Wu, Ziyan [1]
Affiliations
[1] United Imaging Intelligence, Cambridge, MA 02140 USA
[2] Temple Univ, Philadelphia, PA 19122 USA
[3] SUNY Stony Brook, Stony Brook, NY 11794 USA
Keywords
SEGMENTATION
DOI
10.1109/WACV51458.2022.00216
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this work, we consider the problem of self-supervised Moving Object Detection (MOD) in video, where no ground truth is involved in either the training or the inference phase. Recently, an adversarial learning framework was proposed [32] to leverage inherent temporal information for MOD. While it shows very promising results, it uses single-scale temporal information and may run into problems when dealing with a deformable object whose parts exhibit motion at multiple scales. Additional challenges can arise from a moving camera, which causes the motion independence hypothesis to fail and produces locally independent background motion. To deal with these problems, we propose a Multi-motion and Appearance Self-supervised Network (MASNet) that introduces multi-scale motion information and scene appearance information for MOD. In particular, a moving object, especially a deformable one, usually consists of regions moving at various temporal scales. Introducing multi-scale motion can aggregate these regions to form a more complete detection. Appearance information can serve as another cue for MOD when motion independence is not reliable, and can help remove false detections in the background caused by locally independent background motion. To encode multi-scale motion and appearance, we design a multi-branch flow encoding module and an image inpainter module in MASNet, respectively. The proposed modules and MASNet are extensively evaluated on the DAVIS dataset to demonstrate their effectiveness and superiority over state-of-the-art self-supervised methods.
Pages: 2101-2110
Number of pages: 10
Related papers
50 in total
  • [31] Self-supervised object detection from audio-visual correspondence
    Afouras, Triantafyllos
    Asano, Yuki M.
    Fagan, Francois
    Vedaldi, Andrea
    Metze, Florian
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 10565 - 10576
  • [32] Self-Supervised Pretraining for RGB-D Salient Object Detection
    Zhao, Xiaoqi
    Pang, Youwei
    Zhang, Lihe
    Lu, Huchuan
    Ruan, Xiang
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 3463 - 3471
  • [33] Self-Supervised Pretraining for Point Cloud Object Detection in Autonomous Driving
    Shi, Weijing
    Rajkumar, Ragunathan
    2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2022, : 4341 - 4348
  • [34] Self-Supervised Linear Motion Deblurring
    Liu, Peidong
    Janai, Joel
    Pollefeys, Marc
    Sattler, Torsten
    Geiger, Andreas
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (02): : 2475 - 2482
  • [35] Self-supervised Learning of Motion Capture
    Tung, Hsiao-Yu Fish
    Tung, Hsiao-Wei
    Yumer, Ersin
    Fragkiadaki, Katerina
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [36] Multi-Task Self-Supervised Learning for Disfluency Detection
    Wang, Shaolei
    Che, Wanxiang
    Liu, Qi
    Qin, Pengda
    Liu, Ting
    Wang, William Yang
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 9193 - 9200
  • [37] A Self-supervised Architecture for Moving Obstacles Classification
    Katz, Roman
    Douillard, Bertrand
    Nieto, Juan
    Nebot, Eduardo
    2008 IEEE/RSJ INTERNATIONAL CONFERENCE ON ROBOTS AND INTELLIGENT SYSTEMS, VOLS 1-3, CONFERENCE PROCEEDINGS, 2008, : 155 - 160
  • [38] Object representation enhancement for self-supervised colocalization
    Li, Huifang
    Li, Yidong
    Jin, Yi
    Wang, Tao
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (11) : 8277 - 8290
  • [39] Self-supervised Amodal Video Object Segmentation
    Yao, Jian
    Hong, Yuxin
    Wang, Chiyu
    Xiao, Tianjun
    He, Tong
    Locatello, Francesco
    Wipf, David
    Fu, Yanwei
    Zhang, Zheng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022
  • [40] Self-supervised reinforcement learning for multi-step object manipulation skills
    Wang, Jiaqi
    Chen, Chuxin
    Liu, Jingwei
    Du, Guanglong
    Zhu, Xiaojun
    Guan, Quanlong
    Qiu, Xiaojian
    INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2025