Multi-motion and Appearance Self-Supervised Moving Object Detection

被引:2
|
作者
Yang, Fan [1 ,2 ]
Karanam, Srikrishna [1 ]
Zheng, Meng [1 ]
Chen, Terrence [1 ]
Ling, Haibin [3 ]
Wu, Ziyan [1 ]
机构
[1] United Imaging Intelligence, Cambridge, MA 02140 USA
[2] Temple Univ, Philadelphia, PA 19122 USA
[3] SUNY Stony Brook, Stony Brook, NY 11794 USA
关键词
SEGMENTATION;
D O I
10.1109/WACV51458.2022.00216
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we consider the problem of self-supervised Moving Object Detection (MOD) in video, where no ground truth is involved in both training and inference phases. Recently, an adversarial learning framework is proposed [32] to leverage inherent temporal information for MOD. While showing great promising results, it uses single scale temporal information and may meet problems when dealing with a deformable object under multi-scale motion in different parts. Additional challenges can arise from the moving camera, which results in the failure of the motion independence hypothesis and locally independent background motion. To deal with these problems, we propose a Multi-motion and Appearance Self-supervised Network (MASNet) to introduce multi-scale motion information and appearance information of scene for MOD. In particular, a moving object, especially the deformable, usually consists of moving regions at various temporal scales. Introducing multiscale motion can aggregate these regions to form a more complete detection. Appearance information can serve as another cue for MOD when the motion independence is not reliable and for removing false detection in background caused by locally independent background motion. To encode multi-scale motion and appearance, in MASNet we respectively design a multi-branch flow encoding module and an image inpainter module. The proposed modules and MASNet are extensively evaluated on the DAVIS dataset to demonstrate the effectiveness and superiority to state-of-the-art self-supervised methods.
引用
收藏
页码:2101 / 2110
页数:10
相关论文
共 50 条
  • [41] Self-Supervised Multi-Object Tracking with Cross-Input Consistency
    Bastani, Favyen
    He, Songtao
    Madden, Sam
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [42] Self-Supervised Moving Vehicle Detection From Audio-Visual Cues
    Zuern, Jannik
    Burgard, Wolfram
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (03) : 7415 - 7422
  • [43] Self-supervised Neural Articulated Shape and Appearance Models
    Wei, Fangyin
    Chabra, Rohan
    Ma, Lingni
    Lassner, Christoph
    Zollhoefer, Michael
    Rusinkiewicz, Szymon
    Sweeney, Chris
    Newcombe, Richard
    Slavcheva, Mira
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 15795 - 15805
  • [44] Optical flow for self-supervised learning of obstacle appearance
    Ho, H. W.
    De Wagter, C.
    Remes, B. D. W.
    de Croon, G. C. H. E.
    2015 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2015, : 3098 - 3104
  • [45] Self-supervised domain feature mining for underwater domain generalization object detection
    Chen, Haojie
    Wang, Zhuo
    Qin, Hongde
    Mu, Xiaokai
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 265
  • [46] MODNet: Motion and Appearance based Moving Object Detection Network for Autonomous Driving
    Siam, Mennatullah
    Mahgoub, Heba
    Zahran, Mohamed
    Yogamani, Senthil
    Jagersand, Martin
    El-Sallab, Ahmad
    2018 21ST INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2018, : 2859 - 2864
  • [47] SSTN: Self-Supervised Domain Adaptation Thermal Object Detection for Autonomous Driving
    Munir, Farzeen
    Azam, Shoaib
    Jeon, Moongu
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 206 - 213
  • [48] The Retrieval of the Beautiful: Self-Supervised Salient Object Detection for Beauty Product Retrieval
    Wang, Jiawei
    Zhu, Shuai
    Xu, Jiao
    Cao, Da
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2548 - 2552
  • [49] Self-Supervised Feature Enhancement Networks for Small Object Detection in Noisy Images
    Lee, Geonsoo
    Hong, Sungeun
    Cho, Donghyeon
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1026 - 1030
  • [50] Co-mining: Self-Supervised Learning for Sparsely Annotated Object Detection
    Wang, Tiancai
    Yang, Tong
    Cao, Jiale
    Zhang, Xiangyu
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 2800 - 2808