Multi-motion and Appearance Self-Supervised Moving Object Detection

被引:2
|
作者
Yang, Fan [1 ,2 ]
Karanam, Srikrishna [1 ]
Zheng, Meng [1 ]
Chen, Terrence [1 ]
Ling, Haibin [3 ]
Wu, Ziyan [1 ]
机构
[1] United Imaging Intelligence, Cambridge, MA 02140 USA
[2] Temple Univ, Philadelphia, PA 19122 USA
[3] SUNY Stony Brook, Stony Brook, NY 11794 USA
关键词
SEGMENTATION;
D O I
10.1109/WACV51458.2022.00216
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we consider the problem of self-supervised Moving Object Detection (MOD) in video, where no ground truth is involved in both training and inference phases. Recently, an adversarial learning framework is proposed [32] to leverage inherent temporal information for MOD. While showing great promising results, it uses single scale temporal information and may meet problems when dealing with a deformable object under multi-scale motion in different parts. Additional challenges can arise from the moving camera, which results in the failure of the motion independence hypothesis and locally independent background motion. To deal with these problems, we propose a Multi-motion and Appearance Self-supervised Network (MASNet) to introduce multi-scale motion information and appearance information of scene for MOD. In particular, a moving object, especially the deformable, usually consists of moving regions at various temporal scales. Introducing multiscale motion can aggregate these regions to form a more complete detection. Appearance information can serve as another cue for MOD when the motion independence is not reliable and for removing false detection in background caused by locally independent background motion. To encode multi-scale motion and appearance, in MASNet we respectively design a multi-branch flow encoding module and an image inpainter module. The proposed modules and MASNet are extensively evaluated on the DAVIS dataset to demonstrate the effectiveness and superiority to state-of-the-art self-supervised methods.
引用
收藏
页码:2101 / 2110
页数:10
相关论文
共 50 条
  • [1] Single-pixel imaging of a moving object with multi-motion
    纪鹏程
    吴庆帆
    曹盛福
    张会娟
    杨照华
    余远金
    Chinese Optics Letters, 2024, 22 (10) : 38 - 43
  • [2] Single-pixel imaging of a moving object with multi-motion
    Ji, Pengcheng
    Wu, Qingfan
    Cao, Shengfu
    Zhang, Huijuan
    Yang, Zhaohua
    Yu, Yuanjin
    CHINESE OPTICS LETTERS, 2024, 22 (10)
  • [3] Object Detection with Self-Supervised Scene Adaptation
    Zhang, Zekun
    Hoai, Minh
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 21589 - 21599
  • [4] Self-supervised Video Object Segmentation by Motion Grouping
    Yang, Charig
    Lamdouar, Hala
    Lu, Erika
    Zisserman, Andrew
    Xie, Weidi
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 7157 - 7168
  • [5] HASSOD: Hierarchical Adaptive Self-Supervised Object Detection
    Cao, Shengcao
    Joshi, Dhiraj
    Gui, Liang-Yan
    Wang, Yu-Xiong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [6] Self-Supervised Object Detection from Egocentric Videos
    Akiva, Peri
    Huang, Jing
    Liang, Kevin J.
    Kovvuri, Rama
    Chen, Xingyu
    Feiszli, Matt
    Dana, Kristin
    Hassner, Tal
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 5202 - 5214
  • [7] Self-Supervised Reinforcement Learning for Active Object Detection
    Fang, Fen
    Liang, Wenyu
    Wu, Yan
    Xu, Qianli
    Lim, Joo-Hwee
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04): : 10224 - 10231
  • [8] Self-supervised Object Motion and Depth Estimation from Video
    Dai, Qi
    Patii, Vaishakh
    Hecker, Simon
    Dai, Dengxin
    Van Gool, Luc
    Schindler, Konrad
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 4326 - 4334
  • [9] Self-Supervised Video GANs: Learning for Appearance Consistency and Motion Coherency
    Hyun, Sangeek
    Kim, Jihwan
    Heo, Jae-Pil
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 10821 - 10830
  • [10] Driving Accident Detection by Self-Supervised Adversarial Appearance-Motion Prediction in First -Person Videos
    Qiao, Jiahuan
    Fang, Jianwu
    Yan, Dingxin
    Xue, Jianru
    PROCEEDINGS OF 2020 3RD INTERNATIONAL CONFERENCE ON UNMANNED SYSTEMS (ICUS), 2020, : 1083 - 1088