Multi-motion and Appearance Self-Supervised Moving Object Detection

被引：2

作者：

Yang, Fan ^{[1
,2
]}

Karanam, Srikrishna ^{[1
]}

Zheng, Meng ^{[1
]}

Chen, Terrence ^{[1
]}

Ling, Haibin ^{[3
]}

Wu, Ziyan ^{[1
]}

机构：

[1] United Imaging Intelligence, Cambridge, MA 02140 USA

[2] Temple Univ, Philadelphia, PA 19122 USA

[3] SUNY Stony Brook, Stony Brook, NY 11794 USA

来源：

2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022) | 2022年

关键词：

SEGMENTATION;

D O I：

10.1109/WACV51458.2022.00216

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this work, we consider the problem of self-supervised Moving Object Detection (MOD) in video, where no ground truth is involved in both training and inference phases. Recently, an adversarial learning framework is proposed [32] to leverage inherent temporal information for MOD. While showing great promising results, it uses single scale temporal information and may meet problems when dealing with a deformable object under multi-scale motion in different parts. Additional challenges can arise from the moving camera, which results in the failure of the motion independence hypothesis and locally independent background motion. To deal with these problems, we propose a Multi-motion and Appearance Self-supervised Network (MASNet) to introduce multi-scale motion information and appearance information of scene for MOD. In particular, a moving object, especially the deformable, usually consists of moving regions at various temporal scales. Introducing multiscale motion can aggregate these regions to form a more complete detection. Appearance information can serve as another cue for MOD when the motion independence is not reliable and for removing false detection in background caused by locally independent background motion. To encode multi-scale motion and appearance, in MASNet we respectively design a multi-branch flow encoding module and an image inpainter module. The proposed modules and MASNet are extensively evaluated on the DAVIS dataset to demonstrate the effectiveness and superiority to state-of-the-art self-supervised methods.

引用

页码：2101 / 2110

页数：10

共 50 条

[41] Self-Supervised Multi-Object Tracking with Cross-Input Consistency
Bastani, Favyen
He, Songtao
Madden, Sam
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[42] Self-Supervised Moving Vehicle Detection From Audio-Visual Cues
Zuern, Jannik
Burgard, Wolfram
IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (03) : 7415 - 7422
[43] Self-supervised Neural Articulated Shape and Appearance Models
Wei, Fangyin
Chabra, Rohan
Ma, Lingni
Lassner, Christoph
Zollhoefer, Michael
Rusinkiewicz, Szymon
Sweeney, Chris
Newcombe, Richard
Slavcheva, Mira
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 15795 - 15805
[44] Optical flow for self-supervised learning of obstacle appearance
Ho, H. W.
De Wagter, C.
Remes, B. D. W.
de Croon, G. C. H. E.
2015 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2015, : 3098 - 3104
[45] Self-supervised domain feature mining for underwater domain generalization object detection
Chen, Haojie
Wang, Zhuo
Qin, Hongde
Mu, Xiaokai
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 265
[46] MODNet: Motion and Appearance based Moving Object Detection Network for Autonomous Driving
Siam, Mennatullah
Mahgoub, Heba
Zahran, Mohamed
Yogamani, Senthil
Jagersand, Martin
El-Sallab, Ahmad
2018 21ST INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2018, : 2859 - 2864
[47] SSTN: Self-Supervised Domain Adaptation Thermal Object Detection for Autonomous Driving
Munir, Farzeen
Azam, Shoaib
Jeon, Moongu
2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 206 - 213
[48] The Retrieval of the Beautiful: Self-Supervised Salient Object Detection for Beauty Product Retrieval
Wang, Jiawei
Zhu, Shuai
Xu, Jiao
Cao, Da
PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2548 - 2552
[49] Self-Supervised Feature Enhancement Networks for Small Object Detection in Noisy Images
Lee, Geonsoo
Hong, Sungeun
Cho, Donghyeon
IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1026 - 1030
[50] Co-mining: Self-Supervised Learning for Sparsely Annotated Object Detection
Wang, Tiancai
Yang, Tong
Cao, Jiale
Zhang, Xiangyu
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 2800 - 2808

← 1 2 3 4 5 →