Self-Supervised Object Detection via Generative Image Synthesis

被引:6
|
作者
Mustikovela, Siva Karthik [1 ,3 ]
De Mello, Shalini [1 ]
Prakash, Aayush [1 ]
Iqbal, Umar [1 ]
Liu, Sifei [1 ]
Thu Nguyen-Phuoc [2 ]
Rother, Carsten [3 ]
Kautz, Jan [1 ]
机构
[1] NVIDIA, Heidelberg, Germany
[2] Univ Bath, Bath, Avon, England
[3] Heidelberg Univ, Heidelberg, Germany
关键词
D O I
10.1109/ICCV48922.2021.00849
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present SSOD - the first end-to-end analysis-by-synthesis framework with controllable GANs for the task of self-supervised object detection. We use collections of real-world images without bounding box annotations to learn to synthesize and detect objects. We leverage controllable GANs to synthesize images with pre-defined object properties and use them to train object detectors. We propose a tight end-to-end coupling of the synthesis and detection networks to optimally train our system. Finally, we also propose a method to optimally adapt SSOD to an intended target data without requiring labels for it. For the task of car detection, on the challenging KITTI and Cityscapes datasets, we show that SSOD outperforms the prior state-of-the-art purely image-based self-supervised object detection method Wetectron. Even without requiring any 3D CAD assets, it also surpasses the state-of-the-art rendering-based method Meta-Sim2. Our work advances the field of self-supervised object detection by introducing a successful new paradigm of using controllable GAN-based image synthesis for it and by significantly improving the baseline accuracy of the task.
引用
收藏
页码:8589 / 8598
页数:10
相关论文
共 50 条
  • [41] Self-supervised video object segmentation via pseudo label rectification
    Guo, Pinxue
    Zhang, Wei
    Li, Xiaoqiang
    Fan, Jianping
    Zhang, Wenqiang
    PATTERN RECOGNITION, 2025, 163
  • [42] IMAGE QUALITY ASSESSMENT DRIVEN SELF-SUPERVISED ANOMALY DETECTION
    Wang, Zhipeng
    Hou, Chunping
    Liu, Yang
    Ge, Bangbang
    Yue, Guanghui
    Song, Chunying
    2021 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2021,
  • [43] Self-Supervised Image Anomaly Detection and Localization with Synthetic Anomalies
    Tsai, Min-Chun
    Wang, Sheng-De
    2023 10TH INTERNATIONAL CONFERENCE ON INTERNET OF THINGS: SYSTEMS, MANAGEMENT AND SECURITY, IOTSMS, 2023, : 90 - 95
  • [44] SELF-SUPERVISED CONFIDENT LEARNING FOR HYPERSPECTRAL IMAGE CHANGE DETECTION
    Wu, Haonan
    Chen, Zhao
    2022 12TH WORKSHOP ON HYPERSPECTRAL IMAGING AND SIGNAL PROCESSING: EVOLUTION IN REMOTE SENSING (WHISPERS), 2022,
  • [45] HYPERSPECTRAL IMAGE CHANGE DETECTION BY SELF-SUPERVISED TENSOR NETWORK
    Zhou, Feng
    Chen, Zhao
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 2527 - 2530
  • [46] IMPROVING ANOMALY DETECTION WITH A SELF-SUPERVISED TASK BASED ON GENERATIVE ADVERSARIAL NETWORK
    Chai, Heyan
    Su, Weijun
    Tang, Siyu
    Ding, Ye
    Fang, Binxing
    Liao, Qing
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3563 - 3567
  • [47] Self-supervised Prototype Conditional Few-Shot Object Detection
    Kobayashi, Daisuke
    IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT II, 2022, 13232 : 681 - 692
  • [48] Multi-motion and Appearance Self-Supervised Moving Object Detection
    Yang, Fan
    Karanam, Srikrishna
    Zheng, Meng
    Chen, Terrence
    Ling, Haibin
    Wu, Ziyan
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 2101 - 2110
  • [49] Self-Supervised Velocity Estimation for Automotive Radar Object Detection Networks
    Niederlohner, Daniel
    Ulrich, Michael
    Braun, Sascha
    Koehler, Daniel
    Faion, Florian
    Glaeser, Claudius
    Treptow, Andre
    Blume, Holger
    2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 352 - 359
  • [50] Self-supervised object detection from audio-visual correspondence
    Afouras, Triantafyllos
    Asano, Yuki M.
    Fagan, Francois
    Vedaldi, Andrea
    Metze, Florian
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 10565 - 10576