Self-Supervised Object Detection via Generative Image Synthesis

Cited by: 6
Authors
Mustikovela, Siva Karthik [1, 3]
De Mello, Shalini [1]
Prakash, Aayush [1]
Iqbal, Umar [1]
Liu, Sifei [1]
Nguyen-Phuoc, Thu [2]
Rother, Carsten [3]
Kautz, Jan [1]
Affiliations
[1] NVIDIA, Heidelberg, Germany
[2] Univ Bath, Bath, Avon, England
[3] Heidelberg Univ, Heidelberg, Germany
DOI
10.1109/ICCV48922.2021.00849
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We present SSOD, the first end-to-end analysis-by-synthesis framework with controllable GANs for the task of self-supervised object detection. We use collections of real-world images without bounding box annotations to learn to synthesize and detect objects. We leverage controllable GANs to synthesize images with pre-defined object properties and use them to train object detectors. We propose a tight end-to-end coupling of the synthesis and detection networks to optimally train our system. Finally, we also propose a method to optimally adapt SSOD to an intended target dataset without requiring labels for it. For the task of car detection, on the challenging KITTI and Cityscapes datasets, we show that SSOD outperforms the prior state-of-the-art purely image-based self-supervised object detection method Wetectron. Even without requiring any 3D CAD assets, it also surpasses the state-of-the-art rendering-based method Meta-Sim2. Our work advances the field of self-supervised object detection by introducing a successful new paradigm of using controllable GAN-based image synthesis for it and by significantly improving the baseline accuracy of the task.
Pages: 8589-8598
Page count: 10