Self-Supervised Object Detection via Generative Image Synthesis

被引:6
|
作者
Mustikovela, Siva Karthik [1 ,3 ]
De Mello, Shalini [1 ]
Prakash, Aayush [1 ]
Iqbal, Umar [1 ]
Liu, Sifei [1 ]
Thu Nguyen-Phuoc [2 ]
Rother, Carsten [3 ]
Kautz, Jan [1 ]
机构
[1] NVIDIA, Heidelberg, Germany
[2] Univ Bath, Bath, Avon, England
[3] Heidelberg Univ, Heidelberg, Germany
关键词
D O I
10.1109/ICCV48922.2021.00849
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present SSOD - the first end-to-end analysis-by-synthesis framework with controllable GANs for the task of self-supervised object detection. We use collections of real-world images without bounding box annotations to learn to synthesize and detect objects. We leverage controllable GANs to synthesize images with pre-defined object properties and use them to train object detectors. We propose a tight end-to-end coupling of the synthesis and detection networks to optimally train our system. Finally, we also propose a method to optimally adapt SSOD to an intended target data without requiring labels for it. For the task of car detection, on the challenging KITTI and Cityscapes datasets, we show that SSOD outperforms the prior state-of-the-art purely image-based self-supervised object detection method Wetectron. Even without requiring any 3D CAD assets, it also surpasses the state-of-the-art rendering-based method Meta-Sim2. Our work advances the field of self-supervised object detection by introducing a successful new paradigm of using controllable GAN-based image synthesis for it and by significantly improving the baseline accuracy of the task.
引用
收藏
页码:8589 / 8598
页数:10
相关论文
共 50 条
  • [31] A Survey of Self-Supervised and Few-Shot Object Detection
    Huang, Gabriel
    Laradji, Issam
    Vazquez, David
    Lacoste-Julien, Simon
    Rodriguez, Pau
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (04) : 4071 - 4089
  • [32] Text-to-image synthesis with self-supervised learning
    Tan, Yong Xuan
    Lee, Chin Poo
    Neo, Mai
    Lim, Kian Ming
    PATTERN RECOGNITION LETTERS, 2022, 157 : 119 - 126
  • [33] Industrial Image Anomaly Detection via Self-Supervised Learning with Feature Enhancement Assistance
    Wu, Bin
    Wang, Xiaoqi
    APPLIED SCIENCES-BASEL, 2024, 14 (16):
  • [34] Self-Supervised Text Erasing with Controllable Image Synthesis
    Jiang, Gangwei
    Wang, Shiyao
    Ge, Tiezheng
    Jiang, Yuning
    Wei, Ying
    Lian, Defu
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 1973 - 1983
  • [35] Multi-task Self-supervised Object Detection via Recycling of Bounding Box Annotations
    Lee, Wonhee
    Na, Joonil
    Kim, Gunhee
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 4979 - 4988
  • [36] Self-supervised Co-salient Object Detection via Feature Correspondences at Multiple Scales
    Chakraborty, Souradeep
    Samaras, Dimitris
    COMPUTER VISION - ECCV 2024, PT IX, 2025, 15067 : 231 - 250
  • [37] Object-Centric Masked Image Modeling-Based Self-Supervised Pretraining for Remote Sensing Object Detection
    Zhang, Tong
    Zhuang, Yin
    Chen, He
    Chen, Liang
    Wang, Guanqun
    Gao, Peng
    Dong, Hao
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 5013 - 5025
  • [38] Self-supervised generative models for crystal structures
    Liu, Fangze
    Chen, Zhantao
    Liu, Tianyi
    Song, Ruyi
    Lin, Yu
    Turner, Joshua J.
    Jia, Chunjing
    ISCIENCE, 2024, 27 (09)
  • [39] Generative Adversarial and Self-Supervised Dehazing Network
    Zhang, Shengdong
    Zhang, Xiaoqin
    Wan, Shaohua
    Ren, Wenqi
    Zhao, Liping
    Shen, Linlin
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (03) : 4187 - 4197
  • [40] Efficient Medical Image Assessment via Self-supervised Learning
    Huang, Chun-Yin
    Lei, Qi
    Li, Xiaoxiao
    DATA AUGMENTATION, LABELLING, AND IMPERFECTIONS (DALI 2022), 2022, 13567 : 102 - 111