Semi-supervised 3D Object Detection with Proficient Teachers

被引:47
|
作者
Yin, Junbo [1 ]
Fang, Jin [2 ,3 ,4 ]
Zhou, Dingfu [2 ,3 ]
Zhang, Liangjun [2 ,3 ]
Xu, Cheng-Zhong [4 ]
Shen, Jianbing [4 ]
Wang, Wenguan [5 ]
机构
[1] Beijing Inst Technol, Sch Comp Sci, Beijing, Peoples R China
[2] Baidu Res, Beijing, Peoples R China
[3] Natl Engn Lab Deep Learning Technol & Applicat, Beijing, Peoples R China
[4] Univ Macau, CIS, SKL IOTSC, Zhuhai, Peoples R China
[5] Univ Technol Sydney, ReLER, AAII, Ultimo, Australia
来源
基金
澳大利亚研究理事会;
关键词
3D object detection; Semi-supervised learning; Point cloud;
D O I
10.1007/978-3-031-19839-7_42
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dominated point cloud-based 3D object detectors in autonomous driving scenarios rely heavily on the huge amount of accurately labeled samples, however, 3D annotation in the point cloud is extremely tedious, expensive and time-consuming. To reduce the dependence on large supervision, semi-supervised learning (SSL) based approaches have been proposed. The Pseudo-Labeling methodology is commonly used for SSL frameworks, however, the low-quality predictions from the teacher model have seriously limited its performance. In this work, we propose a new Pseudo-Labeling framework for semi-supervised 3D object detection, by enhancing the teacher model to a proficient one with several necessary designs. First, to improve the recall of pseudo labels, a Spatial-temporal Ensemble (STE) module is proposed to generate sufficient seed boxes. Second, to improve the precision of recalled boxes, a Clustering-based Box Voting (CBV) module is designed to get aggregated votes from the clustered seed boxes. This also eliminates the necessity of sophisticated thresholds to select pseudo labels. Furthermore, to reduce the negative influence of wrongly pseudo-labeled samples during the training, a soft supervision signal is proposed by considering Box-wise Contrastive Learning (BCL). The effectiveness of our model is verified on both ONCE and Waymo datasets. For example, on ONCE, our approach significantly improves the baseline by 9.51 mAP. Moreover, with half annotations, our model outperforms the oracle model with full annotations on Waymo.
引用
收藏
页码:727 / 743
页数:17
相关论文
共 50 条
  • [1] Semi-supervised 3D Object Detection with PatchTeacher and PillarMix
    Wu, Xiaopei
    Peng, Liang
    Xie, Liang
    Hou, Yuenan
    Lin, Binbin
    Huang, Xiaoshui
    Liu, Haifeng
    Cai, Deng
    Ouyang, Wanli
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6, 2024, : 6153 - 6161
  • [2] Learning with Noisy Data for Semi-Supervised 3D Object Detection
    Chen, Zehui
    Li, Zhenyu
    Wang, Shuo
    Fu, Dengpan
    Zhao, Feng
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 6906 - 6916
  • [3] A semi-supervised 3D object detection method for autonomous driving
    Zhang, Jiacheng
    Liu, Huafeng
    Lu, Jianfeng
    DISPLAYS, 2022, 71
  • [4] A-Teacher: Asymmetric Network for 3D Semi-Supervised Object Detection
    Wang, Hanshi
    Zhang, Zhipeng
    Gao, Jin
    Hu, Weiming
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 14978 - 14987
  • [5] A Simple Vision Transformer for Weakly Semi-supervised 3D Object Detection
    Zhang, Dingyuan
    Liang, Dingkang
    Zou, Zhikang
    Li, Jingyu
    Ye, Xiaoqing
    Liu, Zhe
    Tan, Xiao
    Bai, Xiang
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 8339 - 8349
  • [6] Joint Semi-Supervised and Active Learning via 3D Consistency for 3D Object Detection
    Hwang, Sihwan
    Kim, Sanmin
    Kim, Youngseok
    Kum, Dongsuk
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 4819 - 4825
  • [7] DetMatch: Two Teachers are Better than One for Joint 2D and 3D Semi-Supervised Object Detection
    Park, Jinhyung
    Xu, Chenfeng
    Zhou, Yiyang
    Tomizuka, Masayoshi
    Zhan, Wei
    COMPUTER VISION, ECCV 2022, PT X, 2022, 13670 : 370 - 389
  • [8] Transferable Semi-Supervised 3D Object Detection From RGB-D Data
    Tang, Yew Siang
    Lee, Gim Hee
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1931 - 1940
  • [9] Diffusion-SS3D: Diffusion Model for Semi-supervised 3D Object Detection
    Ho, Cheng-Ju
    Tai, Chen-Hsuan
    Lin, Yen-Yu
    Yang, Ming-Hsuan
    Tsai, Yi-Hsuan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [10] Semi-Supervised Online Continual Learning for 3D Object Detection in Mobile Robotics
    Liu, Binhong
    Yao, Dexin
    Yang, Rui
    Yan, Zhi
    Yang, Tao
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2024, 110 (04)