Semi-supervised 3D Object Detection with Proficient Teachers

被引:47
|
作者
Yin, Junbo [1 ]
Fang, Jin [2 ,3 ,4 ]
Zhou, Dingfu [2 ,3 ]
Zhang, Liangjun [2 ,3 ]
Xu, Cheng-Zhong [4 ]
Shen, Jianbing [4 ]
Wang, Wenguan [5 ]
机构
[1] Beijing Inst Technol, Sch Comp Sci, Beijing, Peoples R China
[2] Baidu Res, Beijing, Peoples R China
[3] Natl Engn Lab Deep Learning Technol & Applicat, Beijing, Peoples R China
[4] Univ Macau, CIS, SKL IOTSC, Zhuhai, Peoples R China
[5] Univ Technol Sydney, ReLER, AAII, Ultimo, Australia
来源
基金
澳大利亚研究理事会;
关键词
3D object detection; Semi-supervised learning; Point cloud;
D O I
10.1007/978-3-031-19839-7_42
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dominated point cloud-based 3D object detectors in autonomous driving scenarios rely heavily on the huge amount of accurately labeled samples, however, 3D annotation in the point cloud is extremely tedious, expensive and time-consuming. To reduce the dependence on large supervision, semi-supervised learning (SSL) based approaches have been proposed. The Pseudo-Labeling methodology is commonly used for SSL frameworks, however, the low-quality predictions from the teacher model have seriously limited its performance. In this work, we propose a new Pseudo-Labeling framework for semi-supervised 3D object detection, by enhancing the teacher model to a proficient one with several necessary designs. First, to improve the recall of pseudo labels, a Spatial-temporal Ensemble (STE) module is proposed to generate sufficient seed boxes. Second, to improve the precision of recalled boxes, a Clustering-based Box Voting (CBV) module is designed to get aggregated votes from the clustered seed boxes. This also eliminates the necessity of sophisticated thresholds to select pseudo labels. Furthermore, to reduce the negative influence of wrongly pseudo-labeled samples during the training, a soft supervision signal is proposed by considering Box-wise Contrastive Learning (BCL). The effectiveness of our model is verified on both ONCE and Waymo datasets. For example, on ONCE, our approach significantly improves the baseline by 9.51 mAP. Moreover, with half annotations, our model outperforms the oracle model with full annotations on Waymo.
引用
收藏
页码:727 / 743
页数:17
相关论文
共 50 条
  • [21] Humble Teachers Teach Better Students for Semi-Supervised Object Detection
    Tang, Yihe
    Chen, Weifeng
    Luo, Yijun
    Zhang, Yuting
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 3131 - 3140
  • [22] SSDA3D: Semi-supervised Domain Adaptation for 3D Object Detection from Point Cloud
    Wang, Yan
    Yin, Junbo
    Li, Wei
    Frossard, Pascal
    Yang, Ruigang
    Shen, Jianbing
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 2707 - 2715
  • [23] ProUDA: Progressive unsupervised data augmentation for semi-Supervised 3D object detection on point cloud
    An, Pei
    Liang, Junxiong
    Ma, Tao
    Chen, Yanfei
    Wang, Liheng
    Ma, Jie
    PATTERN RECOGNITION LETTERS, 2023, 170 : 64 - 69
  • [24] FocalMix: Semi-Supervised Learning for 3D Medical Image Detection
    Wang, Dong
    Zhang, Yuan
    Zhang, Kexin
    Wang, Liwei
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3950 - 3959
  • [25] ATF-3D: Semi-Supervised 3D Object Detection With Adaptive Thresholds Filtering Based on Confidence and Distance
    Zhang, Zehan
    Ji, Yang
    Cui, Wei
    Wang, Yulong
    Li, Hao
    Zhao, Xian
    Li, Duo
    Tang, Sanli
    Yang, Ming
    Tan, Wenming
    Pu, Shiliang
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) : 10573 - 10580
  • [26] DDS3D: Dense Pseudo-Labels with Dynamic Threshold for Semi-Supervised 3D Object Detection
    Li, Jingyu
    Liu, Zhe
    Hou, Jinghua
    Liang, Dingkang
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 9245 - 9252
  • [27] Interactive Self-Training with Mean Teachers for Semi-supervised Object Detection
    Yang, Qize
    Wei, Xihan
    Wang, Biao
    Hua, Xian-Sheng
    Zhang, Lei
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 5937 - 5946
  • [28] Diff3DETR: Agent-Based Diffusion Model for Semi-supervised 3D Object Detection
    Deng, Jiacheng
    Lu, Jiahao
    Zhang, Tianzhu
    COMPUTER VISION - ECCV 2024, PT XXXIV, 2025, 15092 : 57 - 73
  • [29] UpCycling: Semi-supervised 3D Object Detection without Sharing Raw-level Unlabeled Scenes
    Hwang, Sunwook
    Kim, Youngseok
    Kim, Seongwon
    Bahk, Saewoong
    Kim, Hyung-Sin
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 23294 - 23304
  • [30] Semi-Supervised Stereo-based 3D Object Detection via Cross-View Consensus
    Wu, Wenhao
    Wong, Hau-San
    Wu, Si
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17471 - 17481