Semi-supervised 3D Object Detection with Proficient Teachers

被引:47
|
作者
Yin, Junbo [1 ]
Fang, Jin [2 ,3 ,4 ]
Zhou, Dingfu [2 ,3 ]
Zhang, Liangjun [2 ,3 ]
Xu, Cheng-Zhong [4 ]
Shen, Jianbing [4 ]
Wang, Wenguan [5 ]
机构
[1] Beijing Inst Technol, Sch Comp Sci, Beijing, Peoples R China
[2] Baidu Res, Beijing, Peoples R China
[3] Natl Engn Lab Deep Learning Technol & Applicat, Beijing, Peoples R China
[4] Univ Macau, CIS, SKL IOTSC, Zhuhai, Peoples R China
[5] Univ Technol Sydney, ReLER, AAII, Ultimo, Australia
来源
基金
澳大利亚研究理事会;
关键词
3D object detection; Semi-supervised learning; Point cloud;
D O I
10.1007/978-3-031-19839-7_42
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dominated point cloud-based 3D object detectors in autonomous driving scenarios rely heavily on the huge amount of accurately labeled samples, however, 3D annotation in the point cloud is extremely tedious, expensive and time-consuming. To reduce the dependence on large supervision, semi-supervised learning (SSL) based approaches have been proposed. The Pseudo-Labeling methodology is commonly used for SSL frameworks, however, the low-quality predictions from the teacher model have seriously limited its performance. In this work, we propose a new Pseudo-Labeling framework for semi-supervised 3D object detection, by enhancing the teacher model to a proficient one with several necessary designs. First, to improve the recall of pseudo labels, a Spatial-temporal Ensemble (STE) module is proposed to generate sufficient seed boxes. Second, to improve the precision of recalled boxes, a Clustering-based Box Voting (CBV) module is designed to get aggregated votes from the clustered seed boxes. This also eliminates the necessity of sophisticated thresholds to select pseudo labels. Furthermore, to reduce the negative influence of wrongly pseudo-labeled samples during the training, a soft supervision signal is proposed by considering Box-wise Contrastive Learning (BCL). The effectiveness of our model is verified on both ONCE and Waymo datasets. For example, on ONCE, our approach significantly improves the baseline by 9.51 mAP. Moreover, with half annotations, our model outperforms the oracle model with full annotations on Waymo.
引用
收藏
页码:727 / 743
页数:17
相关论文
共 50 条
  • [41] Monocular 3D Detection With Geometric Constraint Embedding and Semi-Supervised Training
    Li, Peixuan
    Zhao, Huaici
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (03): : 5565 - 5572
  • [42] Semi-DETR: Semi-Supervised Object Detection with Detection Transformers
    Zhang, Jiacheng
    Lin, Xiangru
    Zhang, Wei
    Wang, Kuo
    Tan, Xiao
    Han, Junyu
    Ding, Errui
    Wang, Jingdong
    Li, Guanbin
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 23809 - 23818
  • [43] Mix-Teaching: A Simple, Unified and Effective Semi-Supervised Learning Framework for Monocular 3D Object Detection
    Yang, Lei
    Zhang, Xinyu
    Li, Jun
    Wang, Li
    Zhu, Minghan
    Zhang, Chuang
    Liu, Huaping
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (11) : 6832 - 6844
  • [44] PE-MCAT: Leveraging Image Sensor Fusion and Adaptive Thresholds for Semi-Supervised 3D Object Detection
    Li, Bohao
    Song, Shaojing
    Ai, Luxia
    SENSORS, 2024, 24 (21)
  • [45] MixCycle: Mixup Assisted Semi-Supervised 3D Single Object Tracking with Cycle Consistency
    Wu, Qiao
    Yang, Jiaqi
    Sun, Kun
    Zhang, Chu'ai
    Zhang, Yanning
    Salzmann, Mathieu
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 13910 - 13920
  • [46] Semi-supervised Open-World Object Detection
    Mullappilly, Sahal Shaji
    Gehlot, Abhishek Singh
    Anwer, Rao Muhammad
    Khan, Fahad Shahbaz
    Cholakkal, Hisham
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 5, 2024, : 4305 - 4314
  • [47] Uncertain region mining semi-supervised object detection
    Tianxiang Yin
    Ningzhong Liu
    Han Sun
    Applied Intelligence, 2024, 54 : 2300 - 2313
  • [48] Uncertain region mining semi-supervised object detection
    Yin, Tianxiang
    Liu, Ningzhong
    Sun, Han
    APPLIED INTELLIGENCE, 2024, 54 (02) : 2300 - 2313
  • [49] Open-Set Semi-Supervised Object Detection
    Liu, Yen-Cheng
    Ma, Chih-Yao
    Dai, Xiaoliang
    Tian, Junjiao
    Vajda, Peter
    He, Zijian
    Kira, Zsolt
    COMPUTER VISION - ECCV 2022, PT XXX, 2022, 13690 : 143 - 159
  • [50] Rethinking Pseudo Labels for Semi-supervised Object Detection
    Li, Hengduo
    Wu, Zuxuan
    Shrivastava, Abhinav
    Davis, Larry S.
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 1314 - 1322