Semi-supervised 3D Object Detection with Proficient Teachers

Cited by: 47
Authors
Yin, Junbo [1]
Fang, Jin [2, 3, 4]
Zhou, Dingfu [2, 3]
Zhang, Liangjun [2, 3]
Xu, Cheng-Zhong [4]
Shen, Jianbing [4]
Wang, Wenguan [5]
Affiliations
[1] Beijing Inst Technol, Sch Comp Sci, Beijing, Peoples R China
[2] Baidu Res, Beijing, Peoples R China
[3] Natl Engn Lab Deep Learning Technol & Applicat, Beijing, Peoples R China
[4] Univ Macau, CIS, SKL IOTSC, Zhuhai, Peoples R China
[5] Univ Technol Sydney, ReLER, AAII, Ultimo, Australia
Source
Funding
Australian Research Council
Keywords
3D object detection; Semi-supervised learning; Point cloud
DOI
10.1007/978-3-031-19839-7_42
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Dominant point cloud-based 3D object detectors for autonomous driving rely heavily on large amounts of accurately labeled samples; however, 3D annotation of point clouds is extremely tedious, expensive and time-consuming. To reduce the dependence on large-scale supervision, semi-supervised learning (SSL) based approaches have been proposed. The pseudo-labeling methodology is commonly used in SSL frameworks, but low-quality predictions from the teacher model seriously limit its performance. In this work, we propose a new pseudo-labeling framework for semi-supervised 3D object detection that enhances the teacher model into a proficient one through several necessary designs. First, to improve the recall of pseudo labels, a Spatial-temporal Ensemble (STE) module is proposed to generate sufficient seed boxes. Second, to improve the precision of the recalled boxes, a Clustering-based Box Voting (CBV) module is designed to obtain aggregated votes from the clustered seed boxes. This also eliminates the need for sophisticated thresholds to select pseudo labels. Furthermore, to reduce the negative influence of wrongly pseudo-labeled samples during training, a soft supervision signal is proposed by considering Box-wise Contrastive Learning (BCL). The effectiveness of our model is verified on both the ONCE and Waymo datasets. For example, on ONCE, our approach significantly improves the baseline by 9.51 mAP. Moreover, with half of the annotations, our model outperforms the oracle model trained with full annotations on Waymo.
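The idea of clustering seed boxes and aggregating them into one pseudo label per cluster can be illustrated with a minimal sketch. This is not the paper's CBV module, whose details are not given in the abstract; it is a simplified, non-learned stand-in that groups boxes by center distance and takes a score-weighted average. The Box fields, the radius parameter, and all function names are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from math import hypot


@dataclass
class Box:
    """Hypothetical bird's-eye-view box: center, size, and confidence score."""
    x: float
    y: float
    length: float
    width: float
    score: float


def cluster_seed_boxes(boxes, radius=1.0):
    """Greedily group boxes by center distance (assumed clustering rule).

    Boxes are visited from highest to lowest score. Each box joins the first
    cluster whose anchor (its highest-scoring member) lies within `radius`;
    otherwise it starts a new cluster.
    """
    clusters = []
    for box in sorted(boxes, key=lambda b: b.score, reverse=True):
        for cluster in clusters:
            anchor = cluster[0]
            if hypot(box.x - anchor.x, box.y - anchor.y) <= radius:
                cluster.append(box)
                break
        else:
            clusters.append([box])
    return clusters


def vote(cluster):
    """Merge one cluster into a single box by score-weighted averaging.

    Assumes every score is positive, so the total weight is non-zero.
    The merged score is the mean score of the cluster members.
    """
    total = sum(b.score for b in cluster)

    def weighted(attr):
        return sum(getattr(b, attr) * b.score for b in cluster) / total

    return Box(weighted("x"), weighted("y"), weighted("length"),
               weighted("width"), total / len(cluster))


def aggregate(boxes, radius=1.0):
    """Return one voted box per cluster of seed boxes."""
    return [vote(cluster) for cluster in cluster_seed_boxes(boxes, radius)]
```

Calling aggregate(seeds) on a list of seed boxes collected from several augmented or temporally adjacent predictions would return one merged box per spatial cluster. A fixed averaging rule is used here only because it is easy to verify by hand.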
Pages: 727-743
Number of pages: 17