Semi-supervised 3D Object Detection with Proficient Teachers

Cited by: 47

Authors:

Yin, Junbo ^{[1]}

Fang, Jin ^{[2,3,4]}

Zhou, Dingfu ^{[2,3]}

Zhang, Liangjun ^{[2,3]}

Xu, Cheng-Zhong ^{[4]}

Shen, Jianbing ^{[4]}

Wang, Wenguan ^{[5]}

Affiliations:

[1] Beijing Inst Technol, Sch Comp Sci, Beijing, Peoples R China

[2] Baidu Res, Beijing, Peoples R China

[3] Natl Engn Lab Deep Learning Technol & Applicat, Beijing, Peoples R China

[4] Univ Macau, CIS, SKL IOTSC, Zhuhai, Peoples R China

[5] Univ Technol Sydney, ReLER, AAII, Ultimo, Australia

Source:

COMPUTER VISION, ECCV 2022, PT XXXVIII | 2022 / Vol. 13698

Funding:

Australian Research Council;

Keywords:

3D object detection; Semi-supervised learning; Point cloud;

DOI:

10.1007/978-3-031-19839-7_42

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The dominant point cloud-based 3D object detectors in autonomous driving scenarios rely heavily on large amounts of accurately labeled samples; however, 3D annotation of point clouds is extremely tedious, expensive and time-consuming. To reduce the dependence on large-scale supervision, semi-supervised learning (SSL) based approaches have been proposed. Pseudo-labeling is commonly used in SSL frameworks, but low-quality predictions from the teacher model seriously limit its performance. In this work, we propose a new pseudo-labeling framework for semi-supervised 3D object detection that enhances the teacher model into a proficient one through several necessary designs. First, to improve the recall of pseudo labels, a Spatial-temporal Ensemble (STE) module is proposed to generate sufficient seed boxes. Second, to improve the precision of the recalled boxes, a Clustering-based Box Voting (CBV) module is designed to obtain aggregated votes from the clustered seed boxes. This also eliminates the need for sophisticated thresholds to select pseudo labels. Furthermore, to reduce the negative influence of wrongly pseudo-labeled samples during training, a soft supervision signal is proposed based on Box-wise Contrastive Learning (BCL). The effectiveness of our model is verified on both the ONCE and Waymo datasets. For example, on ONCE, our approach significantly improves the baseline by 9.51 mAP. Moreover, with half of the annotations, our model outperforms the oracle model trained with full annotations on Waymo.

Pages: 727-743

Page count: 17

50 records in total

[1] Semi-supervised 3D Object Detection with PatchTeacher and PillarMix
Wu, Xiaopei
Peng, Liang
Xie, Liang
Hou, Yuenan
Lin, Binbin
Huang, Xiaoshui
Liu, Haifeng
Cai, Deng
Ouyang, Wanli
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6, 2024, : 6153 - 6161
[2] Learning with Noisy Data for Semi-Supervised 3D Object Detection
Chen, Zehui
Li, Zhenyu
Wang, Shuo
Fu, Dengpan
Zhao, Feng
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 6906 - 6916
[3] A semi-supervised 3D object detection method for autonomous driving
Zhang, Jiacheng
Liu, Huafeng
Lu, Jianfeng
DISPLAYS, 2022, 71
[4] A-Teacher: Asymmetric Network for 3D Semi-Supervised Object Detection
Wang, Hanshi
Zhang, Zhipeng
Gao, Jin
Hu, Weiming
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 14978 - 14987
[5] A Simple Vision Transformer for Weakly Semi-supervised 3D Object Detection
Zhang, Dingyuan
Liang, Dingkang
Zou, Zhikang
Li, Jingyu
Ye, Xiaoqing
Liu, Zhe
Tan, Xiao
Bai, Xiang
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 8339 - 8349
[6] Joint Semi-Supervised and Active Learning via 3D Consistency for 3D Object Detection
Hwang, Sihwan
Kim, Sanmin
Kim, Youngseok
Kum, Dongsuk
2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 4819 - 4825
[7] DetMatch: Two Teachers are Better than One for Joint 2D and 3D Semi-Supervised Object Detection
Park, Jinhyung
Xu, Chenfeng
Zhou, Yiyang
Tomizuka, Masayoshi
Zhan, Wei
COMPUTER VISION, ECCV 2022, PT X, 2022, 13670 : 370 - 389
[8] Transferable Semi-Supervised 3D Object Detection From RGB-D Data
Tang, Yew Siang
Lee, Gim Hee
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1931 - 1940
[9] Diffusion-SS3D: Diffusion Model for Semi-supervised 3D Object Detection
Ho, Cheng-Ju
Tai, Chen-Hsuan
Lin, Yen-Yu
Yang, Ming-Hsuan
Tsai, Yi-Hsuan
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[10] Semi-Supervised Online Continual Learning for 3D Object Detection in Mobile Robotics
Liu, Binhong
Yao, Dexin
Yang, Rui
Yan, Zhi
Yang, Tao
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2024, 110 (04)
