Lidar Point Cloud Guided Monocular 3D Object Detection

Cited by: 23
Authors
Peng, Liang [1, 2]
Liu, Fei
Yu, Zhengxu [1]
Yan, Senbo [1, 2]
Deng, Dan [2]
Yang, Zheng [2]
Liu, Haifeng [1]
Cai, Deng [1, 2]
Affiliations
[1] Zhejiang Univ, State Key Lab CAD & CG, Hangzhou, Peoples R China
[2] Fabu Inc, Hangzhou, Peoples R China
Keywords
Monocular 3D detection; LiDAR point cloud; Self-driving
DOI
10.1007/978-3-031-19769-7_8
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Monocular 3D object detection is a challenging task in the self-driving and computer vision communities. Most previous works rely on manually annotated 3D box labels, which are expensive to produce. In this paper, we find that precisely and carefully annotated labels may be unnecessary for monocular 3D detection, an interesting and counterintuitive result. Trained on rough labels that are randomly disturbed, a detector achieves accuracy very close to that of one trained on the ground-truth labels. We investigate the underlying mechanism and find empirically that, with respect to label accuracy, the 3D location component of a label matters more than its other components. Motivated by these conclusions and by the precision of LiDAR 3D measurements, we propose a simple and effective framework, dubbed LiDAR point cloud guided monocular 3D object detection (LPCG). The framework can either reduce annotation costs or considerably boost detection accuracy without introducing extra annotation costs. Specifically, it generates pseudo labels from unlabeled LiDAR point clouds. Because LiDAR measurements are accurate in 3D space, the 3D location information in these pseudo labels is precise, so they can replace manually annotated labels when training monocular 3D detectors. LPCG can be applied to any monocular 3D detector to make full use of the massive unlabeled data in a self-driving system. As a result, on the KITTI benchmark, we take first place in both monocular 3D and BEV (bird's-eye-view) detection by a significant margin. On the Waymo benchmark, our method using 10% labeled data achieves accuracy comparable to the baseline detector using 100% labeled data. The code is released at https://github.com/SPengLiang/LPCG.
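The label-disturbance experiment mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say how the labels are disturbed, so the uniform noise model here is an assumption, and the names `Box3D` and `disturb_box` are hypothetical and not taken from the LPCG repository. The `keep_location` flag mirrors the idea of leaving the 3D location exact while the other label components are perturbed.

```python
import math
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class Box3D:
    """Hypothetical 3D box label: center and size in meters, yaw in radians."""
    x: float
    y: float
    z: float
    length: float
    width: float
    height: float
    yaw: float


def disturb_box(box, ratio, rng, keep_location=False):
    """Return a randomly disturbed copy of `box`.

    Assumed noise model (not from the paper): location and size are scaled by
    a factor drawn uniformly from [1 - ratio, 1 + ratio]; yaw receives additive
    noise drawn uniformly from [-ratio * pi, ratio * pi].
    """
    def scale(value):
        return value * (1.0 + rng.uniform(-ratio, ratio))

    if keep_location:
        x, y, z = box.x, box.y, box.z  # location left exact
    else:
        x, y, z = scale(box.x), scale(box.y), scale(box.z)

    return Box3D(
        x=x,
        y=y,
        z=z,
        length=scale(box.length),
        width=scale(box.width),
        height=scale(box.height),
        yaw=box.yaw + rng.uniform(-ratio, ratio) * math.pi,
    )


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so runs are repeatable
    label = Box3D(10.0, 1.5, 30.0, 4.0, 1.8, 1.5, 0.3)
    print(disturb_box(label, 0.1, rng))
    print(disturb_box(label, 0.1, rng, keep_location=True))
```

Training one detector on labels disturbed with `keep_location=False` and another with `keep_location=True` is one way to probe which label components the detector is sensitive to, under the assumptions stated above.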
Pages: 123-139
Number of pages: 17