SMOKE: Single-Stage Monocular 3D Object Detection via Keypoint Estimation

Cited by: 230
Authors
Liu, Zechen [1]
Wu, Zizhang [1]
Toth, Roland [2]
Affiliations
[1] ZongMu Tech, Beijing, Peoples R China
[2] TU/e (Eindhoven University of Technology), Eindhoven, Netherlands
DOI
10.1109/CVPRW50498.2020.00506
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Estimating the 3D orientation and translation of objects is essential for infrastructure-less autonomous navigation and driving. In the case of monocular vision, successful methods have been mainly based on two ingredients: (i) a network generating 2D region proposals, and (ii) an R-CNN structure predicting 3D object pose by utilizing the acquired regions of interest. We argue that the 2D detection network is redundant and introduces non-negligible noise for 3D detection. Hence, in this paper we propose a novel 3D object detection method, named SMOKE, that predicts a 3D bounding box for each detected object by combining a single keypoint estimate with regressed 3D variables. As a second contribution, we propose a multi-step disentangling approach for constructing the 3D bounding box, which significantly improves both training convergence and detection accuracy. In contrast to previous 3D detection techniques, our method does not require complicated pre/post-processing, extra data, or a refinement stage. Despite its structural simplicity, our proposed SMOKE network outperforms all existing monocular 3D detection methods on the KITTI dataset, giving the best state-of-the-art result on both 3D object detection and bird's-eye-view evaluation. The code is available at https://github.com/lzccccc/SMOKE.
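
As a rough illustration of how a single keypoint estimate can be combined with regressed 3D variables to form a 3D bounding box, the sketch below lifts a pixel keypoint to camera coordinates with a pinhole model and places a rotated box around it. The abstract does not spell out the parameterization, so this is not the SMOKE implementation: the function names, the intrinsics, the box-centered convention, and all numeric values are assumptions made for illustration only.

```python
import math


def backproject_keypoint(u, v, depth, fx, fy, cx, cy):
    """Lift a pixel keypoint with a given depth to camera coordinates (pinhole model)."""
    x = (u - cx) * depth / fx
    y = (v - cy) * depth / fy
    return (x, y, depth)


def box_corners(center, dims, yaw):
    """Return the 8 corners of a box with dims=(h, w, l), rotated by yaw about the camera Y axis.

    Assumption: `center` is the geometric center of the box, not the bottom-face center.
    """
    h, w, l = dims
    cos_y, sin_y = math.cos(yaw), math.sin(yaw)
    corners = []
    for sx in (-0.5, 0.5):
        for sy in (-0.5, 0.5):
            for sz in (-0.5, 0.5):
                dx, dy, dz = sx * l, sy * h, sz * w
                # Rotation about Y: x' = cos*dx + sin*dz, z' = -sin*dx + cos*dz
                corners.append((
                    center[0] + cos_y * dx + sin_y * dz,
                    center[1] + dy,
                    center[2] - sin_y * dx + cos_y * dz,
                ))
    return corners


# Hypothetical network outputs and camera intrinsics (illustrative values only).
center = backproject_keypoint(u=700.0, v=200.0, depth=20.0,
                              fx=720.0, fy=720.0, cx=610.0, cy=175.0)
corners = box_corners(center, dims=(1.5, 1.6, 3.9), yaw=0.3)
```

In this sketch the keypoint fixes where the box sits in the image, while depth, dimensions, and yaw are treated as the regressed quantities that fix its 3D extent; no separate 2D detection stage or refinement step is involved.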
Pages: 4289-4298
Number of pages: 10