M3D-RPN: Monocular 3D Region Proposal Network for Object Detection

被引:320
|
作者
Brazil, Garrick [1 ]
Liu, Xiaoming [1 ]
机构
[1] Michigan State Univ, E Lansing, MI 48824 USA
关键词
D O I
10.1109/ICCV.2019.00938
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Understanding the world in 3D is a critical component of urban autonomous driving. Generally, the combination of expensive LiDAR sensors and stereo RGB imaging has been paramount for successful 3D object detection algorithms, whereas monocular image-only methods experience drastically reduced performance. We propose to reduce the gap by reformulating the monocular 3D detection problem as a standalone 3D region proposal network. We leverage the geometric relationship of 2D and 3D perspectives, allowing 3D boxes to utilize well-known and powerful convolutional features generated in the image-space. To help address the strenuous 3D parameter estimations, we further design depth-aware convolutional layers which enable location specific feature development and in consequence improved 3D scene understanding. Compared to prior work in monocular 3D detection, our method consists of only the proposed 3D region proposal network rather than relying on external networks, data, or multiple stages. M3D-RPN is able to significantly improve the performance of both monocular 3D Object Detection and Bird's Eye View tasks within the KITTI urban autonomous driving dataset, while efficiently using a shared multi-class model.
引用
收藏
页码:9286 / 9295
页数:10
相关论文
共 50 条
  • [1] ARPNET: attention region proposal network for 3D object detection
    Yangyang Ye
    Chi Zhang
    Xiaoli Hao
    Science China Information Sciences, 2019, 62
  • [2] ARPNET: attention region proposal network for 3D object detection
    Yangyang YE
    Chi ZHANG
    Xiaoli HAO
    ScienceChina(InformationSciences), 2019, 62 (12) : 44 - 52
  • [3] ARPNET: attention region proposal network for 3D object detection
    Ye, Yangyang
    Zhang, Chi
    Hao, Xiaoli
    SCIENCE CHINA-INFORMATION SCIENCES, 2019, 62 (12)
  • [4] 3D Object Detection Based on Proposal Generation Network Utilizing Monocular Images
    ul Haq, Qazi Mazhar
    Haq, Muhamad Amirul
    Ruan, Shanq-Jang
    Liang, Pei-Jung
    Gao, De-Qin
    IEEE CONSUMER ELECTRONICS MAGAZINE, 2022, 11 (05) : 47 - 53
  • [5] A New Monocular 3D Object Detection with Neural Network
    Hong, Weijie
    Liu, Yiguang
    Zheng, Yunan
    Wang, Ying
    Shi, Xuelei
    PATTERN RECOGNITION AND COMPUTER VISION (PRCV 2018), PT IV, 2018, 11259 : 174 - 185
  • [6] Aerial Monocular 3D Object Detection
    Hu, Yue
    Fang, Shaoheng
    Xie, Weidi
    Chen, Siheng
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (04) : 1959 - 1966
  • [7] Disentangling Monocular 3D Object Detection
    Simonelli, Andrea
    Bulo, Samuel Rota
    Porzi, Lorenzo
    Lopez-Antequera, Manuel
    Kontschieder, Peter
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1991 - 1999
  • [8] Geometry Uncertainty Projection Network for Monocular 3D Object Detection
    Lu, Yan
    Ma, Xinzhu
    Yang, Lei
    Zhang, Tianzhu
    Liu, Yating
    Chu, Qi
    Yan, Junjie
    Ouyang, Wanli
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3091 - 3101
  • [9] Depth-enhancement network for monocular 3D object detection
    Liu, Guohua
    Lian, Haiyang
    Guo, Changrui
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2024, 35 (09)
  • [10] Categorical Depth Distribution Network for Monocular 3D Object Detection
    Reading, Cody
    Harakeh, Ali
    Chae, Julia
    Waslander, Steven L.
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8551 - 8560