RoarNet: A Robust 3D Object Detection based on RegiOn Approximation Refinement

被引:0
|
作者
Shin, Kiwoo [1 ,2 ]
Kwon, Youngwook Paul [1 ,3 ]
Tomizuka, Masayoshi [1 ,2 ]
机构
[1] Univ Calif Berkeley, Dept Mech Engn, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Mech Syst Control Lab, Berkeley, CA 94720 USA
[3] Phantom AI Inc, Burlingame, CA USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present RoarNet, a new approach for 31) object detection from 21) image and 31) Lidar point clouds. Rased on two stage object detection framework ([II, [2]) with PointNet [3[ as our backbone network, we suggest several novel ideas to improve all object detection performance. The first part of our method, RoarNet_2D, estimates the 3D poses of objects from a monocular image, which approximates where to examine further, and derives multiple candidates that are geometrically feasible. This step significantly narrows down feasible 3D regions, which otherwise requires demanding processing of 3D point clouds in a huge search space. Then the second part, RoarNet_3D, takes the candidate regions and conducts in-depth inferences to conclude final poses in a recursive manner. inspired by PointNet RoarNet_3D processes 3D point clouds directly without any loss of data, leading to precise detection. We evaluate our method in KITTI, a 3D object detection benchmark. Our result shows that RoarNet has superior performance to state-of-the-art methods that are publicly available. Remarkably. RoarNet also outperforms state-of-the-art methods even in settings where Lidar and camera are not time synchronized, which is practically important for actual driving environment.
引用
收藏
页码:2510 / 2515
页数:6
相关论文
共 50 条
  • [41] Reinforcing LiDAR-Based 3D Object Detection with RGB and 3D Information
    Liu, Wenjian
    Zhou, Yue
    NEURAL INFORMATION PROCESSING (ICONIP 2019), PT II, 2019, 11954 : 199 - 209
  • [42] Effective 3D object detection based on detector and tracker
    Nie, Weizhi
    Liu, Anan
    Wang, Zhongyang
    Su, Yuting
    NEUROCOMPUTING, 2016, 215 : 63 - 70
  • [43] Semantic Frustum Based VoxelNet for 3D Object Detection
    Chen, Feng
    Wu, Fei
    Huang, Qinghua
    Feng, Yujian
    Ge, Qi
    Ji, Yimu
    Hu, Chang-Hui
    Jing, Xiao-Yuan
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 7629 - 7634
  • [44] 3D Object Detection Based on Improved Frustum PointNet
    Liu Xunhua
    Sun Shaoyuan
    Gu Lipeng
    Li Xiang
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (20)
  • [45] Detection-based Object Labeling in 3D Scenes
    Lai, Kevin
    Bo, Liefeng
    Ren, Xiaofeng
    Fox, Dieter
    2012 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2012, : 1330 - 1337
  • [46] Improved 3D Object Detection Method Based on PointPillars
    Han, Zhenguo
    Li, Xu
    Xu, Hengxin
    Song, Hongzheng
    2024 4TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND INTELLIGENT SYSTEMS ENGINEERING, MLISE 2024, 2024, : 163 - 166
  • [47] 3D Object Detection and Tracking Based on Streaming Data
    Guo, Xusen
    Gu, Jianfeng
    Guo, Silu
    Xu, Zixiao
    Yang, Chengzhang
    Liu, Shanghua
    Cheng, Long
    Huang, Kai
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 8376 - 8382
  • [48] 3D Object Detection in Substation Scene Based on Voxelization
    Wang, Dawei
    Hu, Fan
    Zhang, Na
    Yang, Gang
    Lu, Jiyuan
    Zhang, Xingzhong
    Computer Engineering and Applications, 2024, 60 (11) : 328 - 335
  • [49] A review of 3D object detection based on autonomous driving
    Wang, Huijuan
    Chen, Xinyue
    Yuan, Quanbo
    Liu, Peng
    VISUAL COMPUTER, 2025, 41 (03): : 1757 - 1775
  • [50] LiDAR 3D Object Detection Based on Improved PointRCNN
    Gao, Han
    Chen, Ying
    Ni, Lizheng
    Deng, Xiuhan
    Zhong, Kai
    Yan, Chengzhi
    LASER & OPTOELECTRONICS PROGRESS, 2024, 61 (22)