RoarNet: A Robust 3D Object Detection based on RegiOn Approximation Refinement

被引:0
|
作者
Shin, Kiwoo [1 ,2 ]
Kwon, Youngwook Paul [1 ,3 ]
Tomizuka, Masayoshi [1 ,2 ]
机构
[1] Univ Calif Berkeley, Dept Mech Engn, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Mech Syst Control Lab, Berkeley, CA 94720 USA
[3] Phantom AI Inc, Burlingame, CA USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present RoarNet, a new approach for 31) object detection from 21) image and 31) Lidar point clouds. Rased on two stage object detection framework ([II, [2]) with PointNet [3[ as our backbone network, we suggest several novel ideas to improve all object detection performance. The first part of our method, RoarNet_2D, estimates the 3D poses of objects from a monocular image, which approximates where to examine further, and derives multiple candidates that are geometrically feasible. This step significantly narrows down feasible 3D regions, which otherwise requires demanding processing of 3D point clouds in a huge search space. Then the second part, RoarNet_3D, takes the candidate regions and conducts in-depth inferences to conclude final poses in a recursive manner. inspired by PointNet RoarNet_3D processes 3D point clouds directly without any loss of data, leading to precise detection. We evaluate our method in KITTI, a 3D object detection benchmark. Our result shows that RoarNet has superior performance to state-of-the-art methods that are publicly available. Remarkably. RoarNet also outperforms state-of-the-art methods even in settings where Lidar and camera are not time synchronized, which is practically important for actual driving environment.
引用
收藏
页码:2510 / 2515
页数:6
相关论文
共 50 条
  • [21] A robust scheme for copy detection of 3D object point clouds
    Yang, Jiaqi
    Lu, Xuequan
    Chen, Wenzhi
    NEUROCOMPUTING, 2022, 510 : 181 - 192
  • [22] Diffusion Models-Based Purification for Common Corruptions on Robust 3D Object Detection
    Cai, Mumuxin
    Wang, Xupeng
    Sohel, Ferdous
    Lei, Hang
    SENSORS, 2024, 24 (16)
  • [23] 3D-FERNS+: VIEWPOINT-BASED KEYPOINT CLASSIFIER FOR ROBUST 3D OBJECT POSE DETECTION
    Kobayashi, Tatsuya
    Kato, Haruhisa
    Yanagihara, Hiromasa
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 3337 - 3341
  • [24] Camera and LiDAR-based point painted voxel region-based convolutional neural network for robust 3D object detection
    Xie, Han
    Zheng, Wenqi
    Chen, Yunfan
    Shin, Hyunchul
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (05)
  • [25] 3D Object Detection based on Geometrical Segmentation
    Teng, Zhou
    Xiao, Jing
    2013 INTERNATIONAL CONFERENCE ON COMPUTER AND ROBOT VISION (CRV), 2013, : 67 - 74
  • [26] 3D Object Detection Based on LiDAR Data
    Sahba, Ramin
    Sahba, Amin
    Jamshidi, Mo
    Rad, Paul
    2019 IEEE 10TH ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2019, : 511 - 514
  • [27] Model-based 3D object detection
    Biegelbauer, Georg
    Vincze, Markus
    Wohlkinger, Walter
    MACHINE VISION AND APPLICATIONS, 2010, 21 (04) : 497 - 516
  • [28] Improved 3D Object Detection Based on PointPillars
    Kong, Weiwei
    Du, Yusheng
    He, Leilei
    Li, Zejiang
    ELECTRONICS, 2024, 13 (15)
  • [29] ARPNET: attention region proposal network for 3D object detection
    Yangyang Ye
    Chi Zhang
    Xiaoli Hao
    Science China Information Sciences, 2019, 62
  • [30] 3D Object Pose Refinement in Range Images
    Zabulis, Xenophon
    Lourakis, Manolis
    Koutlemanis, Panagiotis
    COMPUTER VISION SYSTEMS (ICVS 2015), 2015, 9163 : 263 - 274