F-PVNet: Frustum-Level 3-D Object Detection on Point-Voxel Feature Representation for Autonomous Driving

被引:6
|
作者
Tao, Chongben [1 ]
Fu, Shiping [1 ]
Wang, Chen [1 ]
Luo, Xizhao [2 ]
Li, Huayi [1 ]
Gao, Zhen [3 ]
Zhang, Zufeng [4 ]
Zheng, Sifa [4 ]
机构
[1] Suzhou Univ Sci & Technol, Sch Elect & Informat Engn, Suzhou 215009, Peoples R China
[2] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China
[3] McMaster Univ, Fac Engn, Hamilton, ON L8S 0A3, Canada
[4] Tsinghua Univ, Sch Vehicle & Mobil, Beijing 100084, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Three-dimensional displays; Feature extraction; Point cloud compression; Object detection; Heuristic algorithms; Estimation; Proposals; 3-D object detection; autonomous driving; fully convolutional network (FCN); point voxel fusion; sliding frustum;
D O I
10.1109/JIOT.2022.3231369
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Current 3-D object detection technology for autonomous driving usually cannot efficiently utilize local sensitive points. Meanwhile, contextual feature extracted from a object is not sufficient, which easily leads to deteriorated detection accuracy of the final object estimation. For the problems, a point-voxel-based 3-D dynamic object detection algorithm is proposed. First, local points are grouped with a camera frustum. Then, the global feature extracted by the submanifold 3-D voxel CNNs is aggregated into frustum key points. Second, a module of vector pool with feature aggregation is used to aggregate multiscale features of the point cloud. Moreover, the frustum raw feature and BEV feature are used for feature extension. Subsequently, the fine multiscale feature extracted from the point cloud is used as input to a subsequent fully convolutional network for final classification and continuous estimation of oriented 3-D boxes. The proposed method was compared with other state-of-the-art algorithms on the KITTI, Waymo, and nuScenes data sets. Experimental results showed that the proposed algorithm was better in accuracy, robustness, and generalization capabilities in 3-D dynamic object detection. Experiments on a real scenario and extensive ablation studies also demonstrated that the proposed algorithm not only effectively controls computational cost but also achieved more efficient results in 3-D object detection.
引用
收藏
页码:8031 / 8045
页数:15
相关论文
共 50 条
  • [31] Robust LiDAR-Camera 3-D Object Detection With Object-Level Feature Fusion
    Chen, Yongxiang
    Yan, Fuwu
    Yin, Zhishuai
    Nie, Linzhen
    Tao, Bo
    Miao, Mingze
    Zheng, Ningyu
    Zhang, Pei
    Zeng, Junyuan
    IEEE SENSORS JOURNAL, 2024, 24 (18) : 29108 - 29120
  • [32] TransMRE: Multiple Observation Planes Representation Encoding With Fully Sparse Voxel Transformers for 3-D Object Detection
    Zhu, Ziming
    Zhu, Yu
    Zhang, Kezhi
    Li, Hangyu
    Ling, Xiaofeng
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [33] HVPR: Hybrid Voxel-Point Representation for Single-stage 3D Object Detection
    Noh, Jongyoun
    Lee, Sanghoon
    Ham, Bumsub
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 14600 - 14609
  • [34] Adversarial point cloud perturbations against 3D object detection in autonomous driving systems
    Wang, Xupeng
    Cai, Mumuxin
    Sohel, Ferdous
    Sang, Nan
    Chang, Zhengwei
    NEUROCOMPUTING, 2021, 466 : 27 - 36
  • [35] Evaluation of Point Cloud Data Augmentation for 3D-LiDAR Object Detection in Autonomous Driving
    Martins, Marta
    Gomes, Iago P.
    Wolf, Denis Fernando
    Premebida, Cristiano
    ROBOT 2023: SIXTH IBERIAN ROBOTICS CONFERENCE ADVANCES IN ROBOTICS, VOL 1, 2024, 976 : 82 - 92
  • [36] Channelwise and Spatially Guided Multimodal Feature Fusion Network for 3-D Object Detection in Autonomous Vehicles
    Uzair, Muhammad
    Dong, Jian
    Shi, Ronghua
    Mushtaq, Husnain
    Ullah, Irshad
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [37] MSL3D: 3D object detection from monocular, stereo and point cloud for autonomous driving
    Chen, Wenyu
    Li, Peixuan
    Zhao, Huaici
    NEUROCOMPUTING, 2022, 494 : 23 - 32
  • [38] Efficient flexible voxel-based two-stage network for 3D object detection in autonomous driving
    Sun, Fanyue
    Tong, Guoxiang
    Song, Yan
    APPLIED SOFT COMPUTING, 2024, 162
  • [39] Voxel-FPN: Multi-Scale Voxel Feature Aggregation for 3D Object Detection from LIDAR Point Clouds
    Kuang, Hongwu
    Wang, Bei
    An, Jianping
    Zhang, Ming
    Zhang, Zehan
    SENSORS, 2020, 20 (03)
  • [40] RT3D: Real-Time 3-D Vehicle Detection in LiDAR Point Cloud for Autonomous Driving
    Zeng, Yiming
    Hu, Yu
    Liu, Shice
    Ye, Jing
    Han, Yinhe
    Li, Xiaowei
    Sun, Ninghui
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3 (04): : 3434 - 3440