MSPV3D: Multi-Scale Point-Voxels 3D Object Detection Net

被引:0
|
作者
Zhang, Zheng [1 ]
Bao, Zhiping [1 ]
Wei, Yun [2 ]
Zhou, Yongsheng [3 ]
Li, Ming [2 ]
Tian, Qing [1 ]
机构
[1] North China Univ Technol, Sch Informat, Beijing 100144, Peoples R China
[2] Beijing Mass Transit Railway Operat Co Ltd, Corp Informat, Beijing 100044, Peoples R China
[3] Beijing Univ Chem Technol, Sch Informat, Beijing 100029, Peoples R China
关键词
target detection; target recognition; deep learning;
D O I
10.3390/rs16173146
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Autonomous vehicle technology is advancing, with 3D object detection based on point clouds being crucial. However, point clouds' irregularity, sparsity, and large data volume, coupled with irrelevant background points, hinder detection accuracy. We propose a two-stage multi-scale 3D object detection network. Firstly, considering that a large number of useless background points are usually generated by the ground during detection, we propose a new ground filtering algorithm to increase the proportion of foreground points and enhance the accuracy and efficiency of the two-stage detection. Secondly, given that different types of targets to be detected vary in size, and the use of a single-scale voxelization may result in excessive loss of detailed information, the voxels of different scales are introduced to extract relevant features of objects of different scales in the point clouds and integrate them into the second-stage detection. Lastly, a multi-scale feature fusion module is proposed, which simultaneously enhances and integrates features extracted from voxels of different scales. This module fully utilizes the valuable information present in the point cloud across various scales, ultimately leading to more precise 3D object detection. The experiment is conducted on the KITTI dataset and the nuScenes dataset. Compared with our baseline, "Pedestrian" detection improved by 3.37-2.72% and "Cyclist" detection by 3.79-1.32% across difficulty levels on KITTI, and was boosted by 2.4% in NDS and 3.6% in mAP on nuScenes.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Scale invariant point feature (SIPF) for 3D point clouds and 3D multi-scale object detection
    Lin, Baowei
    Wang, Fasheng
    Zhao, Fangda
    Sun, Yi
    NEURAL COMPUTING & APPLICATIONS, 2018, 29 (05): : 1209 - 1224
  • [2] Retraction Note: Scale invariant point feature (SIPF) for 3D point clouds and 3D multi-scale object detection
    Baowei Lin
    Fasheng Wang
    Fangda Zhao
    Yi Sun
    Neural Computing and Applications, 2024, 36 (18) : 11065 - 11065
  • [3] RETRACTED ARTICLE: Scale invariant point feature (SIPF) for 3D point clouds and 3D multi-scale object detection
    Baowei Lin
    Fasheng Wang
    Fangda Zhao
    Yi Sun
    Neural Computing and Applications, 2018, 29 : 1209 - 1224
  • [4] PFSC:Pyramid R-CNN for Point-Voxels with Focal Sparse Convolutional Networks for 3D Object Detection
    Su, Zhao
    Liang, Xuehui
    Tong, Jigang
    Yang, Sen
    Du, Shengzhi
    2024 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, ICMA 2024, 2024, : 1843 - 1848
  • [5] Multi-Scale PointPillars 3D Object Detection Network
    Ya, Hang
    Luo, Guiming
    PROCEEDINGS OF THE 2019 IEEE 18TH INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS & COGNITIVE COMPUTING (ICCI*CC 2019), 2019, : 174 - 179
  • [6] DMSC-Net: A deep Multi-Scale context network for 3D object detection of indoor point clouds
    Zhang, Zhenxin
    Xu, Dixiang
    Mathiopoulos, Takis
    Wang, Qiang
    Zhang, Liqiang
    Xu, Zhihua
    Jiang, Jincheng
    Li, Zhen
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2023, 122
  • [7] Point Density-Aware Voxels for LiDAR 3D Object Detection
    Hu, Jordan S. K.
    Kuai, Tianshu
    Waslander, Steven L.
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8459 - 8468
  • [8] 3D-MSFC: A 3D multi-scale features compression method for object detection☆
    Li, Zhengxin
    Tian, Chongzhen
    Yuan, Hui
    Lu, Xin
    Malekmohamadi, Hossein
    DISPLAYS, 2024, 85
  • [9] 3D Point Cloud Object Detection Method Based on Multi-Scale Dynamic Sparse Voxelization
    Wang, Jiayu
    Liu, Ye
    Zhu, Yongjian
    Wang, Dong
    Zhang, Yu
    SENSORS, 2024, 24 (06)
  • [10] Multi-Scale Keypoints Feature Fusion Network for 3D Object Detection from Point Clouds
    Zhang, Xu
    Bai, Linjuan
    Zhang, Zuyu
    Li, Yan
    HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2022, 12