MSPV3D: Multi-Scale Point-Voxels 3D Object Detection Net

Cited by: 0
Authors
Zhang, Zheng [1]
Bao, Zhiping [1]
Wei, Yun [2]
Zhou, Yongsheng [3]
Li, Ming [2]
Tian, Qing [1]
Affiliations
[1] North China Univ Technol, Sch Informat, Beijing 100144, Peoples R China
[2] Beijing Mass Transit Railway Operat Co Ltd, Corp Informat, Beijing 100044, Peoples R China
[3] Beijing Univ Chem Technol, Sch Informat, Beijing 100029, Peoples R China
Keywords
target detection; target recognition; deep learning
DOI
10.3390/rs16173146
Chinese Library Classification number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Autonomous vehicle technology is advancing, and 3D object detection based on point clouds is a crucial part of it. However, the irregularity, sparsity, and large data volume of point clouds, together with irrelevant background points, hinder detection accuracy. We propose a two-stage multi-scale 3D object detection network. First, because the ground usually produces a large number of useless background points during detection, we propose a new ground filtering algorithm that increases the proportion of foreground points and improves the accuracy and efficiency of the two-stage detection. Second, because the targets to be detected vary in size and single-scale voxelization may lose too much detailed information, voxels of different scales are introduced to extract features of objects of different scales in the point cloud, and these features are integrated into the second-stage detection. Lastly, a multi-scale feature fusion module is proposed that simultaneously enhances and integrates the features extracted from voxels of different scales. This module makes full use of the information present in the point cloud at various scales, leading to more precise 3D object detection. Experiments are conducted on the KITTI and nuScenes datasets. Compared with our baseline, "Pedestrian" detection improved by 3.37-2.72% and "Cyclist" detection by 3.79-1.32% across difficulty levels on KITTI, and NDS and mAP on nuScenes increased by 2.4% and 3.6%, respectively.
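The two preprocessing ideas named in the abstract, removing ground points and voxelizing at several scales, can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not specify their ground filter, so a plain height threshold stands in for it, and the function names, the threshold, and the voxel sizes below are all hypothetical choices made for illustration.

```python
from collections import defaultdict
from typing import Dict, List, Sequence, Tuple

Point = Tuple[float, float, float]
VoxelKey = Tuple[int, int, int]


def filter_ground(points: List[Point], z_threshold: float) -> List[Point]:
    """Keep points above z_threshold (a crude stand-in for a ground filter)."""
    return [p for p in points if p[2] > z_threshold]


def voxelize(points: List[Point], voxel_size: float) -> Dict[VoxelKey, Point]:
    """Group points into cubic voxels and return the mean point per voxel."""
    buckets: Dict[VoxelKey, List[Point]] = defaultdict(list)
    for x, y, z in points:
        key = (int(x // voxel_size), int(y // voxel_size), int(z // voxel_size))
        buckets[key].append((x, y, z))
    return {
        key: tuple(sum(axis) / len(pts) for axis in zip(*pts))
        for key, pts in buckets.items()
    }


def multi_scale_voxelize(
    points: List[Point], voxel_sizes: Sequence[float]
) -> Dict[float, Dict[VoxelKey, Point]]:
    """Voxelize the same point set once per voxel size (assumed sizes)."""
    return {size: voxelize(points, size) for size in voxel_sizes}


# Toy point cloud: two points near the (assumed) ground height, three above it.
points = [
    (0.1, 0.1, -1.6),
    (0.2, 0.3, 0.5),
    (0.4, 0.1, 0.7),
    (1.3, 0.2, 0.6),
    (5.0, 5.0, -1.7),
]
foreground = filter_ground(points, z_threshold=-1.5)
scales = multi_scale_voxelize(foreground, voxel_sizes=(0.5, 2.0))
for size, voxels in scales.items():
    print(size, len(voxels))
```

In this toy case the fine grid keeps nearby points in separate voxels while the coarse grid merges them, which is the trade-off that motivates combining features from several scales. A real detector would learn per-voxel features rather than use the mean point.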
Pages: 15