F-PVNet: Frustum-Level 3-D Object Detection on Point-Voxel Feature Representation for Autonomous Driving

Cited by: 6

Authors:

Tao, Chongben [1]

Fu, Shiping [1]

Wang, Chen [1]

Luo, Xizhao [2]

Li, Huayi [1]

Gao, Zhen [3]

Zhang, Zufeng [4]

Zheng, Sifa [4]

Affiliations:

[1] Suzhou Univ Sci & Technol, Sch Elect & Informat Engn, Suzhou 215009, Peoples R China

[2] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China

[3] McMaster Univ, Fac Engn, Hamilton, ON L8S 0A3, Canada

[4] Tsinghua Univ, Sch Vehicle & Mobil, Beijing 100084, Peoples R China

Source:

IEEE INTERNET OF THINGS JOURNAL | 2023 / Vol. 10 / No. 09

Funding:

National Natural Science Foundation of China; China Postdoctoral Science Foundation;

Keywords:

Three-dimensional displays; Feature extraction; Point cloud compression; Object detection; Heuristic algorithms; Estimation; Proposals; 3-D object detection; autonomous driving; fully convolutional network (FCN); point voxel fusion; sliding frustum;

DOI:

10.1109/JIOT.2022.3231369

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract:

Current 3-D object detection methods for autonomous driving often fail to make efficient use of local sensitive points. Meanwhile, the contextual features extracted from an object are insufficient, which easily degrades the accuracy of the final object estimation. To address these problems, a point-voxel-based 3-D dynamic object detection algorithm is proposed. First, local points are grouped with a camera frustum. Then, the global feature extracted by the submanifold 3-D voxel CNNs is aggregated into frustum key points. Second, a vector pool module with feature aggregation is used to aggregate multiscale features of the point cloud. Moreover, the frustum raw feature and BEV feature are used for feature extension. Subsequently, the fine multiscale feature extracted from the point cloud is fed to a fully convolutional network for final classification and continuous estimation of oriented 3-D boxes. The proposed method was compared with other state-of-the-art algorithms on the KITTI, Waymo, and nuScenes data sets. Experimental results showed that the proposed algorithm achieved better accuracy, robustness, and generalization in 3-D dynamic object detection. Experiments on a real scenario and extensive ablation studies also demonstrated that the proposed algorithm not only effectively controlled computational cost but also achieved more efficient results in 3-D object detection.
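The first step of the abstract, grouping local points with a camera frustum, can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a pinhole camera with zero skew, points already expressed in the camera frame (z pointing forward), and a 2-D box supplied by some image detector. The function name `points_in_frustum`, the depth limits, and the toy intrinsics are all hypothetical.

```python
import numpy as np

def points_in_frustum(points_cam, K, box2d, z_min=0.1, z_max=80.0):
    """Boolean mask of points whose image projection falls inside box2d.

    points_cam: (N, 3) points in the camera frame (assumed: z is forward).
    K:          (3, 3) pinhole intrinsics (assumed: zero skew).
    box2d:      (u_min, v_min, u_max, v_max) in pixels.
    z_min/z_max are illustrative depth limits, not values from the paper.
    """
    points_cam = np.asarray(points_cam, dtype=float)
    z = points_cam[:, 2]
    in_depth = (z > z_min) & (z < z_max)
    # Avoid dividing by zero or negative depth; those points are masked out anyway.
    safe_z = np.where(in_depth, z, 1.0)
    uvw = points_cam @ np.asarray(K, dtype=float).T
    u = uvw[:, 0] / safe_z
    v = uvw[:, 1] / safe_z
    u_min, v_min, u_max, v_max = box2d
    in_box = (u >= u_min) & (u <= u_max) & (v >= v_min) & (v <= v_max)
    return in_depth & in_box

# Toy data: focal length 100 px, principal point at (50, 50).
K = np.array([[100.0, 0.0, 50.0],
              [0.0, 100.0, 50.0],
              [0.0, 0.0, 1.0]])
points = np.array([[0.0, 0.0, 10.0],    # projects to (50, 50): inside the box
                   [5.0, 0.0, 10.0],    # projects to (100, 50): outside the box
                   [0.0, 0.0, -5.0],    # behind the camera
                   [0.5, 0.5, 10.0]])   # projects to (55, 55): inside the box
mask = points_in_frustum(points, K, box2d=(40, 40, 60, 60))
frustum_points = points[mask]
```

The selected `frustum_points` are the kind of local group that later stages would consume. The voxel CNN, vector pool aggregation, and detection head described in the abstract are not sketched here.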


Pages: 8031 - 8045

Number of pages: 15

50 records in total

[31] Robust LiDAR-Camera 3-D Object Detection With Object-Level Feature Fusion
Chen, Yongxiang
Yan, Fuwu
Yin, Zhishuai
Nie, Linzhen
Tao, Bo
Miao, Mingze
Zheng, Ningyu
Zhang, Pei
Zeng, Junyuan
IEEE SENSORS JOURNAL, 2024, 24 (18) : 29108 - 29120
[32] TransMRE: Multiple Observation Planes Representation Encoding With Fully Sparse Voxel Transformers for 3-D Object Detection
Zhu, Ziming
Zhu, Yu
Zhang, Kezhi
Li, Hangyu
Ling, Xiaofeng
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
[33] HVPR: Hybrid Voxel-Point Representation for Single-stage 3D Object Detection
Noh, Jongyoun
Lee, Sanghoon
Ham, Bumsub
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021 : 14600 - 14609
[34] Adversarial point cloud perturbations against 3D object detection in autonomous driving systems
Wang, Xupeng
Cai, Mumuxin
Sohel, Ferdous
Sang, Nan
Chang, Zhengwei
NEUROCOMPUTING, 2021, 466 : 27 - 36
[35] Evaluation of Point Cloud Data Augmentation for 3D-LiDAR Object Detection in Autonomous Driving
Martins, Marta
Gomes, Iago P.
Wolf, Denis Fernando
Premebida, Cristiano
ROBOT 2023: SIXTH IBERIAN ROBOTICS CONFERENCE ADVANCES IN ROBOTICS, VOL 1, 2024, 976 : 82 - 92
[36] Channelwise and Spatially Guided Multimodal Feature Fusion Network for 3-D Object Detection in Autonomous Vehicles
Uzair, Muhammad
Dong, Jian
Shi, Ronghua
Mushtaq, Husnain
Ullah, Irshad
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
[37] MSL3D: 3D object detection from monocular, stereo and point cloud for autonomous driving
Chen, Wenyu
Li, Peixuan
Zhao, Huaici
NEUROCOMPUTING, 2022, 494 : 23 - 32
[38] Efficient flexible voxel-based two-stage network for 3D object detection in autonomous driving
Sun, Fanyue
Tong, Guoxiang
Song, Yan
APPLIED SOFT COMPUTING, 2024, 162
[39] Voxel-FPN: Multi-Scale Voxel Feature Aggregation for 3D Object Detection from LIDAR Point Clouds
Kuang, Hongwu
Wang, Bei
An, Jianping
Zhang, Ming
Zhang, Zehan
SENSORS, 2020, 20 (03)
[40] RT3D: Real-Time 3-D Vehicle Detection in LiDAR Point Cloud for Autonomous Driving
Zeng, Yiming
Hu, Yu
Liu, Shice
Ye, Jing
Han, Yinhe
Li, Xiaowei
Sun, Ninghui
IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3 (04) : 3434 - 3440
