F-PVNet: Frustum-Level 3-D Object Detection on Point-Voxel Feature Representation for Autonomous Driving

被引：6

作者：

Tao, Chongben ^{[1
]}

Fu, Shiping ^{[1
]}

Wang, Chen ^{[1
]}

Luo, Xizhao ^{[2
]}

Li, Huayi ^{[1
]}

Gao, Zhen ^{[3
]}

Zhang, Zufeng ^{[4
]}

Zheng, Sifa ^{[4
]}

机构：

[1] Suzhou Univ Sci & Technol, Sch Elect & Informat Engn, Suzhou 215009, Peoples R China

[2] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China

[3] McMaster Univ, Fac Engn, Hamilton, ON L8S 0A3, Canada

[4] Tsinghua Univ, Sch Vehicle & Mobil, Beijing 100084, Peoples R China

来源：

IEEE INTERNET OF THINGS JOURNAL | 2023年 / 10卷 / 09期

基金：

中国国家自然科学基金; 中国博士后科学基金;

关键词：

Three-dimensional displays; Feature extraction; Point cloud compression; Object detection; Heuristic algorithms; Estimation; Proposals; 3-D object detection; autonomous driving; fully convolutional network (FCN); point voxel fusion; sliding frustum;

D O I：

10.1109/JIOT.2022.3231369

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Current 3-D object detection technology for autonomous driving usually cannot efficiently utilize local sensitive points. Meanwhile, contextual feature extracted from a object is not sufficient, which easily leads to deteriorated detection accuracy of the final object estimation. For the problems, a point-voxel-based 3-D dynamic object detection algorithm is proposed. First, local points are grouped with a camera frustum. Then, the global feature extracted by the submanifold 3-D voxel CNNs is aggregated into frustum key points. Second, a module of vector pool with feature aggregation is used to aggregate multiscale features of the point cloud. Moreover, the frustum raw feature and BEV feature are used for feature extension. Subsequently, the fine multiscale feature extracted from the point cloud is used as input to a subsequent fully convolutional network for final classification and continuous estimation of oriented 3-D boxes. The proposed method was compared with other state-of-the-art algorithms on the KITTI, Waymo, and nuScenes data sets. Experimental results showed that the proposed algorithm was better in accuracy, robustness, and generalization capabilities in 3-D dynamic object detection. Experiments on a real scenario and extensive ablation studies also demonstrated that the proposed algorithm not only effectively controls computational cost but also achieved more efficient results in 3-D object detection.

引用

页码：8031 / 8045

页数：15

共 50 条

[1] Accelerating Point-Voxel Representation of 3-D Object Detection for Automatic Driving
Cao J.
Tao C.
Zhang Z.
Gao Z.
Luo X.
Zheng S.
Zhu Y.
IEEE Transactions on Artificial Intelligence, 2024, 5 (01): : 254 - 266
[2] PV-RCNN++: Point-Voxel Feature Set Abstraction With Local Vector Representation for 3D Object Detection
Shaoshuai Shi
Li Jiang
Jiajun Deng
Zhe Wang
Chaoxu Guo
Jianping Shi
Xiaogang Wang
Hongsheng Li
International Journal of Computer Vision, 2023, 131 : 531 - 551
[3] Improved Point-Voxel Region Convolutional Neural Network: 3D Object Detectors for Autonomous Driving
Li, Yujie
Yang, Shuo
Zheng, Yuchao
Lu, Huimin
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (07) : 9311 - 9317
[4] PV-RCNN++: semantical point-voxel feature interaction for 3D object detection
Peng Wu
Lipeng Gu
Xuefeng Yan
Haoran Xie
Fu Lee Wang
Gary Cheng
Mingqiang Wei
The Visual Computer, 2023, 39 (6) : 2425 - 2440
[5] PV-RCNN plus plus : Point-Voxel Feature Set Abstraction With Local Vector Representation for 3D Object Detection
Shi, Shaoshuai
Jiang, Li
Deng, Jiajun
Wang, Zhe
Guo, Chaoxu
Shi, Jianping
Wang, Xiaogang
Li, Hongsheng
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (02) : 531 - 551
[6] HCPVF: Hierarchical Cascaded Point-Voxel Fusion for 3D Object Detection
Fan, Baojie
Zhang, Kexin
Tian, Jiandong
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 8997 - 9009
[7] PV-RCNN plus plus : semantical point-voxel feature interaction for 3D object detection
Wu, Peng
Gu, Lipeng
Yan, Xuefeng
Xie, Haoran
Wang, Fu Lee
Cheng, Gary
Wei, Mingqiang
VISUAL COMPUTER, 2023, 39 (06): : 2425 - 2440
[8] ASPVNet: Attention Based Sparse Point-Voxel Network for 3D Object Detection
Yu, Bingxin
Wang, Lu
He, Yuhong
Wang, Xiaoyang
Cheng, Jun
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT X, 2025, 15040 : 161 - 176
[9] A symbolic representation for 3-D object feature detection
Neal, PJ
Shapiro, LG
15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS: COMPUTER VISION AND IMAGE ANALYSIS, 2000, : 221 - 224
[10] Point-Voxel and Bird-Eye-View Representation Aggregation Network for Single Stage 3D Object Detection
Ning, Kanglin
Liu, Yanfei
Su, Yanzhao
Jiang, Ke
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (03) : 3223 - 3235

← 1 2 3 4 5 →