PV-RCNN++: Point-Voxel Feature Set Abstraction With Local Vector Representation for 3D Object Detection

Cited by: 0

Authors:

Shaoshuai Shi

Li Jiang

Jiajun Deng

Zhe Wang

Chaoxu Guo

Jianping Shi

Xiaogang Wang

Hongsheng Li

Affiliations:

[1] The Chinese University of Hong Kong

[2] Max Planck Institute for Informatics

[3] The University of Sydney

[4] SenseTime Research

Source:

International Journal of Computer Vision | 2023 / Vol. 131

Keywords:

3D object detection; Point clouds; LiDAR; Autonomous driving; Sparse convolution

DOI:

Not available

CLC classification number:

Subject classification number:

Abstract:

3D object detection is receiving increasing attention from both industry and academia thanks to its wide applications in various fields. In this paper, we propose Point-Voxel Region-based Convolution Neural Networks (PV-RCNNs) for 3D object detection on point clouds. First, we propose a novel 3D detector, PV-RCNN, which boosts the 3D detection performance by deeply integrating the feature learning of both point-based set abstraction and voxel-based sparse convolution through two novel steps, i.e., the voxel-to-keypoint scene encoding and the keypoint-to-grid RoI feature abstraction. Second, we propose an advanced framework, PV-RCNN++, for more efficient and accurate 3D object detection. It consists of two major improvements: sectorized proposal-centric sampling for efficiently producing more representative keypoints, and VectorPool aggregation for better aggregating local point features with much less resource consumption. With these two strategies, our PV-RCNN++ is about 3× faster than PV-RCNN, while also achieving better performance. The experiments demonstrate that our proposed PV-RCNN++ framework achieves state-of-the-art 3D detection performance on the large-scale and highly-competitive Waymo Open Dataset with 10 FPS inference speed on the detection range of 150 m × 150 m.
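The sectorized proposal-centric sampling named in the abstract can be illustrated with a minimal sketch. The abstract gives no implementation details, so everything here is an assumption made for illustration: proposals are reduced to a center plus a radius, sectors are equal azimuth slices around the sensor origin, each sector gets a keypoint budget proportional to its point count, and the function names and parameters (`extra_radius`, `num_sectors`) are hypothetical. It should not be read as the authors' implementation.

```python
import math


def farthest_point_sampling(points, k):
    """Greedy farthest point sampling over a list of coordinate tuples."""
    if k <= 0 or not points:
        return []
    k = min(k, len(points))
    chosen = [0]  # start from the first point (an arbitrary choice)
    # min_d2[i]: squared distance from point i to its nearest chosen point
    min_d2 = [math.dist(p, points[0]) ** 2 for p in points]
    while len(chosen) < k:
        nxt = max(range(len(points)), key=min_d2.__getitem__)
        chosen.append(nxt)
        for i, p in enumerate(points):
            d2 = math.dist(p, points[nxt]) ** 2
            if d2 < min_d2[i]:
                min_d2[i] = d2
    return [points[i] for i in chosen]


def sectorized_proposal_centric_sampling(points, proposals, num_keypoints,
                                         num_sectors=6, extra_radius=1.6):
    """Hypothetical sketch: filter points near proposals, then sample per sector.

    points:    list of (x, y, z) tuples
    proposals: list of (cx, cy, cz, radius) tuples (assumed simplification
               of 3D boxes to bounding spheres)
    """
    # Step 1 (proposal-centric): keep only points close to some proposal.
    kept = [p for p in points
            if any(math.dist(p, (cx, cy, cz)) <= r + extra_radius
                   for cx, cy, cz, r in proposals)]
    if not kept:
        return []

    # Step 2 (sectorized): split the kept points into azimuth sectors
    # around the origin so each sector can be sampled independently.
    sectors = [[] for _ in range(num_sectors)]
    for p in kept:
        angle = math.atan2(p[1], p[0]) + math.pi  # in [0, 2*pi]
        idx = min(int(angle / (2 * math.pi) * num_sectors), num_sectors - 1)
        sectors[idx].append(p)

    # Step 3: run FPS inside each sector with a budget proportional to the
    # sector's share of the kept points.
    keypoints = []
    for sec in sectors:
        if sec:
            budget = math.ceil(num_keypoints * len(sec) / len(kept))
            keypoints.extend(farthest_point_sampling(sec, budget))
    # Rounding up can overshoot the total budget, so truncate at the end.
    return keypoints[:num_keypoints]


if __name__ == "__main__":
    grid = [(float(x), float(y), 0.0)
            for x in range(-10, 11) for y in range(-10, 11)]
    boxes = [(5.0, 5.0, 0.0, 1.0), (-5.0, -5.0, 0.0, 1.0)]
    for kp in sectorized_proposal_centric_sampling(grid, boxes, 8, 4, 1.0):
        print(kp)
```

In this sketch the saving comes from two places: farthest point sampling only sees points near proposals, and its quadratic cost is paid per sector rather than over the whole scene. The per-sector loops are independent, so they could run in parallel.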

Citation

Pages: 531 - 551

Page count: 20

50 items in total

[1] PV-RCNN++: Point-Voxel Feature Set Abstraction With Local Vector Representation for 3D Object Detection
Shi, Shaoshuai
Jiang, Li
Deng, Jiajun
Wang, Zhe
Guo, Chaoxu
Shi, Jianping
Wang, Xiaogang
Li, Hongsheng
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (02) : 531 - 551
[2] PV-RCNN++: semantical point-voxel feature interaction for 3D object detection
Peng Wu
Lipeng Gu
Xuefeng Yan
Haoran Xie
Fu Lee Wang
Gary Cheng
Mingqiang Wei
The Visual Computer, 2023, 39 (6) : 2425 - 2440
[3] PV-RCNN++: semantical point-voxel feature interaction for 3D object detection
Wu, Peng
Gu, Lipeng
Yan, Xuefeng
Xie, Haoran
Wang, Fu Lee
Cheng, Gary
Wei, Mingqiang
VISUAL COMPUTER, 2023, 39 (06) : 2425 - 2440
[4] SASAN: Shape-Adaptive Set Abstraction Network for Point-Voxel 3D Object Detection
Zhang, Hui
Luo, Guiyang
Wang, Xiao
Li, Yidong
Ding, Weiping
Wang, Fei-Yue
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023 : 1 - 15
[5] Accelerating Point-Voxel Representation of 3-D Object Detection for Automatic Driving
Cao J.
Tao C.
Zhang Z.
Gao Z.
Luo X.
Zheng S.
Zhu Y.
IEEE Transactions on Artificial Intelligence, 2024, 5 (01) : 254 - 266
[6] PP-RCNN: Point-Pillars Feature Set Abstraction for 3D Real-time Object Detection
Tu, Jiayin
Wang, Ping
Liu, Fuqiang
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021
[7] HCPVF: Hierarchical Cascaded Point-Voxel Fusion for 3D Object Detection
Fan, Baojie
Zhang, Kexin
Tian, Jiandong
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 8997 - 9009
[8] Point-Voxel Fusion for Multimodal 3D Detection
Wang, Ke
Zhang, Zhichuang
2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 1716 - 1719
[9] ASPVNet: Attention Based Sparse Point-Voxel Network for 3D Object Detection
Yu, Bingxin
Wang, Lu
He, Yuhong
Wang, Xiaoyang
Cheng, Jun
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT X, 2025, 15040 : 161 - 176
[10] Point-Voxel and Bird-Eye-View Representation Aggregation Network for Single Stage 3D Object Detection
Ning, Kanglin
Liu, Yanfei
Su, Yanzhao
Jiang, Ke
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (03) : 3223 - 3235
