PV-RCNN++: Point-Voxel Feature Set Abstraction With Local Vector Representation for 3D Object Detection

被引:0
|
作者
Shaoshuai Shi
Li Jiang
Jiajun Deng
Zhe Wang
Chaoxu Guo
Jianping Shi
Xiaogang Wang
Hongsheng Li
机构
[1] The Chinese University of Hong Kong,
[2] Max Planck Institute for Informatics,undefined
[3] The University of Sydney,undefined
[4] SenseTime Research,undefined
来源
关键词
3D object Detection; Point clouds; LiDAR; Autonomous driving; Sparse convolution;
D O I
暂无
中图分类号
学科分类号
摘要
3D object detection is receiving increasing attention from both industry and academia thanks to its wide applications in various fields. In this paper, we propose Point-Voxel Region-based Convolution Neural Networks (PV-RCNNs) for 3D object detection on point clouds. First, we propose a novel 3D detector, PV-RCNN, which boosts the 3D detection performance by deeply integrating the feature learning of both point-based set abstraction and voxel-based sparse convolution through two novel steps, i.e., the voxel-to-keypoint scene encoding and the keypoint-to-grid RoI feature abstraction. Second, we propose an advanced framework, PV-RCNN++, for more efficient and accurate 3D object detection. It consists of two major improvements: sectorized proposal-centric sampling for efficiently producing more representative keypoints, and VectorPool aggregation for better aggregating local point features with much less resource consumption. With these two strategies, our PV-RCNN++ is about 3×\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$3\times $$\end{document} faster than PV-RCNN, while also achieving better performance. The experiments demonstrate that our proposed PV-RCNN++ framework achieves state-of-the-art 3D detection performance on the large-scale and highly-competitive Waymo Open Dataset with 10 FPS inference speed on the detection range of 150m×150m\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$150m \times 150m$$\end{document}.
引用
收藏
页码:531 / 551
页数:20
相关论文
共 50 条
  • [1] PV-RCNN plus plus : Point-Voxel Feature Set Abstraction With Local Vector Representation for 3D Object Detection
    Shi, Shaoshuai
    Jiang, Li
    Deng, Jiajun
    Wang, Zhe
    Guo, Chaoxu
    Shi, Jianping
    Wang, Xiaogang
    Li, Hongsheng
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (02) : 531 - 551
  • [2] PV-RCNN++: semantical point-voxel feature interaction for 3D object detection
    Peng Wu
    Lipeng Gu
    Xuefeng Yan
    Haoran Xie
    Fu Lee Wang
    Gary Cheng
    Mingqiang Wei
    The Visual Computer, 2023, 39 (6) : 2425 - 2440
  • [3] PV-RCNN plus plus : semantical point-voxel feature interaction for 3D object detection
    Wu, Peng
    Gu, Lipeng
    Yan, Xuefeng
    Xie, Haoran
    Wang, Fu Lee
    Cheng, Gary
    Wei, Mingqiang
    VISUAL COMPUTER, 2023, 39 (06): : 2425 - 2440
  • [4] SASAN: Shape-Adaptive Set Abstraction Network for Point-Voxel 3D Object Detection
    Zhang, Hui
    Luo, Guiyang
    Wang, Xiao
    Li, Yidong
    Ding, Weiping
    Wang, Fei-Yue
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 15
  • [5] Accelerating Point-Voxel Representation of 3-D Object Detection for Automatic Driving
    Cao J.
    Tao C.
    Zhang Z.
    Gao Z.
    Luo X.
    Zheng S.
    Zhu Y.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (01): : 254 - 266
  • [6] PP-RCNN: Point-Pillars Feature Set Abstraction for 3D Real-time Object Detection
    Tu, Jiayin
    Wang, Ping
    Liu, Fuqiang
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [7] HCPVF: Hierarchical Cascaded Point-Voxel Fusion for 3D Object Detection
    Fan, Baojie
    Zhang, Kexin
    Tian, Jiandong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 8997 - 9009
  • [8] Point-Voxel Fusion for Multimodal 3D Detection
    Wang, Ke
    Zhang, Zhichuang
    2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 1716 - 1719
  • [9] ASPVNet: Attention Based Sparse Point-Voxel Network for 3D Object Detection
    Yu, Bingxin
    Wang, Lu
    He, Yuhong
    Wang, Xiaoyang
    Cheng, Jun
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT X, 2025, 15040 : 161 - 176
  • [10] Point-Voxel and Bird-Eye-View Representation Aggregation Network for Single Stage 3D Object Detection
    Ning, Kanglin
    Liu, Yanfei
    Su, Yanzhao
    Jiang, Ke
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (03) : 3223 - 3235