F-PVNet: Frustum-Level 3-D Object Detection on Point-Voxel Feature Representation for Autonomous Driving

被引:6
|
作者
Tao, Chongben [1 ]
Fu, Shiping [1 ]
Wang, Chen [1 ]
Luo, Xizhao [2 ]
Li, Huayi [1 ]
Gao, Zhen [3 ]
Zhang, Zufeng [4 ]
Zheng, Sifa [4 ]
机构
[1] Suzhou Univ Sci & Technol, Sch Elect & Informat Engn, Suzhou 215009, Peoples R China
[2] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China
[3] McMaster Univ, Fac Engn, Hamilton, ON L8S 0A3, Canada
[4] Tsinghua Univ, Sch Vehicle & Mobil, Beijing 100084, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Three-dimensional displays; Feature extraction; Point cloud compression; Object detection; Heuristic algorithms; Estimation; Proposals; 3-D object detection; autonomous driving; fully convolutional network (FCN); point voxel fusion; sliding frustum;
D O I
10.1109/JIOT.2022.3231369
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Current 3-D object detection technology for autonomous driving usually cannot efficiently utilize local sensitive points. Meanwhile, contextual feature extracted from a object is not sufficient, which easily leads to deteriorated detection accuracy of the final object estimation. For the problems, a point-voxel-based 3-D dynamic object detection algorithm is proposed. First, local points are grouped with a camera frustum. Then, the global feature extracted by the submanifold 3-D voxel CNNs is aggregated into frustum key points. Second, a module of vector pool with feature aggregation is used to aggregate multiscale features of the point cloud. Moreover, the frustum raw feature and BEV feature are used for feature extension. Subsequently, the fine multiscale feature extracted from the point cloud is used as input to a subsequent fully convolutional network for final classification and continuous estimation of oriented 3-D boxes. The proposed method was compared with other state-of-the-art algorithms on the KITTI, Waymo, and nuScenes data sets. Experimental results showed that the proposed algorithm was better in accuracy, robustness, and generalization capabilities in 3-D dynamic object detection. Experiments on a real scenario and extensive ablation studies also demonstrated that the proposed algorithm not only effectively controls computational cost but also achieved more efficient results in 3-D object detection.
引用
收藏
页码:8031 / 8045
页数:15
相关论文
共 50 条
  • [1] Accelerating Point-Voxel Representation of 3-D Object Detection for Automatic Driving
    Cao J.
    Tao C.
    Zhang Z.
    Gao Z.
    Luo X.
    Zheng S.
    Zhu Y.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (01): : 254 - 266
  • [2] PV-RCNN++: Point-Voxel Feature Set Abstraction With Local Vector Representation for 3D Object Detection
    Shaoshuai Shi
    Li Jiang
    Jiajun Deng
    Zhe Wang
    Chaoxu Guo
    Jianping Shi
    Xiaogang Wang
    Hongsheng Li
    International Journal of Computer Vision, 2023, 131 : 531 - 551
  • [3] Improved Point-Voxel Region Convolutional Neural Network: 3D Object Detectors for Autonomous Driving
    Li, Yujie
    Yang, Shuo
    Zheng, Yuchao
    Lu, Huimin
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (07) : 9311 - 9317
  • [4] PV-RCNN++: semantical point-voxel feature interaction for 3D object detection
    Peng Wu
    Lipeng Gu
    Xuefeng Yan
    Haoran Xie
    Fu Lee Wang
    Gary Cheng
    Mingqiang Wei
    The Visual Computer, 2023, 39 (6) : 2425 - 2440
  • [5] PV-RCNN plus plus : Point-Voxel Feature Set Abstraction With Local Vector Representation for 3D Object Detection
    Shi, Shaoshuai
    Jiang, Li
    Deng, Jiajun
    Wang, Zhe
    Guo, Chaoxu
    Shi, Jianping
    Wang, Xiaogang
    Li, Hongsheng
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (02) : 531 - 551
  • [6] HCPVF: Hierarchical Cascaded Point-Voxel Fusion for 3D Object Detection
    Fan, Baojie
    Zhang, Kexin
    Tian, Jiandong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 8997 - 9009
  • [7] PV-RCNN plus plus : semantical point-voxel feature interaction for 3D object detection
    Wu, Peng
    Gu, Lipeng
    Yan, Xuefeng
    Xie, Haoran
    Wang, Fu Lee
    Cheng, Gary
    Wei, Mingqiang
    VISUAL COMPUTER, 2023, 39 (06): : 2425 - 2440
  • [8] ASPVNet: Attention Based Sparse Point-Voxel Network for 3D Object Detection
    Yu, Bingxin
    Wang, Lu
    He, Yuhong
    Wang, Xiaoyang
    Cheng, Jun
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT X, 2025, 15040 : 161 - 176
  • [9] A symbolic representation for 3-D object feature detection
    Neal, PJ
    Shapiro, LG
    15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS: COMPUTER VISION AND IMAGE ANALYSIS, 2000, : 221 - 224
  • [10] Point-Voxel and Bird-Eye-View Representation Aggregation Network for Single Stage 3D Object Detection
    Ning, Kanglin
    Liu, Yanfei
    Su, Yanzhao
    Jiang, Ke
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (03) : 3223 - 3235