SASAN: Shape-Adaptive Set Abstraction Network for Point-Voxel 3D Object Detection

被引:6
|
作者
Zhang, Hui [1 ,2 ]
Luo, Guiyang [3 ]
Wang, Xiao [4 ]
Li, Yidong [1 ,2 ]
Ding, Weiping [5 ]
Wang, Fei-Yue [6 ]
机构
[1] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing 100044, Peoples R China
[2] Beijing Jiaotong Univ, Key Lab Big Data & Artificial Intelligence Transp, Minist Educ, Beijing 100044, Peoples R China
[3] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China
[4] Anhui Univ, Engn Res Ctr Autonomous Unmanned Syst Technol, Minist Educ, Hefei 230031, Peoples R China
[5] Nantong Univ, Sch Informat Sci & Technol, Nantong 226019, Peoples R China
[6] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Three-dimensional displays; Detectors; Object detection; Proposals; Point cloud compression; Shape; 3D object detection; autonomous driving; point-voxel detectors; CNN;
D O I
10.1109/TNNLS.2023.3339889
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Point-voxel 3D object detectors have achieved impressive performance in complex traffic scenes. However, they utilize the 3D sparse convolution (spconv) layers with fixed receptive fields, such as voxel-based detectors, and inherit the fixed sphere radius from point-based methods for generating the features of keypoints, which make them weak in adaptively modeling various geometrical deformations and sizes of real objects. To tackle this issue, we propose a shape-adaptive set abstraction network (SASAN) for point-voxel 3D object detection. First, the proposal and offset generation module is adopted to learn the coordinates and confidences of 3D proposals and shape-adaptive offsets of the certain number of offset points for each voxel. Meanwhile, an extra offset supervision task is employed to guide the learning of shifting values of offset points, aiming at motivating the predicted offsets to preferably adapt to the various shapes of objects. Then, the shape-adaptive set abstraction module is proposed to extract multiscale keypoints features by grouping the neighboring offset points' features, as well as features learned from adjacent raw points and the 2-D bird-view map. Finally, the region of interest (RoI)-grid proposal refinement module is used to aggregate the keypoints features for further proposal refinement and confidence prediction. Extensive experiments on the competitive KITTI 3D detection benchmark demonstrate that the proposed SASAN gains superior performance as compared with state-of-the-art methods.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 50 条
  • [31] PVTransformer: Point-to-Voxel Transformer for Scalable 3D Object Detection
    Leng, Zhaoqi
    Sun, Pei
    He, Tong
    Anguelov, Dragomir
    Tan, Mingxing
    2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2024, 2024, : 4238 - 4244
  • [32] From Voxel to Point: IoU-guided 3D Object Detection for Point Cloud with Voxel-to-Point Decoder
    Li, Jiale
    Dai, Hang
    Shao, Ling
    Ding, Yong
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4622 - 4631
  • [33] SAF-Nets: Shape-Adaptive Filter Networks for 3D point cloud processing*
    Lee, Seon-Ho
    Kim, Chang-Su
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 79
  • [34] HVNet: Hybrid Voxel Network for LiDAR Based 3D Object Detection
    Ye, Maosheng
    Xu, Shuangjie
    Cao, Tongyi
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 1628 - 1637
  • [35] DO-SA&R: Distant Object Augmented Set Abstraction and Regression for Point-Based 3D Object Detection
    He, Xuan
    Wang, Zian
    Lin, Jiacheng
    Nai, Ke
    Yuan, Jin
    Li, Zhiyong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 5852 - 5864
  • [36] Voxel-to-Pillar: Knowledge Distillation of 3D Object Detection in Point Cloud
    Zhang, Jinbao
    Liu, Jun
    PROCEEDINGS OF THE 4TH EUROPEAN SYMPOSIUM ON SOFTWARE ENGINEERING, ESSE 2023, 2024, : 99 - 104
  • [37] Dense Voxel Fusion for 3D Object Detection
    Mahmoud, Anas
    Hu, Jordan S. K.
    Waslander, Steven L.
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 663 - 672
  • [38] Voxel Field Fusion for 3D Object Detection
    Li, Yanwei
    Qi, Xiaojuan
    Chen, Yukang
    Wang, Liwei
    Li, Zeming
    Sun, Jian
    Jia, Jiaya
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 1110 - 1119
  • [39] Radar Voxel Fusion for 3D Object Detection
    Nobis, Felix
    Shafiei, Ehsan
    Karle, Phillip
    Betz, Johannes
    Lienkamp, Markus
    APPLIED SCIENCES-BASEL, 2021, 11 (12):
  • [40] F-PVNet: Frustum-Level 3-D Object Detection on Point-Voxel Feature Representation for Autonomous Driving
    Tao, Chongben
    Fu, Shiping
    Wang, Chen
    Luo, Xizhao
    Li, Huayi
    Gao, Zhen
    Zhang, Zufeng
    Zheng, Sifa
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (09) : 8031 - 8045