HybridPillars: Hybrid Point-Pillar Network for Real-Time Two-Stage 3-D Object Detection

被引:0
|
作者
Huang, Zhicong [1 ]
Huang, Yuxiao [1 ]
Zheng, Zhijie [1 ]
Hu, Haifeng [1 ]
Chen, Dihu [2 ]
机构
[1] Sun Yat Sen Univ, Sch Elect & Informat Technol, Guangzhou 510006, Peoples R China
[2] Sun Yat Sen Univ, Sch Integrated Circuits, Shenzhen 518000, Peoples R China
关键词
Three-dimensional displays; Feature extraction; Proposals; Point cloud compression; Object detection; Convolution; Accuracy; Representation learning; Real-time systems; Pipelines; 3-D object detection; LiDAR point clouds; real time; two-stage;
D O I
10.1109/JSEN.2024.3468646
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
LiDAR-based 3-D object detection is an important perceptual task in various fields such as intelligent transportation, autonomous driving, and robotics. Existing two-stage point-voxel methods contribute to the boost of accuracy on 3-D object detection by utilizing precise pointwise features to refine 3-D proposals. Although obtaining promising results, these methods are not suitable for real-time applications. First, the inference speed of existing point-voxel hybrid frameworks is slow because the acquisition of point features from voxel features consumes a lot of time. Second, existing point-voxel methods rely on 3-D convolution for voxel feature learning, which increases the difficulty of deployment on embedded computing platforms. To address these issues, we propose a real-time two-stage detection network, named HybridPillars. We first propose a novel hybrid framework by integrating a point feature encoder into a point-pillar pipeline efficiently. By combining point-based and pillar-based networks, our method can discard 3-D convolution to reduce computational complexity. Furthermore, we propose a novel pillar feature aggregation network to efficiently extract bird's eye view (BEV) features from pointwise features, thereby significantly enhancing the performance of our network. Extensive experiments demonstrate that our proposed HybridPillars not only boosts the inference speed, but also achieves competitive detection performance compared to other methods. The code will be available at https://github.com/huangzhicong3/HybridPillars.
引用
收藏
页码:38318 / 38328
页数:11
相关论文
共 50 条
  • [1] HPV-RCNN: Hybrid Point-Voxel Two-Stage Network for LiDAR Based 3-D Object Detection
    Feng, Chen
    Xiang, Chao
    Xie, Xiaopo
    Zhang, Yuan
    Yang, Mingchuan
    Li, Xuesong
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2023, 10 (06) : 3066 - 3076
  • [2] A Two-Stage Pillar Feature-Encoding Network for Pillar-Based 3D Object Detection
    Xu, Hao
    Dong, Xiang
    Wu, Wenxuan
    Yu, Biao
    Zhu, Hui
    WORLD ELECTRIC VEHICLE JOURNAL, 2023, 14 (06):
  • [3] Toward High-Accuracy and Real-Time Two-Stage Small Object Detection on FPGA
    Li, Shiyao
    Zhu, Zhenhua
    Sun, Hanbo
    Ning, Xuefei
    Dai, Guohao
    Hu, Yiming
    Yang, Huazhong
    Wang, Yu
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (09) : 8053 - 8066
  • [4] Position-Aware Voxel Aggregate Network for Two-Stage 3-D Object Detector
    Xu, Wencai
    Hu, Jie
    An, Yongpeng
    Chen, Ruinan
    Chang, Minjie
    Xie, Lihao
    IEEE SENSORS JOURNAL, 2023, 23 (16) : 18867 - 18878
  • [5] Two-stage real-time hybrid testing method for isolated structures
    Tang Z.
    Liu H.
    Li Y.
    Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology, 2023, 55 (09): : 27 - 33
  • [6] PIXOR: Real-time 3D Object Detection from Point Clouds
    Yang, Bin
    Luo, Wenjie
    Urtasun, Raquel
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7652 - 7660
  • [7] A Real-time Driver Fatigue Detection Method Based on Two-Stage Convolutional Neural Network
    He, Hu
    Zhang, Xiaoyong
    Jiang, Fu
    Wang, Chenglong
    Yang, Yingze
    Liu, Weirong
    Peng, Jun
    IFAC PAPERSONLINE, 2020, 53 (02): : 15374 - 15379
  • [8] Equal Emphasis on Data and Network: A Two-Stage 3D Point Cloud Object Detection Algorithm with Feature Alignment
    Xiao, Kai
    Li, Teng
    Li, Jun
    Huang, Da
    Peng, Yuanxi
    REMOTE SENSING, 2024, 16 (02)
  • [9] PillarNet: Real-Time and High-Performance Pillar-Based 3D Object Detection
    Shi, Guangsheng
    Li, Ruifeng
    Ma, Chao
    COMPUTER VISION, ECCV 2022, PT X, 2022, 13670 : 35 - 52
  • [10] Accurate and Real-Time 3D Pedestrian Detection Using an Efficient Attentive Pillar Network
    Le, Duy Tho
    Shi, Hengcan
    Rezatofighi, Hamid
    Cai, Jianfei
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (02) : 1159 - 1166