HybridPillars: Hybrid Point-Pillar Network for Real-Time Two-Stage 3-D Object Detection

被引:0
|
作者
Huang, Zhicong [1 ]
Huang, Yuxiao [1 ]
Zheng, Zhijie [1 ]
Hu, Haifeng [1 ]
Chen, Dihu [2 ]
机构
[1] Sun Yat Sen Univ, Sch Elect & Informat Technol, Guangzhou 510006, Peoples R China
[2] Sun Yat Sen Univ, Sch Integrated Circuits, Shenzhen 518000, Peoples R China
关键词
Three-dimensional displays; Feature extraction; Proposals; Point cloud compression; Object detection; Convolution; Accuracy; Representation learning; Real-time systems; Pipelines; 3-D object detection; LiDAR point clouds; real time; two-stage;
D O I
10.1109/JSEN.2024.3468646
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
LiDAR-based 3-D object detection is an important perceptual task in various fields such as intelligent transportation, autonomous driving, and robotics. Existing two-stage point-voxel methods contribute to the boost of accuracy on 3-D object detection by utilizing precise pointwise features to refine 3-D proposals. Although obtaining promising results, these methods are not suitable for real-time applications. First, the inference speed of existing point-voxel hybrid frameworks is slow because the acquisition of point features from voxel features consumes a lot of time. Second, existing point-voxel methods rely on 3-D convolution for voxel feature learning, which increases the difficulty of deployment on embedded computing platforms. To address these issues, we propose a real-time two-stage detection network, named HybridPillars. We first propose a novel hybrid framework by integrating a point feature encoder into a point-pillar pipeline efficiently. By combining point-based and pillar-based networks, our method can discard 3-D convolution to reduce computational complexity. Furthermore, we propose a novel pillar feature aggregation network to efficiently extract bird's eye view (BEV) features from pointwise features, thereby significantly enhancing the performance of our network. Extensive experiments demonstrate that our proposed HybridPillars not only boosts the inference speed, but also achieves competitive detection performance compared to other methods. The code will be available at https://github.com/huangzhicong3/HybridPillars.
引用
收藏
页码:38318 / 38328
页数:11
相关论文
共 50 条
  • [21] Real-Time Moving Object Detection for 3-D LiDAR Using Occlusion Accumulation in Range Image
    Kim, Junha
    Kim, Haram
    Oh, Changsuk
    Kim, Changhyeon
    Kim, H. Jin
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
  • [22] Real-Time 3D Object Detection From Point Cloud Through Foreground Segmentation
    Wang, Bo
    Zhu, Ming
    Lu, Ying
    Wang, Jiarong
    Gao, Wen
    Wei, Hua
    IEEE ACCESS, 2021, 9 : 84886 - 84898
  • [23] A 3D Convolutional Neural Network Towards Real-time Amodal 3D Object Detection
    Sun, Hao
    Meng, Zehui
    Du, Xinxin
    Ang, Marcelo H., Jr.
    2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 8331 - 8338
  • [24] Real-Time Multimodal 3D Object Detection with Transformers
    Liu, Hengsong
    Duan, Tongle
    WORLD ELECTRIC VEHICLE JOURNAL, 2024, 15 (07):
  • [25] RANSAC Back to SOTA: A Two-Stage Consensus Filtering for Real-Time 3D Registration
    Shi, Pengcheng
    Yan, Shaocheng
    Xiao, Yilin
    Liu, Xinyi
    Zhang, Yongjun
    Li, Jiayuan
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (12): : 11881 - 11888
  • [26] Real-Time 3D Object Detection on Crowded Pedestrians
    Lu, Bin
    Li, Qing
    Liang, Yanju
    SENSORS, 2023, 23 (21)
  • [27] Real-time 3D Object Detection in Unstructured Environments
    Rui, Wang
    Ying, Liang
    PROCEEDINGS FIRST INTERNATIONAL CONFERENCE ON ELECTRONICS INSTRUMENTATION & INFORMATION SYSTEMS (EIIS 2017), 2017, : 183 - 188
  • [28] A two-stage fast stereo matching algorithm for real-time 3D coordinate computation
    Liu, Huizhou
    Shen, Bowen
    Zhang, Jiwang
    Huang, Zhong
    Huang, Mengxing
    MEASUREMENT, 2025, 247
  • [29] A Hybrid Two-Stage 3D Object Recognition from Orthogonal Projections
    Quang Tri Chiem
    Lech, Margaret
    Wilkinson, Richardt H.
    2019 13TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2019,
  • [30] GridNet-3D: A Novel Real-Time 3D Object Detection Algorithm Based on Point Cloud
    YUE Yuanchen
    CAI Yunfei
    WANG Dongsheng
    ChineseJournalofElectronics, 2021, 30 (05) : 931 - 939