HybridPillars: Hybrid Point-Pillar Network for Real-Time Two-Stage 3-D Object Detection

被引:0
|
作者
Huang, Zhicong [1 ]
Huang, Yuxiao [1 ]
Zheng, Zhijie [1 ]
Hu, Haifeng [1 ]
Chen, Dihu [2 ]
机构
[1] Sun Yat Sen Univ, Sch Elect & Informat Technol, Guangzhou 510006, Peoples R China
[2] Sun Yat Sen Univ, Sch Integrated Circuits, Shenzhen 518000, Peoples R China
关键词
Three-dimensional displays; Feature extraction; Proposals; Point cloud compression; Object detection; Convolution; Accuracy; Representation learning; Real-time systems; Pipelines; 3-D object detection; LiDAR point clouds; real time; two-stage;
D O I
10.1109/JSEN.2024.3468646
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
LiDAR-based 3-D object detection is an important perceptual task in various fields such as intelligent transportation, autonomous driving, and robotics. Existing two-stage point-voxel methods contribute to the boost of accuracy on 3-D object detection by utilizing precise pointwise features to refine 3-D proposals. Although obtaining promising results, these methods are not suitable for real-time applications. First, the inference speed of existing point-voxel hybrid frameworks is slow because the acquisition of point features from voxel features consumes a lot of time. Second, existing point-voxel methods rely on 3-D convolution for voxel feature learning, which increases the difficulty of deployment on embedded computing platforms. To address these issues, we propose a real-time two-stage detection network, named HybridPillars. We first propose a novel hybrid framework by integrating a point feature encoder into a point-pillar pipeline efficiently. By combining point-based and pillar-based networks, our method can discard 3-D convolution to reduce computational complexity. Furthermore, we propose a novel pillar feature aggregation network to efficiently extract bird's eye view (BEV) features from pointwise features, thereby significantly enhancing the performance of our network. Extensive experiments demonstrate that our proposed HybridPillars not only boosts the inference speed, but also achieves competitive detection performance compared to other methods. The code will be available at https://github.com/huangzhicong3/HybridPillars.
引用
收藏
页码:38318 / 38328
页数:11
相关论文
共 50 条
  • [41] Real-time Hierarchical Soft Attention-based 3D Object Detection in Point Clouds
    Chen, Qiuxiao
    Qi, Xiaojun
    Song, Ziqi
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 2928 - 2934
  • [42] Point Transformer-Based Salient Object Detection Network for 3-D Measurement Point Clouds
    Wei, Zeyong
    Chen, Baian
    Wang, Weiming
    Chen, Honghua
    Wei, Mingqiang
    Li, Jonathan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 11
  • [43] MonoAMNet: Three-Stage Real-Time Monocular 3D Object Detection With Adaptive Methods
    Pan, Huihui
    Jia, Yisong
    Wang, Jue
    Sun, Weichao
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025, 26 (03) : 3574 - 3587
  • [44] TemporalNet: Real-time 2D-3D Video Object Detection
    Chen, Meihong
    Lang, Jochen
    2022 19TH CONFERENCE ON ROBOTS AND VISION (CRV 2022), 2022, : 205 - 212
  • [45] D3D: Dual 3-D Convolutional Network for Real-Time Action Recognition
    Jiang, Shengqin
    Qi, Yuankai
    Zhang, Haokui
    Bai, Zongwen
    Lu, Xiaobo
    Wang, Peng
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (07) : 4584 - 4593
  • [46] Test Verification of Two-Stage Adaptive Delay Compensation Method for Real-Time Hybrid Simulation
    Wang, Zhen
    Yan, Xueqi
    Ning, Xizhan
    Wu, Bin
    SHOCK AND VIBRATION, 2020, 2020
  • [47] OccTr: A Two-Stage BEV Fusion Network for Temporal Object Detection
    Fu, Qifang
    Yu, Xinyi
    Ou, Linlin
    ELECTRONICS, 2024, 13 (13)
  • [48] Optimizing resource speed for two-stage real-time tasks
    Melani, Alessandra
    Mancuso, Renato
    Cullina, Daniel
    Caccamo, Marco
    Thiele, Lothar
    REAL-TIME SYSTEMS, 2017, 53 (01) : 82 - 120
  • [49] Two-Stage Underwater Object Detection Network Using Swin Transformer
    Liu, Jia
    Liu, Shuang
    Xu, Shujuan
    Zhou, Changjun
    IEEE ACCESS, 2022, 10 : 117235 - 117247
  • [50] A Two-Stage Framework for Real-Time Guidewire Endpoint Localization
    Li, Rui-Qi
    Bian, Guibin
    Zhou, Xiaohu
    Xie, Xiaoliang
    Ni, ZhenLiang
    Hou, Zengguang
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT V, 2019, 11768 : 357 - 365