HybridPillars: Hybrid Point-Pillar Network for Real-Time Two-Stage 3-D Object Detection

被引:0
|
作者
Huang, Zhicong [1 ]
Huang, Yuxiao [1 ]
Zheng, Zhijie [1 ]
Hu, Haifeng [1 ]
Chen, Dihu [2 ]
机构
[1] Sun Yat Sen Univ, Sch Elect & Informat Technol, Guangzhou 510006, Peoples R China
[2] Sun Yat Sen Univ, Sch Integrated Circuits, Shenzhen 518000, Peoples R China
关键词
Three-dimensional displays; Feature extraction; Proposals; Point cloud compression; Object detection; Convolution; Accuracy; Representation learning; Real-time systems; Pipelines; 3-D object detection; LiDAR point clouds; real time; two-stage;
D O I
10.1109/JSEN.2024.3468646
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
LiDAR-based 3-D object detection is an important perceptual task in various fields such as intelligent transportation, autonomous driving, and robotics. Existing two-stage point-voxel methods contribute to the boost of accuracy on 3-D object detection by utilizing precise pointwise features to refine 3-D proposals. Although obtaining promising results, these methods are not suitable for real-time applications. First, the inference speed of existing point-voxel hybrid frameworks is slow because the acquisition of point features from voxel features consumes a lot of time. Second, existing point-voxel methods rely on 3-D convolution for voxel feature learning, which increases the difficulty of deployment on embedded computing platforms. To address these issues, we propose a real-time two-stage detection network, named HybridPillars. We first propose a novel hybrid framework by integrating a point feature encoder into a point-pillar pipeline efficiently. By combining point-based and pillar-based networks, our method can discard 3-D convolution to reduce computational complexity. Furthermore, we propose a novel pillar feature aggregation network to efficiently extract bird's eye view (BEV) features from pointwise features, thereby significantly enhancing the performance of our network. Extensive experiments demonstrate that our proposed HybridPillars not only boosts the inference speed, but also achieves competitive detection performance compared to other methods. The code will be available at https://github.com/huangzhicong3/HybridPillars.
引用
收藏
页码:38318 / 38328
页数:11
相关论文
共 50 条
  • [31] GridNet-3D: A Novel Real-Time 3D Object Detection Algorithm Based on Point Cloud
    Yue Yuanchen
    Cai Yunfei
    Wang Dongsheng
    CHINESE JOURNAL OF ELECTRONICS, 2021, 30 (05) : 931 - 939
  • [32] Real-time 3D Object Detection from Point Clouds using an RGB-D Camera
    Wang, Ya
    Xu, Shu
    Zell, Andreas
    ICPRAM: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2020, : 407 - 414
  • [33] Real-time Pedestrian Detection Based on A Hierarchical Two-Stage Support Vector Machine
    Min, Kyoungwon
    Son, Haengseon
    Choe, Yoonsik
    Kim, Yong-Goo
    PROCEEDINGS OF THE 2013 IEEE 8TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2013, : 114 - 119
  • [34] Modeling and Implementing Two-Stage AdaBoost for Real-Time Vehicle License Plate Detection
    Song, Moon Kyou
    Sarker, Md. Mostafa Kamal
    JOURNAL OF APPLIED MATHEMATICS, 2014,
  • [35] Real-time algorithm of 3-D shadow generation for point light sources
    Liu, Lieming
    Wu, Enhua
    Ruan Jian Xue Bao/Journal of Software, 2000, 11 (06): : 785 - 790
  • [36] Two-stage 3D object detection guided by position encoding q
    Xu, Wanpeng
    Zou, Ling
    Fu, Zhipeng
    Wu, Lingda
    Qi, Yue
    NEUROCOMPUTING, 2022, 501 : 811 - 821
  • [37] TWO-B-REAL NET: TWO-BRANCH NETWORK FOR REAL-TIME SALIENT OBJECT DETECTION
    Li, Bo
    Sun, Zhengxing
    Tang, Lv
    Hu, Anqi
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1662 - 1666
  • [38] PiSFANet: Pillar Scale-Aware Feature Aggregation Network for Real-Time 3D Pedestrian Detection
    Yan, Weiqing
    Liu, Shile
    Tang, Chang
    Zhou, Wujie
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 2000 - 2004
  • [39] Real-Time Object Detection Based on Improved YOLOv3 Network
    Sun Jia
    Guo Dabo
    Yang Tiantian
    Ma Shitu
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (22)
  • [40] Complexer-YOLO: Real-Time 3D Object Detection and Tracking on Semantic Point Clouds
    Simon, Martin
    Amende, Karl
    Kraus, Andrea
    Honer, Jens
    Saemann, Timo
    Kaulbersch, Hauke
    Milz, Stefan
    Gross, Horst Michael
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 1190 - 1199