HybridPillars: Hybrid Point-Pillar Network for Real-Time Two-Stage 3-D Object Detection

被引：0

作者：

Huang, Zhicong ^{[1
]}

Huang, Yuxiao ^{[1
]}

Zheng, Zhijie ^{[1
]}

Hu, Haifeng ^{[1
]}

Chen, Dihu ^{[2
]}

机构：

[1] Sun Yat Sen Univ, Sch Elect & Informat Technol, Guangzhou 510006, Peoples R China

[2] Sun Yat Sen Univ, Sch Integrated Circuits, Shenzhen 518000, Peoples R China

来源：

IEEE SENSORS JOURNAL | 2024年 / 24卷 / 22期

关键词：

Three-dimensional displays; Feature extraction; Proposals; Point cloud compression; Object detection; Convolution; Accuracy; Representation learning; Real-time systems; Pipelines; 3-D object detection; LiDAR point clouds; real time; two-stage;

D O I：

10.1109/JSEN.2024.3468646

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

LiDAR-based 3-D object detection is an important perceptual task in various fields such as intelligent transportation, autonomous driving, and robotics. Existing two-stage point-voxel methods contribute to the boost of accuracy on 3-D object detection by utilizing precise pointwise features to refine 3-D proposals. Although obtaining promising results, these methods are not suitable for real-time applications. First, the inference speed of existing point-voxel hybrid frameworks is slow because the acquisition of point features from voxel features consumes a lot of time. Second, existing point-voxel methods rely on 3-D convolution for voxel feature learning, which increases the difficulty of deployment on embedded computing platforms. To address these issues, we propose a real-time two-stage detection network, named HybridPillars. We first propose a novel hybrid framework by integrating a point feature encoder into a point-pillar pipeline efficiently. By combining point-based and pillar-based networks, our method can discard 3-D convolution to reduce computational complexity. Furthermore, we propose a novel pillar feature aggregation network to efficiently extract bird's eye view (BEV) features from pointwise features, thereby significantly enhancing the performance of our network. Extensive experiments demonstrate that our proposed HybridPillars not only boosts the inference speed, but also achieves competitive detection performance compared to other methods. The code will be available at https://github.com/huangzhicong3/HybridPillars.

引用

页码：38318 / 38328

页数：11

共 50 条

[31] GridNet-3D: A Novel Real-Time 3D Object Detection Algorithm Based on Point Cloud
Yue Yuanchen
Cai Yunfei
Wang Dongsheng
CHINESE JOURNAL OF ELECTRONICS, 2021, 30 (05) : 931 - 939
[32] Real-time 3D Object Detection from Point Clouds using an RGB-D Camera
Wang, Ya
Xu, Shu
Zell, Andreas
ICPRAM: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2020, : 407 - 414
[33] Real-time Pedestrian Detection Based on A Hierarchical Two-Stage Support Vector Machine
Min, Kyoungwon
Son, Haengseon
Choe, Yoonsik
Kim, Yong-Goo
PROCEEDINGS OF THE 2013 IEEE 8TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2013, : 114 - 119
[34] Modeling and Implementing Two-Stage AdaBoost for Real-Time Vehicle License Plate Detection
Song, Moon Kyou
Sarker, Md. Mostafa Kamal
JOURNAL OF APPLIED MATHEMATICS, 2014,
[35] Real-time algorithm of 3-D shadow generation for point light sources
Liu, Lieming
Wu, Enhua
Ruan Jian Xue Bao/Journal of Software, 2000, 11 (06): : 785 - 790
[36] Two-stage 3D object detection guided by position encoding q
Xu, Wanpeng
Zou, Ling
Fu, Zhipeng
Wu, Lingda
Qi, Yue
NEUROCOMPUTING, 2022, 501 : 811 - 821
[37] TWO-B-REAL NET: TWO-BRANCH NETWORK FOR REAL-TIME SALIENT OBJECT DETECTION
Li, Bo
Sun, Zhengxing
Tang, Lv
Hu, Anqi
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1662 - 1666
[38] PiSFANet: Pillar Scale-Aware Feature Aggregation Network for Real-Time 3D Pedestrian Detection
Yan, Weiqing
Liu, Shile
Tang, Chang
Zhou, Wujie
IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 2000 - 2004
[39] Real-Time Object Detection Based on Improved YOLOv3 Network
Sun Jia
Guo Dabo
Yang Tiantian
Ma Shitu
LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (22)
[40] Complexer-YOLO: Real-Time 3D Object Detection and Tracking on Semantic Point Clouds
Simon, Martin
Amende, Karl
Kraus, Andrea
Honer, Jens
Saemann, Timo
Kaulbersch, Hauke
Milz, Stefan
Gross, Horst Michael
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 1190 - 1199

← 1 2 3 4 5 →