PiSFANet: Pillar Scale-Aware Feature Aggregation Network for Real-Time 3D Pedestrian Detection

被引:0
|
作者
Yan, Weiqing [1 ]
Liu, Shile [1 ]
Tang, Chang [2 ]
Zhou, Wujie [3 ]
机构
[1] Yantai Univ, Sch Comp & Control Engn, Yantai 261400, Peoples R China
[2] China Univ Geosci, Sch Comp Sci, Wuhan 430074, Peoples R China
[3] Zhejiang Univ Sci & Technol, Sch Informat & Elect Engn, Zhejiang 310023, Peoples R China
关键词
Feature extraction; Pedestrians; Three-dimensional displays; Point cloud compression; Encoding; Real-time systems; Object detection; 3D object detection; real-time; scale-aware; pillar-based;
D O I
10.1109/LSP.2024.3426294
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Detecting 3D pedestrian from point cloud data in real-time while accounting for scale is crucial in various robotic and autonomous driving applications. Currently, the most successful methods for 3D object detection rely on voxel-based techniques, but these tend to be computationally inefficient for deployment in aerial scenarios. Conversely, the pillar-based approach exclusively employs 2D convolution, requiring fewer computational resources, albeit potentially sacrificing detection accuracy compared to voxel-based methods. Previous pillar-based approaches suffered from inadequate pillar feature encoding. In this letter, we introduce a real-time and scale-aware 3D Pedestrian Detection, which incorporates a robust encoder network designed for effective pillar feature extraction. The Proposed TriFocus Attention module (TriFA), which integrates external attention and similar attention strategies based on Squeeze and Exception. By comprehensively supervising the point-wise, channel-wise, and pillar-wise of pillar features, it enhances the encoding ability of pillars, suppresses noise in pillar features, and enhances the expression ability of pillar features. The proposed Bidirectional Scale-Aware Feature Pyramid module (BiSAFP) integrates a scale-aware module into the multi-scale pyramid structure. This addition enhances its ability to perceive pedestrian within low-level features. Moreover, it ensures that the significance of feature maps across various feature levels is fully taken into account. BiSAFP represents a lightweight multi-scale pyramid network that minimally impacts inference time while substantially boosting network performance. Our approach achieves real-time detection, processing up to 30 frames per second (FPS).
引用
收藏
页码:2000 / 2004
页数:5
相关论文
共 50 条
  • [41] HOG Feature Extractor Hardware Accelerator for Real-time Pedestrian Detection
    Hemmati, Maryam
    Biglari-Abhari, Morteza
    Berber, Stevan
    Niar, Smail
    2014 17TH EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN (DSD), 2014, : 543 - 550
  • [42] A Real-Time Pedestrian Detection Method Based on Improved Gated Context Aggregation Network in Foggy Weather
    Wu T.
    Wang Y.
    Guan Y.
    Tian S.
    Qiche Gongcheng/Automotive Engineering, 2023, 45 (05): : 796 - 806
  • [43] Real-time 3D
    Coco, D
    COMPUTER GRAPHICS WORLD, 1995, 18 (12) : 22 - +
  • [44] High-accuracy, real-time pedestrian detection system using 2D and 3D features
    Chambers, David R.
    Flannigan, Clay
    Wheeler, Benjamin
    THREE-DIMENSIONAL IMAGING, VISUALIZATION, AND DISPLAY 2012, 2012, 8384
  • [45] Monomm: a multi-scale mamba-enhanced network for real-time monocular 3D object detection
    Fu, Youjia
    Xu, Zihao
    Fu, Junsong
    Xue, Huixia
    Tan, Shuqiu
    Li, Lei
    Qing, Shaoxun
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (03):
  • [46] Spatial-aware stacked regression network for real-time 3D hand pose estimation
    Ren, Pengfei
    Sun, Haifeng
    Huang, Weiting
    Hao, Jiachang
    Cheng, Daixuan
    Qi, Qi
    Wang, Jingyu
    Liao, Jianxin
    NEUROCOMPUTING, 2021, 437 : 42 - 57
  • [47] Real-time Detection and Tracking Network with Feature Sharing
    Guo, Ente
    Chen, Zhifeng
    Fan, Zhenjia
    Yang, Xiujun
    2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 519 - 522
  • [48] Fully automated 3D boundary detection in real-time 3D echocardiography
    Takuma, S
    Angelini, ED
    Yoshiara, K
    Liu, R
    Kazanowski, M
    Dimayuga, C
    Makita, K
    Di Tullio, MR
    Holmes, JW
    Laine, AF
    Homma, S
    JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2000, 35 (02) : 469A - 470A
  • [49] Scale-Aware Attention-Based PillarsNet (SAPN) Based 3D Object Detection for Point Cloud
    Song, Xiang
    Zhan, Weiqin
    Che, Xiaoyu
    Jiang, Huilin
    Yang, Biao
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
  • [50] A Two-Stage Pillar Feature-Encoding Network for Pillar-Based 3D Object Detection
    Xu, Hao
    Dong, Xiang
    Wu, Wenxuan
    Yu, Biao
    Zhu, Hui
    WORLD ELECTRIC VEHICLE JOURNAL, 2023, 14 (06):