PiSFANet: Pillar Scale-Aware Feature Aggregation Network for Real-Time 3D Pedestrian Detection

被引:0
|
作者
Yan, Weiqing [1 ]
Liu, Shile [1 ]
Tang, Chang [2 ]
Zhou, Wujie [3 ]
机构
[1] Yantai Univ, Sch Comp & Control Engn, Yantai 261400, Peoples R China
[2] China Univ Geosci, Sch Comp Sci, Wuhan 430074, Peoples R China
[3] Zhejiang Univ Sci & Technol, Sch Informat & Elect Engn, Zhejiang 310023, Peoples R China
关键词
Feature extraction; Pedestrians; Three-dimensional displays; Point cloud compression; Encoding; Real-time systems; Object detection; 3D object detection; real-time; scale-aware; pillar-based;
D O I
10.1109/LSP.2024.3426294
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Detecting 3D pedestrian from point cloud data in real-time while accounting for scale is crucial in various robotic and autonomous driving applications. Currently, the most successful methods for 3D object detection rely on voxel-based techniques, but these tend to be computationally inefficient for deployment in aerial scenarios. Conversely, the pillar-based approach exclusively employs 2D convolution, requiring fewer computational resources, albeit potentially sacrificing detection accuracy compared to voxel-based methods. Previous pillar-based approaches suffered from inadequate pillar feature encoding. In this letter, we introduce a real-time and scale-aware 3D Pedestrian Detection, which incorporates a robust encoder network designed for effective pillar feature extraction. The Proposed TriFocus Attention module (TriFA), which integrates external attention and similar attention strategies based on Squeeze and Exception. By comprehensively supervising the point-wise, channel-wise, and pillar-wise of pillar features, it enhances the encoding ability of pillars, suppresses noise in pillar features, and enhances the expression ability of pillar features. The proposed Bidirectional Scale-Aware Feature Pyramid module (BiSAFP) integrates a scale-aware module into the multi-scale pyramid structure. This addition enhances its ability to perceive pedestrian within low-level features. Moreover, it ensures that the significance of feature maps across various feature levels is fully taken into account. BiSAFP represents a lightweight multi-scale pyramid network that minimally impacts inference time while substantially boosting network performance. Our approach achieves real-time detection, processing up to 30 frames per second (FPS).
引用
收藏
页码:2000 / 2004
页数:5
相关论文
共 50 条
  • [21] Building Feature Pool Effectively for Real-Time Pedestrian Detection
    Yang, Jijun
    Ma, Yingdong
    Zhang, Zhibin
    3RD ANNUAL INTERNATIONAL CONFERENCE ON INFORMATION SYSTEM AND ARTIFICIAL INTELLIGENCE (ISAI2018), 2018, 1069
  • [22] Feature Synthesization for Real-Time Pedestrian Detection in Urban Environment
    Fang, Wenhua
    Chen, Jun
    Lu, Tao
    Hu, Ruimin
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2018, PT II, 2018, 11165 : 102 - 112
  • [23] Aggregated Channels Network for Real-Time Pedestrian Detection
    Ghorban, Farzin
    Marin, Javier
    Su, Yu
    Colombo, Alessandro
    Kummert, Anton
    TENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2017), 2018, 10696
  • [24] A 3D Convolutional Neural Network Towards Real-time Amodal 3D Object Detection
    Sun, Hao
    Meng, Zehui
    Du, Xinxin
    Ang, Marcelo H., Jr.
    2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 8331 - 8338
  • [25] Local multi-scale feature aggregation network for real-time image dehazing
    Liu, Yong
    Hou, Xiaorong
    PATTERN RECOGNITION, 2023, 141
  • [26] VTD-FCENet: A Real-Time HD Video Text Detection with Scale-Aware Fourier Contour Embedding
    Xiao, Wocheng
    Liang, Lingyu
    Chen, Jianyong
    Wang, Tao
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2024, E107D (04) : 574 - 578
  • [27] Multi-Scale Feature Fusion Lightweight Real-Time Infrared Pedestrian Detection at Night
    He Z.
    Chen G.
    Chen J.
    Zhang Y.
    Zhongguo Jiguang/Chinese Journal of Lasers, 2022, 49 (17):
  • [28] Scale-aware Black-and-White Abstraction of 3D Shapes
    Lin, You-En
    Yang, Yong-Liang
    Chu, Hung-Kuo
    ACM TRANSACTIONS ON GRAPHICS, 2018, 37 (04):
  • [29] RFDNet: Real-Time 3D Object Detection via Range Feature Decoration
    Chang, Hongda
    Wang, Lu
    Cheng, Jun
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 5715 - 5721
  • [30] Adaptive context- and scale-aware aggregation with feature alignment for one-shot object detection
    Zhang, Wenwen
    Dong, Chengdong
    Zhang, Jun
    Shan, Hangguan
    Liu, Eryun
    NEUROCOMPUTING, 2022, 514 : 216 - 230