High-order multilayer attention fusion network for 3D object detection

Cited by: 0
Authors
Zhang, Baowen [1]
Zhao, Yongyong [1]
Su, Chengzhi [1]
Cao, Guohua [1]
Affiliations
[1] Changchun Univ Sci & Technol, Sch Mech & Elect Engn, Changchun, Peoples R China
Keywords
attention feature fusion; high-order feature; 3D object detection; point cloud
DOI
10.1002/eng2.12987
Chinese Library Classification (CLC)
TP39 [Computer applications]
Subject classification codes
081203; 0835
Abstract
Three-dimensional object detection based on the fusion of 2D image data and 3D point clouds has become a research hotspot in 3D scene understanding. However, data from different sensors differ in spatial position, scale, and alignment, which severely degrades detection performance, and inappropriate fusion methods can lose or corrupt valuable information. We therefore propose the High-Order Multi-Level Attention Fusion Network (HMAF-Net), which takes camera images and voxelized point clouds as inputs for 3D object detection. To enhance the expressive power of the features across modalities, we introduce a high-order feature fusion module that performs multi-level convolution operations on the element-wise summed features. By incorporating filtering and non-linear activation, we extract deep semantic information from the fused multi-modal features. To make the most of the fused salient features, we introduce an attention mechanism that dynamically evaluates the importance of pooled features at each level, enabling adaptive weighted fusion of significant and secondary features. To validate HMAF-Net, we conduct experiments on the KITTI dataset. In the "Car," "Pedestrian," and "Cyclist" categories, HMAF-Net achieves mAP of 81.78%, 60.09%, and 63.91%, respectively, demonstrating more stable performance than other multi-modal methods. We further evaluate the framework's effectiveness and generalization capability on the KITTI benchmark test, comparing it with other published detection methods on the 3D detection and BEV detection benchmarks for the "Car" category, where it shows excellent results. The code and model will be made available on .
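The fusion idea described in the abstract (element-wise summation, multi-level convolution with non-linear activation, then attention-weighted fusion of pooled per-level features) can be illustrated with a minimal sketch. This is not the authors' implementation: 1D lists stand in for feature maps, fixed smoothing kernels stand in for learned convolution weights, and a softmax over global-average-pooled values stands in for the attention module. All function and parameter names below are hypothetical.

```python
import math


def relu(xs):
    """Non-linear activation applied element-wise."""
    return [max(0.0, x) for x in xs]


def conv1d_same(xs, kernel):
    """Zero-padded 1D cross-correlation; assumes an odd-length kernel."""
    k = len(kernel)
    pad = k // 2
    padded = [0.0] * pad + list(xs) + [0.0] * pad
    return [sum(kernel[j] * padded[i + j] for j in range(k))
            for i in range(len(xs))]


def softmax(scores):
    """Numerically stable softmax over a list of scalars."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def high_order_attention_fusion(image_feat, point_feat, kernels):
    """Toy fusion: sum modalities, stack conv levels, weight levels by attention.

    Assumption: image_feat and point_feat are already aligned and equal in length.
    """
    # Element-wise summation of the two modality features.
    fused = [a + b for a, b in zip(image_feat, point_feat)]

    # Multi-level convolution: each level filters the previous level's output.
    levels = []
    current = fused
    for kernel in kernels:
        current = relu(conv1d_same(current, kernel))
        levels.append(current)

    # Attention stand-in: score each level by its global average pool,
    # then normalize the scores into fusion weights.
    scores = [sum(level) / len(level) for level in levels]
    weights = softmax(scores)

    # Adaptive weighted fusion of the per-level features.
    out = [sum(w * level[i] for w, level in zip(weights, levels))
           for i in range(len(fused))]
    return out, weights


if __name__ == "__main__":
    image_feat = [1.0, 2.0, 3.0, 4.0]
    point_feat = [0.5, 0.5, 0.5, 0.5]
    kernels = [[0.25, 0.5, 0.25], [0.25, 0.5, 0.25]]
    fused_out, level_weights = high_order_attention_fusion(
        image_feat, point_feat, kernels)
    print(fused_out, level_weights)
```

In a real detector the attention scores would come from learned layers and the inputs would be multi-channel image and voxel feature maps; the sketch only shows how per-level weights that sum to one can blend features from several convolution depths.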
Pages: 14