High-order multilayer attention fusion network for 3D object detection

被引:0
|
作者
Zhang, Baowen [1 ]
Zhao, Yongyong [1 ]
Su, Chengzhi [1 ]
Cao, Guohua [1 ]
机构
[1] Changchun Univ Sci & Technol, Sch Mech & Elect Engn, Changchun, Peoples R China
关键词
attention feature fusion; high-order feature; 3D object detection; point cloud;
D O I
10.1002/eng2.12987
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Three-dimensional object detection based on the fusion of 2D image data and 3D point clouds has become a research hotspot in the field of 3D scene understanding. However, different sensor data have discrepancies in spatial position, scale, and alignment, which severely impact detection performance. Inappropriate fusion methods can lead to the loss and interference of valuable information. Therefore, we propose the High-Order Multi-Level Attention Fusion Network (HMAF-Net), which takes camera images and voxelized point clouds as inputs for 3D object detection. To enhance the expressive power between different modality features, we introduce a high-order feature fusion module that performs multi-level convolution operations on the element-wise summed features. By incorporating filtering and non-linear activation, we extract deep semantic information from the fused multi-modal features. To maximize the effectiveness of the fused salient feature information, we introduce an attention mechanism that dynamically evaluates the importance of pooled features at each level, enabling adaptive weighted fusion of significant and secondary features. To validate the effectiveness of HMAF-Net, we conduct experiments on the KITTI dataset. In the "Car," "Pedestrian," and "Cyclist" categories, HMAF-Net achieves mAP performances of 81.78%, 60.09%, and 63.91%, respectively, demonstrating more stable performance compared to other multi-modal methods. Furthermore, we further evaluate the framework's effectiveness and generalization capability through the KITTI benchmark test, and compare its performance with other published detection methods on the 3D detection benchmark and BEV detection benchmark for the "Car" category, showing excellent results. The code and model will be made available on .
引用
收藏
页数:14
相关论文
共 50 条
  • [21] 3D Object Detection Based on Attention and Multi-Scale Feature Fusion
    Liu, Minghui
    Ma, Jinming
    Zheng, Qiuping
    Liu, Yuchen
    Shi, Gang
    SENSORS, 2022, 22 (10)
  • [22] 3D Object Detection with Fusion Point Attention Mechanism in LiDAR Point Cloud
    Liu Weili
    Zhu Deli
    Luo Huahao
    Li Yi
    ACTA PHOTONICA SINICA, 2023, 52 (09)
  • [23] High-order graph attention network
    He, Liancheng
    Bai, Liang
    Yang, Xian
    Du, Hangyuan
    Liang, Jiye
    INFORMATION SCIENCES, 2023, 630 : 222 - 234
  • [24] MF-Net: Meta Fusion Network for 3D object detection
    Meng, Zhaoxin
    Luo, Guiyang
    Yuan, Quan
    Li, Jinglin
    Yang, Fangchun
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [25] Multimodal fusion via voting network for 3D object detection in indoors
    Li, Jianxin
    Si, Guannan
    Liang, Xinyu
    An, Zhaoliang
    Tian, Pengxin
    Zhou, Fengyu
    Wang, Xiaoliang
    PATTERN RECOGNITION, 2025, 164
  • [26] Cascaded Cross-Modality Fusion Network for 3D Object Detection
    Chen, Zhiyu
    Lin, Qiong
    Sun, Jing
    Feng, Yujian
    Liu, Shangdong
    Liu, Qiang
    Ji, Yimu
    Xu, He
    SENSORS, 2020, 20 (24) : 1 - 14
  • [27] FusionPillars: A 3D Object Detection Network with Cross-Fusion and Self-Fusion
    Zhang, Jing
    Xu, Da
    Li, Yunsong
    Zhao, Liping
    Su, Rui
    REMOTE SENSING, 2023, 15 (10)
  • [28] 3D-MAN: 3D Multi-frame Attention Network for Object Detection
    Yang, Zetong
    Zhou, Yin
    Chen, Zhifeng
    Ngiam, Jiquan
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 1863 - 1872
  • [29] CDAF3D: Cross-Dimensional Attention Fusion for Indoor 3D Object Detection
    Wang, Shilin
    Huang, Hai
    Zhu, Yueyan
    Tang, Zhenqi
    PATTERN RECOGNITION AND COMPUTER VISION, PT XIII, PRCV 2024, 2025, 15043 : 165 - 177
  • [30] 3D MSSD: A multilayer spatial structure 3D object detection network for mobile LiDAR point clouds
    Wang, Zongyue
    Xia, Qiming
    Du, Jing
    Huang, Shangfeng
    Su, Jinhe
    Marcato Junior, Jose
    Li, Jonathan
    Cai, Guorong
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2021, 102