DMFF: dual-way multimodal feature fusion for 3D object detection

被引:0
|
作者
Dong, Xiaopeng [1 ]
Di, Xiaoguang [1 ]
Wang, Wenzhuang [1 ]
机构
[1] Harbin Inst Technol, Control & Simulat Ctr, Harbin, Peoples R China
基金
黑龙江省自然科学基金;
关键词
3D object detection; Multimodal feature fusion; Self-attention mechanism; Lidar point clouds; RGB images;
D O I
10.1007/s11760-023-02772-z
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recently, multimodal 3D object detection that fuses the complementary information from LiDAR data and RGB images has been an active research topic. However, it is not trivial to fuse images and point clouds because of different representations of them. Inadequate feature fusion also brings bad effects on detection performance. We convert images into pseudo point clouds by using a depth completion and utilize a more efficient feature fusion method to address the problems. In this paper, we propose a dual-way multimodal feature fusion network (DMFF) for 3D object detection. Specifically, we first use a dual stream feature extraction module (DSFE) to generate homogeneous LiDAR and pseudo region of interest (RoI) features. Then, we propose a dual-way feature interaction method (DWFI) that enables intermodal and intramodal interaction of the two features. Next, we design a local attention feature fusion module (LAFF) to select which features of the input are more likely to contribute to the desired output. In addition, the proposed DMFF achieves the state-of-the-art performances on the KITTI Dataset.
引用
收藏
页码:455 / 463
页数:9
相关论文
共 50 条
  • [31] MFFNet: Multimodal feature fusion network for RGB-D transparent object detection
    Zhu, Li
    Li, Tuanjie
    Ning, Yuming
    Zhang, Yan
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2024, 21 (05):
  • [32] DMFF: Deep multimodel feature fusion for building occupancy detection
    Sun, Kailai
    BUILDING AND ENVIRONMENT, 2024, 253
  • [33] LiDAR-camera fusion: Dual transformer enhancement for 3D object detection
    Chen, Mu
    Liu, Pengfei
    Zhao, Huaici
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 120
  • [34] Channelwise and Spatially Guided Multimodal Feature Fusion Network for 3-D Object Detection in Autonomous Vehicles
    Uzair, Muhammad
    Dong, Jian
    Shi, Ronghua
    Mushtaq, Husnain
    Ullah, Irshad
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [35] A multilevel fusion network for 3D object detection
    Xia, Chunlong
    Wei, Ping
    Wei, Wenwen
    Zheng, Nanning
    NEUROCOMPUTING, 2021, 437 : 107 - 117
  • [36] Dense Voxel Fusion for 3D Object Detection
    Mahmoud, Anas
    Hu, Jordan S. K.
    Waslander, Steven L.
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 663 - 672
  • [37] PointPainting: Sequential Fusion for 3D Object Detection
    Vora, Sourabh
    Lang, Alex H.
    Helou, Bassam
    Beijbom, Oscar
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4603 - 4611
  • [38] Dense projection fusion for 3D object detection
    Chen, Zhao
    Hu, Bin-Jie
    Luo, Chengxi
    Chen, Guohao
    Zhu, Haohui
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [39] A multilevel fusion network for 3D object detection
    Xia, Chunlong
    Wei, Ping
    Wei, Wenwen
    Zheng, Nanning
    Neurocomputing, 2021, 437 : 107 - 117
  • [40] Sparse Dense Fusion for 3D Object Detection
    Gao, Yulu
    Sima, Chonghao
    Shi, Shaoshuai
    Di, Shangzhe
    Liu, Si
    Li, Hongyang
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 10939 - 10946