DMFF: dual-way multimodal feature fusion for 3D object detection

被引：0

作者：

Dong, Xiaopeng ^{[1
]}

Di, Xiaoguang ^{[1
]}

Wang, Wenzhuang ^{[1
]}

机构：

[1] Harbin Inst Technol, Control & Simulat Ctr, Harbin, Peoples R China

来源：

SIGNAL IMAGE AND VIDEO PROCESSING | 2024年 / 18卷 / 01期

基金：

黑龙江省自然科学基金;

关键词：

3D object detection; Multimodal feature fusion; Self-attention mechanism; Lidar point clouds; RGB images;

D O I：

10.1007/s11760-023-02772-z

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Recently, multimodal 3D object detection that fuses the complementary information from LiDAR data and RGB images has been an active research topic. However, it is not trivial to fuse images and point clouds because of different representations of them. Inadequate feature fusion also brings bad effects on detection performance. We convert images into pseudo point clouds by using a depth completion and utilize a more efficient feature fusion method to address the problems. In this paper, we propose a dual-way multimodal feature fusion network (DMFF) for 3D object detection. Specifically, we first use a dual stream feature extraction module (DSFE) to generate homogeneous LiDAR and pseudo region of interest (RoI) features. Then, we propose a dual-way feature interaction method (DWFI) that enables intermodal and intramodal interaction of the two features. Next, we design a local attention feature fusion module (LAFF) to select which features of the input are more likely to contribute to the desired output. In addition, the proposed DMFF achieves the state-of-the-art performances on the KITTI Dataset.

引用

页码：455 / 463

页数：9

共 50 条

[31] MFFNet: Multimodal feature fusion network for RGB-D transparent object detection
Zhu, Li
Li, Tuanjie
Ning, Yuming
Zhang, Yan
INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2024, 21 (05):
[32] DMFF: Deep multimodel feature fusion for building occupancy detection
Sun, Kailai
BUILDING AND ENVIRONMENT, 2024, 253
[33] LiDAR-camera fusion: Dual transformer enhancement for 3D object detection
Chen, Mu
Liu, Pengfei
Zhao, Huaici
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 120
[34] Channelwise and Spatially Guided Multimodal Feature Fusion Network for 3-D Object Detection in Autonomous Vehicles
Uzair, Muhammad
Dong, Jian
Shi, Ronghua
Mushtaq, Husnain
Ullah, Irshad
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
[35] A multilevel fusion network for 3D object detection
Xia, Chunlong
Wei, Ping
Wei, Wenwen
Zheng, Nanning
NEUROCOMPUTING, 2021, 437 : 107 - 117
[36] Dense Voxel Fusion for 3D Object Detection
Mahmoud, Anas
Hu, Jordan S. K.
Waslander, Steven L.
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 663 - 672
[37] PointPainting: Sequential Fusion for 3D Object Detection
Vora, Sourabh
Lang, Alex H.
Helou, Bassam
Beijbom, Oscar
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4603 - 4611
[38] Dense projection fusion for 3D object detection
Chen, Zhao
Hu, Bin-Jie
Luo, Chengxi
Chen, Guohao
Zhu, Haohui
SCIENTIFIC REPORTS, 2024, 14 (01):
[39] A multilevel fusion network for 3D object detection
Xia, Chunlong
Wei, Ping
Wei, Wenwen
Zheng, Nanning
Neurocomputing, 2021, 437 : 107 - 117
[40] Sparse Dense Fusion for 3D Object Detection
Gao, Yulu
Sima, Chonghao
Shi, Shaoshuai
Di, Shangzhe
Liu, Si
Li, Hongyang
2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 10939 - 10946

← 1 2 3 4 5 →