Towards efficient multi-modal 3D object detection: Homogeneous sparse fuse network

被引：1

作者：

Tang, Yingjuan ^{[1
]}

He, Hongwen ^{[1
]}

Wang, Yong ^{[1
]}

Wu, Jingda ^{[2
]}

机构：

[1] Beijing Inst Technol, Sch Mech Engn, Beijing 100081, Peoples R China

[2] Nanyang Technol Univ, Sch Mech & Aerosp Engn, 50 Nanyang Ave, Singapore 639798, Singapore

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2024年 / 256卷

关键词：

Autonomous driving; 3D object detection; Multi-modal; Sparse convolutional networks; Point cloud and image fusion; Homogeneous fusion;

D O I：

10.1016/j.eswa.2024.124945

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

LiDAR-only 3D detection methods struggle with the sparsity of point clouds. To overcome this issue, multi- modal methods have been proposed, but their fusion is a challenge due to the heterogeneous representation of images and point clouds. This paper proposes a novel multi-modal framework, Homogeneous Sparse Fusion (HS-Fusion), which generates pseudo point clouds from depth completion. The proposed framework introduces a 3D foreground-aware middle extractor that efficiently extracts high-responding foreground features from sparse point cloud data. This module can be integrated into existing sparse convolutional neural networks. Furthermore, the proposed homogeneous attentive fusion enables cross-modality consistency fusion. Finally, the proposed HS-Fusion can simultaneously combine 2D image features and 3D geometric features of pseudo point clouds using multi-representation feature extraction. The proposed network has been found to attain better performance on the 3D object detection benchmarks. In particular, the proposed model demonstrates a 4.02% improvement in accuracy compared to the pure model. Moreover, its inference speed surpasses that of other models, thus further validating the efficacy of HS-Fusion.

引用

页数：12

共 50 条

[31] MMDistill: Multi-Modal BEV Distillation Framework for Multi-View 3D Object Detection
Jiao, Tianzhe
Chen, Yuming
Zhang, Zhe
Guo, Chaopeng
Song, Jie
CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 81 (03): : 4307 - 4325
[32] Multi-modal information fusion for LiDAR-based 3D object detection framework
Ruixin Ma
Yong Yin
Jing Chen
Rihao Chang
Multimedia Tools and Applications, 2024, 83 : 7995 - 8012
[33] Dual-domain deformable feature fusion for multi-modal 3D object detection
Wang, Shihao
Deng, Tao
JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (06)
[34] CAT-Det: Contrastively Augmented Transformer for Multi-modal 3D Object Detection
Zhang, Yanan
Chen, Jiaxin
Huang, Di
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 898 - 907
[35] DeepFusion: Lidar-Camera Deep Fusion for Multi-Modal 3D Object Detection
Li, Yingwei
Yu, Adams Wei
Meng, Tianjian
Caine, Ben
Ngiam, Jiquan
Peng, Daiyi
Shen, Junyang
Lu, Yifeng
Zhou, Denny
Le, Quoc, V
Yuille, Alan
Tan, Mingxing
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 17161 - 17170
[36] Multi-modal information fusion for LiDAR-based 3D object detection framework
Ma, Ruixin
Yin, Yong
Chen, Jing
Chang, Rihao
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (03) : 7995 - 8012
[37] Unlocking the power of multi-modal fusion in 3D object tracking
Hu, Yue
IET COMPUTER VISION, 2025, 19 (01)
[38] AutoAlign: Pixel-Instance Feature Aggregation for Multi-Modal 3D Object Detection
Chen, Zehui
Li, Zhenyu
Zhang, Shiquan
Fang, Liangji
Jiang, Qinhong
Zhao, Feng
Zhou, Bolei
Zhao, Hang
PROCEEDINGS OF THE THIRTY-FIRST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2022, 2022, : 827 - 833
[39] MULTI-DIMENSIONAL PRUNED SPARSE CONVOLUTION FOR EFFICIENT 3D OBJECT DETECTION
Li, Linye
Yue, Xiaodong
Xu, Zhikang
Xie, Shaorong
2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 3190 - 3194
[40] A Multi-Modal Fusion-Based 3D Multi-Object Tracking Framework With Joint Detection
Wang, Xiyang
Fu, Chunyun
He, Jiawei
Huang, Mingguang
Meng, Ting
Zhang, Siyu
Zhou, Hangning
Xu, Ziyao
Zhang, Chi
IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (01): : 532 - 539

← 1 2 3 4 5 →