Temporal-Enhanced Radar and Camera Fusion for Object Detection

被引：0

作者：

Kong, Linhua ^{[1
]}

Wang, Yiming ^{[2
]}

Chang, Dongxia ^{[1
]}

Zhao, Yao ^{[1
]}

机构：

[1] Beijing Jiaotong Univ, Inst Informat Sci, Visual Intellgence X Int Cooperat Joint Lab MOE, Beijing, Peoples R China

[2] Nanjing Univ Posts & Telecommun, Nanjing, Peoples R China

来源：

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS | 2025年 / 21卷 / 01期

关键词：

Automatic Driving; Cross Attention; Ensemble;

D O I：

10.1145/3700442

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Recently, object detection methods based on multi-modal fusion have gained widespread adoption in autonomous driving, proving to be valuable for detecting objects in dynamic environments. Among them, millimeter wave (mmWave) radar is commonly utilized as an effective complement to cameras, as it is almost unaffected by harsh weather conditions. However, current approaches that fuse mmWave radar and camera often overlook the correlation between the two modalities, failing to fully exploit their complementary features. To address this, we propose a temporal-enhanced radar and camera fusion network to explore the correlation between these two modalities and learn a comprehensive representation for object detection. In our model, a temporal fusion model is introduced to fuse mmWave radar features from different moments, thus mitigating the problem of mmWave radar point-object mismatch due to object movement. Moreover, a new correlation-based fusion strategy using the dedicated mask cross-attention is proposed to fuse mmWave radar and vision features more effectively. Finally, we design a gate feature pyramid network that selects shallow texture information based on deep semantic information to obtain more representative features. The experimental results on the nuScenes benchmark demonstrate the effectiveness of our proposed method.

引用

页数：16

共 50 条

[1] RVDet:Feature-level Fusion of Radar and Camera for Object Detection
Zhang, Jingwei
Zhang, Ming
Fang, Zicheng
Wang, Yulong
Zhao, Xian
Pu, Shiliang
2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 2822 - 2828
[2] RCF-TP: Radar-Camera Fusion With Temporal Priors for 3D Object Detection
Miron, Yakov
Drews, Florian
Faion, Florian
Di Castro, Dotan
Klein, Itzik
IEEE ACCESS, 2024, 12 : 127212 - 127223
[3] Interactive guidance network for object detection based on radar-camera fusion
Jiapeng Wang
Linhua Kong
Dongxia Chang
Zisen Kong
Yao Zhao
Multimedia Tools and Applications, 2024, 83 : 28057 - 28075
[4] Millimeter-Wave Radar and Camera Fusion for Multiscenario Object Detection on USVs
He, Xin
Wu, Defeng
Wu, Dongjie
You, Zheng
Zhong, Shangkun
Liu, Qijun
IEEE SENSORS JOURNAL, 2024, 24 (19) : 31562 - 31572
[5] Interactive guidance network for object detection based on radar-camera fusion
Wang, Jiapeng
Kong, Linhua
Chang, Dongxia
Kong, Zisen
Zhao, Yao
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (09) : 28057 - 28075
[6] Camera-Radar Fusion with Radar Channel Extension and Dual-CBAM-FPN for Object Detection
Sun, Xiyan
Jiang, Yaoyu
Qin, Hongmei
Li, Jingjing
Ji, Yuanfa
SENSORS, 2024, 24 (16)
[7] Camera–Radar Fusion with Modality Interaction and Radar Gaussian Expansion for 3D Object Detection
Liu X.
Li Z.
Zhou Y.
Peng Y.
Luo J.
Cyborg and Bionic Systems, 2024, 5
[8] Fusion Point Pruning for Optimized 2D Object Detection with Radar-Camera Fusion
Staecker, Lukas
Heidenreich, Philipp
Rambach, Jason
Stricker, Didier
2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 1275 - 1282
[9] A Temporal-Enhanced Model for Knowledge Tracing
Cui, Shaoguo
Wang, Mingyang
Xu, Song
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT IX, 2024, 15024 : 407 - 421
[10] Radar-camera fusion for 3D object detection with aggregation transformer
Li, Jun
Zhang, Han
Wu, Zizhang
Xu, Tianhao
APPLIED INTELLIGENCE, 2024, 54 (21) : 10627 - 10639

← 1 2 3 4 5 →