Camera-Radar Fusion with Modality Interaction and Radar Gaussian Expansion for 3D Detection

Cited by: 0
Authors
Liu, Xiang [1]
Li, Zhenglin [1, 2]
Zhou, Yang [1]
Peng, Yan [1, 2]
Luo, Jun [1, 3]
Affiliations
[1] Shanghai Univ, Inst Artificial Intelligence, Shanghai, Peoples R China
[2] Shanghai Univ, Sch Future Technol, Shanghai, Peoples R China
[3] Chongqing Univ, State Key Lab Mech Transmiss, Chongqing, Peoples R China
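Source
Funding
National Natural Science Foundation of China
Keywords
DOI
Not available
Chinese Library Classification number
R318 [Biomedical Engineering]
Discipline classification code
0831
Abstract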
The fusion of millimeter-wave radar and camera modalities is crucial for improving the accuracy and completeness of 3-dimensional (3D) object detection. Most existing methods extract features from each modality separately and conduct fusion with specifically designed modules, potentially resulting in information loss during modality transformation. To address this issue, we propose a novel framework for 3D object detection that iteratively updates radar and camera features through an interaction module. This module serves a dual purpose by facilitating the fusion of multi-modal data while preserving the original features. Specifically, radar and image features are sampled and aggregated with a set of sparse 3D object queries, while retaining the integrity of the original radar features to prevent information loss. Additionally, an innovative radar augmentation technique named Radar Gaussian Expansion is proposed. This module allocates radar measurements within each voxel to neighboring ones as a Gaussian distribution, reducing association errors during projection and enhancing detection accuracy. Our proposed framework offers a comprehensive solution to the fusion of radar and camera data, ultimately leading to heightened accuracy and completeness in 3D object detection processes. On the nuScenes test benchmark, our camera-radar fusion method achieves state-of-the-art 3D object detection results with a 41.6% mean average precision and 52.5% nuScenes detection score.
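The abstract describes Radar Gaussian Expansion only at a high level: radar measurements in each voxel are allocated to neighboring voxels following a Gaussian distribution. The sketch below is one possible reading of that idea, not the authors' implementation. It assumes a 2D bird's-eye-view grid rather than a full 3D voxel volume, and the function name `gaussian_expand` and the parameters `sigma` and `radius` are hypothetical choices made for illustration.

```python
import numpy as np


def gaussian_expand(grid, sigma=1.0, radius=1):
    """Spread each occupied cell's value over its neighbors with Gaussian weights.

    grid   : 2D array of per-cell radar values (assumed BEV layout).
    sigma  : hypothetical Gaussian standard deviation, in cells.
    radius : hypothetical neighborhood half-width, in cells.
    """
    # Build a normalized (2*radius+1) x (2*radius+1) Gaussian kernel.
    offsets = np.arange(-radius, radius + 1)
    dy, dx = np.meshgrid(offsets, offsets, indexing="ij")
    kernel = np.exp(-(dx**2 + dy**2) / (2.0 * sigma**2))
    kernel /= kernel.sum()

    # Accumulate into a zero-padded copy so border cells need no special case.
    h, w = grid.shape
    padded = np.zeros((h + 2 * radius, w + 2 * radius), dtype=float)
    for y, x in zip(*np.nonzero(grid)):
        padded[y:y + 2 * radius + 1, x:x + 2 * radius + 1] += grid[y, x] * kernel

    # Crop back to the original extent; weight that falls outside is discarded.
    return padded[radius:radius + h, radius:radius + w]


# Example: a single radar return in the middle of a 5 x 5 grid.
grid = np.zeros((5, 5))
grid[2, 2] = 1.0
expanded = gaussian_expand(grid, sigma=1.0, radius=1)
```

In this sketch a single return ends up occupying a 3 x 3 neighborhood with the largest weight at its original cell, so a small projection offset between radar and image still lands on a non-empty cell. How the paper chooses the neighborhood size and variance is not stated in the abstract.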
Pages: 11