CR-DINO: A Novel Camera-Radar Fusion 2-D Object Detection Model Based on Transformer

被引:3
|
作者
Jin, Yuhao [1 ]
Zhu, Xiaohui [1 ]
Yue, Yong [1 ]
Lim, Eng Gee [1 ]
Wang, Wei [2 ]
机构
[1] Xian Jiaotong Liverpool Univ, Sch Adv Technol, Suzhou 215000, Peoples R China
[2] Hebei Normal Univ, Coll Comp & Cyber Secur, Shijiazhuang 050024, Hebei, Peoples R China
关键词
Autonomous vehicle; deep learning; multisensor fusion; object detection; transformer;
D O I
10.1109/JSEN.2024.3357775
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Due to millimeter-wave (MMW) radar's ability to directly acquire spatial positions and velocity information of objects, as well as its robust performance in adverse weather conditions, it has been widely employed in autonomous driving. However, radar lacks specific semantic information. To address this limitation, we take the complementary strengths of camera and radar by feature-level fusion and propose a fully transformer-based model for object detection in autonomous driving. Specifically, we introduce a novel radar representation method and propose two camera-radar fusion architectures based on Swin transformer. We name our proposed model as camera-radar based DETR with improved denoising anchor boxes (CR-DINO) and conduct training and testing on the nuScenes dataset. We conducted several ablation experiments, and the best result we obtained was an mAP of 38.0%, surpassing other state-of-the-art (SOTA) camera-radar fusion object detection models.
引用
收藏
页码:11080 / 11090
页数:11
相关论文
共 50 条
  • [31] Object detection using a novel 2-D adaptive sampling strategy
    Mulassano, P
    Avagnina, D
    Presti, LL
    MELECON 2000: INFORMATION TECHNOLOGY AND ELECTROTECHNOLOGY FOR THE MEDITERRANEAN COUNTRIES, VOLS 1-3, PROCEEDINGS, 2000, : 627 - 630
  • [32] ConCs-Fusion: A Context Clustering-Based Radar and Camera Fusion for Three-Dimensional Object Detection
    He, Wei
    Deng, Zhenmiao
    Ye, Yishan
    Pan, Pingping
    REMOTE SENSING, 2023, 15 (21)
  • [33] LXLv2: Enhanced LiDAR Excluded Lean 3D Object Detection with Fusion of 4D Radar and Camera
    Xiong, Weiyi
    Zou, Zean
    Zhao, Qiuchi
    He, Fengchun
    Zhu, Bing
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (03): : 2862 - 2869
  • [34] Fusion information enhanced method based on transformer for 3D object detection
    Jin Y.
    Tao C.
    Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2023, 44 (12): : 297 - 306
  • [35] Salient object detection with bayesian inference based on radar and camera fusion used in UAV obstacle avoidance
    Wang, Xiyue
    Wang, Xinsheng
    Zhou, Zhiquan
    Song, Yanhong
    PHYSICA SCRIPTA, 2024, 99 (11)
  • [36] RCF-TP: Radar-Camera Fusion With Temporal Priors for 3D Object Detection
    Miron, Yakov
    Drews, Florian
    Faion, Florian
    Di Castro, Dotan
    Klein, Itzik
    IEEE ACCESS, 2024, 12 : 127212 - 127223
  • [37] SparseFusion3D: Sparse Sensor Fusion for 3D Object Detection by Radar and Camera in Environmental Perception
    Yu, Zedong
    Wan, Weibing
    Ren, Maiyu
    Zheng, Xiuyuan
    Fang, Zhijun
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 1524 - 1536
  • [38] LXL: LiDAR Excluded Lean 3D Object Detection With 4D Imaging Radar and Camera Fusion
    Xiong, Weiyi
    Liu, Jianan
    Huang, Tao
    Han, Qing-Long
    Xia, Yuxuan
    Zhu, Bing
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 79 - 92
  • [39] Point Cloud Painting for 3D Object Detection with Camera and Automotive 3+1D RADAR Fusion
    Montiel-Marin, Santiago
    Llamazares, Angel
    Antunes, Miguel
    Revenga, Pedro A.
    Bergasa, Luis M.
    SENSORS, 2024, 24 (04)
  • [40] LXL: LiDAR Excluded Lean 3D Object Detection with 4D Imaging Radar and Camera Fusion
    Xiong, Weiyi
    Liu, Jianan
    Huang, Tao
    Han, Qing-Long
    Xia, Yuxuan
    Zhu, Bing
    2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024, 2024, : 3142 - 3142