CR-DINO: A Novel Camera-Radar Fusion 2-D Object Detection Model Based on Transformer

被引:3
|
作者
Jin, Yuhao [1 ]
Zhu, Xiaohui [1 ]
Yue, Yong [1 ]
Lim, Eng Gee [1 ]
Wang, Wei [2 ]
机构
[1] Xian Jiaotong Liverpool Univ, Sch Adv Technol, Suzhou 215000, Peoples R China
[2] Hebei Normal Univ, Coll Comp & Cyber Secur, Shijiazhuang 050024, Hebei, Peoples R China
关键词
Autonomous vehicle; deep learning; multisensor fusion; object detection; transformer;
D O I
10.1109/JSEN.2024.3357775
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Due to millimeter-wave (MMW) radar's ability to directly acquire spatial positions and velocity information of objects, as well as its robust performance in adverse weather conditions, it has been widely employed in autonomous driving. However, radar lacks specific semantic information. To address this limitation, we take the complementary strengths of camera and radar by feature-level fusion and propose a fully transformer-based model for object detection in autonomous driving. Specifically, we introduce a novel radar representation method and propose two camera-radar fusion architectures based on Swin transformer. We name our proposed model as camera-radar based DETR with improved denoising anchor boxes (CR-DINO) and conduct training and testing on the nuScenes dataset. We conducted several ablation experiments, and the best result we obtained was an mAP of 38.0%, surpassing other state-of-the-art (SOTA) camera-radar fusion object detection models.
引用
收藏
页码:11080 / 11090
页数:11
相关论文
共 50 条
  • [41] Transformer-based difference fusion network for RGB-D salient object detection
    Cui, Zhi-Qiang
    Wang, Feng
    Feng, Zheng-Yong
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (06)
  • [42] BAFusion: Bidirectional Attention Fusion for 3D Object Detection Based on LiDAR and Camera
    Liu, Min
    Jia, Yuanjun
    Lyu, Youhao
    Dong, Qi
    Yang, Yanyu
    SENSORS, 2024, 24 (14)
  • [43] Rethinking the Late Fusion of LiDAR-Camera Based 3D Object Detection
    Yu, Lehang
    Zhang, Jing
    Liu, Zhong
    Yue, Haosong
    Chen, Weihai
    2024 IEEE 19TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, ICIEA 2024, 2024,
  • [44] MVFusion: Multi-View 3D Object Detection with Semantic-aligned Radar and Camera Fusion
    Wu, Zizhang
    Chen, Guilian
    Gan, Yuanzhu
    Wang, Lei
    Pu, Jian
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 2766 - 2773
  • [45] CFTrack: Center-based Radar and Camera Fusion for 3D Multi-Object Tracking
    Nabati, Ramin
    Harris, Landon
    Qi, Hairong
    2021 IEEE INTELLIGENT VEHICLES SYMPOSIUM WORKSHOPS (IV WORKSHOPS), 2021, : 243 - 248
  • [46] SGDet3D: Semantics and Geometry Fusion for 3D Object Detection Using 4D Radar and Camera
    Bai, Xiaokai
    Yu, Zhu
    Zheng, Lianqing
    Zhang, Xiaohan
    Zhou, Zili
    Zhang, Xue
    Wang, Fang
    Bai, Jie
    Shen, Hui-Liang
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (01): : 828 - 835
  • [47] Fusion of Multiple 2D LiDAR and RADAR for Object Detection and Tracking in All Directions
    Na, Kiin
    Byun, Jaemin
    Roh, Myongchan
    Seo, Beomsu
    2014 INTERNATIONAL CONFERENCE ON CONNECTED VEHICLES AND EXPO (ICCVE), 2014, : 1058 - 1059
  • [48] Object Detection and Tracking for Unmanned Vehicles Based on Fusion of Infrared Camera and MMW Radar in Smoke-obscured Environment
    Xiong G.
    Luo Z.
    Sun D.
    Tao J.
    Tang Z.
    Wu C.
    Binggong Xuebao/Acta Armamentarii, 2024, 45 (03): : 893 - 906
  • [49] Transformer-Based Optimized Multimodal Fusion for 3D Object Detection in Autonomous Driving
    Alaba, Simegnew Yihunie
    Ball, John E.
    IEEE ACCESS, 2024, 12 : 50165 - 50176
  • [50] Bridging the View Disparity Between Radar and Camera Features for Multi-Modal Fusion 3D Object Detection
    Zhou, Taohua
    Chen, Junjie
    Shi, Yining
    Jiang, Kun
    Yang, Mengmeng
    Yang, Diange
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (02): : 1523 - 1535