CR-DINO: A Novel Camera-Radar Fusion 2-D Object Detection Model Based on Transformer

被引:3
|
作者
Jin, Yuhao [1 ]
Zhu, Xiaohui [1 ]
Yue, Yong [1 ]
Lim, Eng Gee [1 ]
Wang, Wei [2 ]
机构
[1] Xian Jiaotong Liverpool Univ, Sch Adv Technol, Suzhou 215000, Peoples R China
[2] Hebei Normal Univ, Coll Comp & Cyber Secur, Shijiazhuang 050024, Hebei, Peoples R China
关键词
Autonomous vehicle; deep learning; multisensor fusion; object detection; transformer;
D O I
10.1109/JSEN.2024.3357775
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Due to millimeter-wave (MMW) radar's ability to directly acquire spatial positions and velocity information of objects, as well as its robust performance in adverse weather conditions, it has been widely employed in autonomous driving. However, radar lacks specific semantic information. To address this limitation, we take the complementary strengths of camera and radar by feature-level fusion and propose a fully transformer-based model for object detection in autonomous driving. Specifically, we introduce a novel radar representation method and propose two camera-radar fusion architectures based on Swin transformer. We name our proposed model as camera-radar based DETR with improved denoising anchor boxes (CR-DINO) and conduct training and testing on the nuScenes dataset. We conducted several ablation experiments, and the best result we obtained was an mAP of 38.0%, surpassing other state-of-the-art (SOTA) camera-radar fusion object detection models.
引用
收藏
页码:11080 / 11090
页数:11
相关论文
共 50 条
  • [1] CRAFT: Camera-Radar 3D Object Detection with Spatio-Contextual Fusion Transformer
    Kim, Youngseok
    Kim, Sanmin
    Choi, Jun Won
    Kum, Dongsuk
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 1160 - 1168
  • [2] Camera-Radar Fusion with Modality Interaction and Radar Gaussian Expansion for 3D Object Detection
    Liu, Xiang
    Li, Zhenglin
    Zhou, Yang
    Peng, Yan
    Luo, Jun
    CYBORG AND BIONIC SYSTEMS, 2024, 5
  • [3] Camera-Radar Fusion with Radar Channel Extension and Dual-CBAM-FPN for Object Detection
    Sun, Xiyan
    Jiang, Yaoyu
    Qin, Hongmei
    Li, Jingjing
    Ji, Yuanfa
    SENSORS, 2024, 24 (16)
  • [4] TransCAR: Transformer-based Camera-And-Radar Fusion for 3D Object Detection
    Pang, Su
    Morris, Daniel
    Radha, Hayder
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 10902 - 10909
  • [5] Radar-camera fusion for 3D object detection with aggregation transformer
    Li, Jun
    Zhang, Han
    Wu, Zizhang
    Xu, Tianhao
    APPLIED INTELLIGENCE, 2024, 54 (21) : 10627 - 10639
  • [6] Camera-Radar Fusion with Modality Interaction and Radar Gaussian Expansion for 3D Detection
    Liu, Xiang
    Li, Zhenglin
    Zhou, Yang
    Peng, Yan
    Luo, Jun
    Liu, Xiang
    CYBORG AND BIONIC SYSTEMS, 2024, 5
  • [7] RCMixer: Radar-camera fusion based on vision transformer for robust object detection
    Wang, Lindong
    Tuo, Hongya
    Yuan, Yu
    Leung, Henry
    Jing, Zhongliang
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2025, 107
  • [8] NeXtFusion: Attention-Based Camera-Radar Fusion Network for Improved Three-Dimensional Object Detection and Tracking
    Kalgaonkar, Priyank
    El-Sharkawy, Mohamed
    FUTURE INTERNET, 2024, 16 (04)
  • [9] CramNet: Camera-Radar Fusion with Ray-Constrained Cross-Attention for Robust 3D Object Detection
    Hwang, Jyh-Jing
    Kretzschmar, Henrik
    Manela, Joshua
    Rafferty, Sean
    Armstrong-Crews, Nicholas
    Chen, Tiffany
    Anguelov, Dragomir
    COMPUTER VISION, ECCV 2022, PT XXXVIII, 2022, 13698 : 388 - 405
  • [10] Fusion Point Pruning for Optimized 2D Object Detection with Radar-Camera Fusion
    Staecker, Lukas
    Heidenreich, Philipp
    Rambach, Jason
    Stricker, Didier
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 1275 - 1282