Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection

被引:40
|
作者
Hong, Yu [1 ]
Dai, Hang [2 ]
Ding, Yong [1 ]
机构
[1] Zhejiang Univ, Hangzhou, Peoples R China
[2] MBZUAI, Abu Dhabi, U Arab Emirates
来源
关键词
POINT;
D O I
10.1007/978-3-031-20080-9_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Leveraging LiDAR-based detectors or real LiDAR point data to guide monocular 3D detection has brought significant improvement, e.g., Pseudo-LiDAR methods. However, the existing methods usually apply non-end-to-end training strategies and insufficiently leverage the LiDAR information, where the rich potential of the LiDAR data has not been well exploited. In this paper, we propose the Cross-Modality Knowledge Distillation (CMKD) network for monocular 3D detection to efficiently and directly transfer the knowledge from LiDAR modality to image modality on both features and responses. Moreover, we further extend CMKD as a semi-supervised training framework by distilling knowledge from large-scale unlabeled data and significantly boost the performance. Until submission, CMKD ranks 1st among the monocular 3D detectors with publications on both KITTI test set and Waymo val set with significant performance gains compared to previous state-of-the-art methods.
引用
收藏
页码:87 / 104
页数:18
相关论文
共 50 条
  • [21] Cross-modality salient object detection network with universality and anti-interference
    Wen, Hongwei
    Song, Kechen
    Huang, Liming
    Wang, Han
    Yan, Yunhui
    KNOWLEDGE-BASED SYSTEMS, 2023, 264
  • [22] MCAFNet: Multiscale cross-modality adaptive fusion network for multispectral object detection
    Zheng, Shangpo
    Liu, Junfeng
    Jun, Zeng
    DIGITAL SIGNAL PROCESSING, 2025, 159
  • [23] Geometry Uncertainty Projection Network for Monocular 3D Object Detection
    Lu, Yan
    Ma, Xinzhu
    Yang, Lei
    Zhang, Tianzhu
    Liu, Yating
    Chu, Qi
    Yan, Junjie
    Ouyang, Wanli
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3091 - 3101
  • [24] Depth-enhancement network for monocular 3D object detection
    Liu, Guohua
    Lian, Haiyang
    Guo, Changrui
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2024, 35 (09)
  • [25] Categorical Depth Distribution Network for Monocular 3D Object Detection
    Reading, Cody
    Harakeh, Ali
    Chae, Julia
    Waslander, Steven L.
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8551 - 8560
  • [26] DEVIANT: Depth EquiVarIAnt NeTwork for Monocular 3D Object Detection
    Kumar, Abhinav
    Brazil, Garrick
    Corona, Enrique
    Parchami, Armin
    Liu, Xiaoming
    COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 : 664 - 683
  • [27] Aerial Monocular 3D Object Detection
    Hu, Yue
    Fang, Shaoheng
    Xie, Weidi
    Chen, Siheng
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (04) : 1959 - 1966
  • [28] Disentangling Monocular 3D Object Detection
    Simonelli, Andrea
    Bulo, Samuel Rota
    Porzi, Lorenzo
    Lopez-Antequera, Manuel
    Kontschieder, Peter
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1991 - 1999
  • [29] Cross-modality interaction for few-shot multispectral object detection with semantic knowledge
    Huang, Lian
    Peng, Zongju
    Chen, Fen
    Dai, Shaosheng
    He, Ziqiang
    Liu, Kesheng
    NEURAL NETWORKS, 2024, 173
  • [30] Cross-modality interaction for few-shot multispectral object detection with semantic knowledge
    Huang, Lian
    Peng, Zongju
    Chen, Fen
    Dai, Shaosheng
    He, Ziqiang
    Liu, Kesheng
    Neural Networks, 2024, 173