Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection

被引：40

作者：

Hong, Yu ^{[1
]}

Dai, Hang ^{[2
]}

Ding, Yong ^{[1
]}

机构：

[1] Zhejiang Univ, Hangzhou, Peoples R China

[2] MBZUAI, Abu Dhabi, U Arab Emirates

来源：

COMPUTER VISION, ECCV 2022, PT X | 2022年 / 13670卷

关键词：

POINT;

D O I：

10.1007/978-3-031-20080-9_6

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Leveraging LiDAR-based detectors or real LiDAR point data to guide monocular 3D detection has brought significant improvement, e.g., Pseudo-LiDAR methods. However, the existing methods usually apply non-end-to-end training strategies and insufficiently leverage the LiDAR information, where the rich potential of the LiDAR data has not been well exploited. In this paper, we propose the Cross-Modality Knowledge Distillation (CMKD) network for monocular 3D detection to efficiently and directly transfer the knowledge from LiDAR modality to image modality on both features and responses. Moreover, we further extend CMKD as a semi-supervised training framework by distilling knowledge from large-scale unlabeled data and significantly boost the performance. Until submission, CMKD ranks 1st among the monocular 3D detectors with publications on both KITTI test set and Waymo val set with significant performance gains compared to previous state-of-the-art methods.

引用

页码：87 / 104

页数：18

共 50 条

[21] Cross-modality salient object detection network with universality and anti-interference
Wen, Hongwei
Song, Kechen
Huang, Liming
Wang, Han
Yan, Yunhui
KNOWLEDGE-BASED SYSTEMS, 2023, 264
[22] MCAFNet: Multiscale cross-modality adaptive fusion network for multispectral object detection
Zheng, Shangpo
Liu, Junfeng
Jun, Zeng
DIGITAL SIGNAL PROCESSING, 2025, 159
[23] Geometry Uncertainty Projection Network for Monocular 3D Object Detection
Lu, Yan
Ma, Xinzhu
Yang, Lei
Zhang, Tianzhu
Liu, Yating
Chu, Qi
Yan, Junjie
Ouyang, Wanli
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3091 - 3101
[24] Depth-enhancement network for monocular 3D object detection
Liu, Guohua
Lian, Haiyang
Guo, Changrui
MEASUREMENT SCIENCE AND TECHNOLOGY, 2024, 35 (09)
[25] Categorical Depth Distribution Network for Monocular 3D Object Detection
Reading, Cody
Harakeh, Ali
Chae, Julia
Waslander, Steven L.
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8551 - 8560
[26] DEVIANT: Depth EquiVarIAnt NeTwork for Monocular 3D Object Detection
Kumar, Abhinav
Brazil, Garrick
Corona, Enrique
Parchami, Armin
Liu, Xiaoming
COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 : 664 - 683
[27] Aerial Monocular 3D Object Detection
Hu, Yue
Fang, Shaoheng
Xie, Weidi
Chen, Siheng
IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (04) : 1959 - 1966
[28] Disentangling Monocular 3D Object Detection
Simonelli, Andrea
Bulo, Samuel Rota
Porzi, Lorenzo
Lopez-Antequera, Manuel
Kontschieder, Peter
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1991 - 1999
[29] Cross-modality interaction for few-shot multispectral object detection with semantic knowledge
Huang, Lian
Peng, Zongju
Chen, Fen
Dai, Shaosheng
He, Ziqiang
Liu, Kesheng
NEURAL NETWORKS, 2024, 173
[30] Cross-modality interaction for few-shot multispectral object detection with semantic knowledge
Huang, Lian
Peng, Zongju
Chen, Fen
Dai, Shaosheng
He, Ziqiang
Liu, Kesheng
Neural Networks, 2024, 173

← 1 2 3 4 5 →