Multi-Scale Enhanced Depth Knowledge Distillation for Monocular 3D Object Detection with SEFormer

Cited by: 0

Authors:

Zhang, Han [1]

Li, Jun [1]

Tang, Rui [2]

Shi, Zhiping [1]

Bu, Aojie [1]

Affiliations:

[1] Capital Normal Univ, Informat Engn Coll, Beijing, Peoples R China

[2] ZongMu Technol, Comp Vis Percept Dept, Shanghai, Peoples R China

Source:

2023 IEEE INTERNATIONAL CONFERENCES ON INTERNET OF THINGS, ITHINGS IEEE GREEN COMPUTING AND COMMUNICATIONS, GREENCOM IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING, CPSCOM IEEE SMART DATA, SMARTDATA AND IEEE CONGRESS ON CYBERMATICS, CYBERMATICS | 2024

Keywords:

3D object detection; Knowledge distillation; Autonomous driving;

DOI:

10.1109/iThings-GreenCom-CPSCom-SmartData-Cybermatics60724.2023.00031

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In the context of the Internet of Things, where efficient and accurate perception is crucial, monocular 3D detection has gained attention due to its cost-effectiveness. This paper introduces an efficient method for monocular 3D object detection, termed Multi-Scale Enhanced Depth Knowledge Distillation (MDKD). Our approach simplifies the teacher network, eliminating the need for extra modal data input while improving the student network's performance. Additionally, we present a Multi-Scale Depth Enhancement (MDE) module and a novel lightweight Squeeze-Excitation Former (SEFormer). Our method addresses the growing demand for precise object detection within IoT environments. Extensive experiments on the KITTI dataset validate our method's effectiveness.


Pages: 38-43

Page count: 6

