Multi-Scale Enhanced Depth Knowledge Distillation for Monocular 3D Object Detection with SEFormer

Authors
Zhang, Han [1]
Li, Jun [1]
Tang, Rui [2]
Shi, Zhiping [1]
Bu, Aojie [1]
Affiliations
[1] Capital Normal Univ, Informat Engn Coll, Beijing, Peoples R China
[2] ZongMu Technol, Comp Vis Percept Dept, Shanghai, Peoples R China
Keywords
3D object detection; Knowledge distillation; Autonomous driving
DOI
10.1109/iThings-GreenCom-CPSCom-SmartData-Cybermatics60724.2023.00031
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In the context of the Internet of Things, where efficient and accurate perception is crucial, monocular 3D detection has gained attention due to its cost-effectiveness. This paper introduces an efficient method for monocular 3D object detection, termed Multi-Scale Enhanced Depth Knowledge Distillation (MDKD). Our approach simplifies the teacher network, eliminating the need for input from additional data modalities while improving the student network's performance. Additionally, we present a Multi-Scale Depth Enhancement (MDE) module and a novel lightweight Squeeze-Excitation Former (SEFormer). Our method addresses the growing demand for precise object detection within IoT environments. Extensive experiments on the KITTI dataset validate our method's effectiveness.
Pages: 38-43
Number of pages: 6