Multi-Scale Enhanced Depth Knowledge Distillation for Monocular 3D Object Detection with SEFormer

被引:0
|
作者
Zhang, Han [1 ]
Li, Jun [1 ]
Tang, Rui [2 ]
Shi, Zhiping [1 ]
Bu, Aojie [1 ]
机构
[1] Capital Normal Univ, Informat Engn Coll, Beijing, Peoples R China
[2] ZongMu Technol, Comp Vis Percept Dept, Shanghai, Peoples R China
关键词
3D object detection; Knowledge distillation; Autonomous driving;
D O I
10.1109/iThings-GreenCom-CPSCom-SmartData-Cybermatics60724.2023.00031
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the context of the Internet of Things, where efficient and accurate perception is crucial. Monocular 3D detection has gained attention due to its cost-effectiveness. This paper introduces an efficient method for monocular 3D object detection, termed Multi-Scale Enhanced Depth Knowledge Distillation (MDKD). Our approach simplifies the teacher network, eliminating the need for extra modal data input while improving the student network's performance. Additionally, we present a Multi-Scale Depth Enhancement (MDE) module and a novel lightweight Squeeze-Excitation Former (SEFormer). Our method addresses the growing demand for precise object detection within IoT environments. Extensive experiments on the KITTI dataset validate our method's effectiveness.
引用
收藏
页码:38 / 43
页数:6
相关论文
共 50 条
  • [1] MDS-Net: Multi-Scale Depth Stratification 3D Object Detection from Monocular Images
    Xie, Zhouzhen
    Song, Yuying
    Wu, Jingxuan
    Li, Zecheng
    Song, Chunyi
    Xu, Zhiwei
    SENSORS, 2022, 22 (16)
  • [2] Monomm: a multi-scale mamba-enhanced network for real-time monocular 3D object detection
    Fu, Youjia
    Xu, Zihao
    Fu, Junsong
    Xue, Huixia
    Tan, Shuqiu
    Li, Lei
    Qing, Shaoxun
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (03):
  • [3] Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection
    Hong, Yu
    Dai, Hang
    Ding, Yong
    COMPUTER VISION, ECCV 2022, PT X, 2022, 13670 : 87 - 104
  • [4] Monocular 3D Object Detection With Motion Feature Distillation
    Hu, Henan
    Li, Muyu
    Zhu, Ming
    Gao, Wen
    Liu, Peiyu
    Chan, Kwok-Leung
    IEEE ACCESS, 2023, 11 : 82933 - 82945
  • [5] Multi-Scale PointPillars 3D Object Detection Network
    Ya, Hang
    Luo, Guiming
    PROCEEDINGS OF THE 2019 IEEE 18TH INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS & COGNITIVE COMPUTING (ICCI*CC 2019), 2019, : 174 - 179
  • [6] Monocular 3D Object Detection with Depth from Motion
    Wang, Tai
    Pang, Jiangmiao
    Lin, Dahua
    COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 : 386 - 403
  • [7] Enhanced frustrum multi-scale VoteNet for 3D object detection in cluttered indoor scene
    Zhang, Xuesong
    He, Yu
    Song, Cunli
    Zhuang, Yan
    APPLIED INTELLIGENCE, 2025, 55 (07)
  • [8] SEFormer: Structure Embedding Transformer for 3D Object Detection
    Feng, Xiaoyu
    Du, Heming
    Fan, Hehe
    Duan, Yueqi
    Liu, Yongpan
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 632 - 640
  • [9] Attention-Based Depth Distillation with 3D-Aware Positional Encoding for Monocular 3D Object Detection
    Wu, Zizhang
    Wu, Yunzhe
    Pu, Jian
    Li, Xianzhi
    Wang, Xiaoquan
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 2892 - 2900
  • [10] EFMF-pillars: 3D object detection based on enhanced features and multi-scale fusion
    Zhang, Wenbiao
    Chen, Gang
    Wang, Hongyan
    Yang, Lina
    Sun, Tao
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2024, 2024 (01):