Lite-FPN for keypoint-based monocular 3D object detection

Cited by: 9
Authors
Yang, Lei [1, 2]
Zhang, Xinyu [1, 2]
Li, Jun [1, 2]
Wang, Li [1, 2]
Zhu, Minghan [3]
Zhu, Lei [4]
Affiliations
[1] Tsinghua Univ, State Key Lab Automot Safety & Energy, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Sch Vehicle & Mobil, Beijing 100084, Peoples R China
[3] Univ Michigan, Ann Arbor, MI USA
[4] Mogo Auto Intelligence & Telemetics Informat Techn, Beijing 100084, Peoples R China
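Funding
National High Technology Research and Development Program of China (863 Program); National Natural Science Foundation of China
Keywords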
Monocular 3D object detection; Multi-scale feature fusion; Lite-FPN; Autonomous driving;
DOI
10.1016/j.knosys.2023.110517
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
3D object detection from a single image is an essential and challenging task for autonomous driving. Multi-scale feature fusion helps keypoint-based monocular 3D object detectors perform well across a wide range of object scales and distances. However, existing FPN modules inevitably increase latency because of the additional extraction and merging operations on multi-scale feature maps. In this paper, we propose a lightweight feature pyramid network called Lite-FPN for keypoint-based monocular 3D object detectors, which performs multi-scale feature fusion only at sparsely distributed keypoint locations. In addition, to alleviate the misalignment between classification score and localization precision, we propose an effective regression loss named attention loss, which assigns larger weights during training to predictions whose classification score and localization precision are misaligned. Extensive experiments with several state-of-the-art keypoint-based detectors on the KITTI and nuScenes datasets show that the proposed methods achieve significant accuracy improvements. Meanwhile, SMOKE enhanced with our Lite-FPN module runs more than 19 FPS faster than the baseline enhanced with the classic FPN. (c) 2023 Elsevier B.V. All rights reserved.
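The core idea of fusing multi-scale features only at sparse keypoint locations can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `fuse_at_keypoints`, the nested-list feature maps, the nearest-cell lookup, and the plain element-wise sum are all assumptions made for illustration. It also assumes that every stride is a multiple of the base stride and that all levels share the same channel count.

```python
from typing import Dict, List, Tuple

# A feature map indexed as [row][col][channel] (hypothetical layout).
FeatureMap = List[List[List[float]]]


def fuse_at_keypoints(
    pyramid: Dict[int, FeatureMap],
    keypoints: List[Tuple[int, int]],
    base_stride: int,
) -> List[List[float]]:
    """Sum per-level features, but only at the given keypoint cells.

    pyramid maps a stride (e.g. 4, 8, 16) to the feature map at that stride.
    keypoints are (x, y) cells on the map whose stride is base_stride.
    """
    fused = []
    for x, y in keypoints:
        channels = len(pyramid[base_stride][y][x])
        acc = [0.0] * channels
        for stride, fmap in pyramid.items():
            # Nearest-cell lookup of the same image location on a coarser map.
            scale = stride // base_stride
            vec = fmap[y // scale][x // scale]
            acc = [a + v for a, v in zip(acc, vec)]
        fused.append(acc)
    return fused


def constant_map(height: int, width: int, value: float, channels: int = 2) -> FeatureMap:
    """Build a toy feature map filled with one value (illustration only)."""
    return [[[value] * channels for _ in range(width)] for _ in range(height)]


# Toy three-level pyramid at strides 4, 8 and 16.
pyramid = {
    4: constant_map(8, 8, 1.0),
    8: constant_map(4, 4, 10.0),
    16: constant_map(2, 2, 100.0),
}
keypoints = [(5, 3), (0, 7)]
fused = fuse_at_keypoints(pyramid, keypoints, base_stride=4)
print(fused)
```

In this sketch the cost scales with the number of keypoints times the number of levels rather than with the full spatial size of every feature map, which is the kind of saving the abstract attributes to fusing only at keypoint locations.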
Pages: 11