Lite-FPN for keypoint-based monocular 3D object detection

被引:9
|
作者
Yang, Lei [1 ,2 ]
Zhang, Xinyu [1 ,2 ]
Li, Jun [1 ,2 ]
Wang, Li [1 ,2 ]
Zhu, Minghan [3 ]
Zhu, Lei [4 ]
机构
[1] Tsinghua Univ, State Key Lab Automot Safety & Energy, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Sch Vehicle & Mobil, Beijing 100084, Peoples R China
[3] Univ Michigan, Ann Arbor, MI USA
[4] Mogo Auto Intelligence & Telemetics Informat Techn, Beijing 100084, Peoples R China
基金
国家高技术研究发展计划(863计划); 中国国家自然科学基金;
关键词
Monocular 3D object detection; Multi-scale feature fusion; Lite-FPN; Autonomous driving;
D O I
10.1016/j.knosys.2023.110517
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D object detection with a single image is an essential and challenging task for autonomous driving. Multi-scale feature fusion is effective for keypoint-based monocular 3D object detectors to boost performance within a large range of scales and distances. However, the existing FPN modules inevitably increase latency owing to the further extraction and merging operations on multi-scale feature maps. In this paper, we propose a lightweight feature pyramid network called Lite-FPN for keypoint-based monocular 3D object detectors that perform multi-scale feature fusion only at sparsely distributed keypoint locations. Besides, to alleviate the misalignment between classification score and localization precision, we propose an effective regression loss named attention loss, which assigns predictions with misaligned classification score and localization precision larger weights in the training stage. Extensive experiments based on several state-of-the-art keypoint-based detectors on the KITTI and nuScenes datasets show that our proposed methods manage to achieve significant accuracy improvements. Meanwhile, the enhanced SMOKE with our Lite-FPN module surpasses the baseline enhanced by the classic FPN over 19 FPS.(c) 2023 Elsevier B.V. All rights reserved.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] OKGR: Occluded Keypoint Generation and Refinement for 3D Object Detection
    Ji, Mingqian
    Yang, Jian
    Zhang, Shanshan
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XII, 2024, 14436 : 3 - 15
  • [32] Monocular 3D Object Detection for Autonomous Driving Based on Contextual Transformer
    She, Xiangyang
    Yan, Weijia
    Dong, Lihong
    Computer Engineering and Applications, 2024, 60 (19) : 178 - 189
  • [33] A Survey on Monocular 3D Object Detection Algorithms Based on Deep Learning
    Wu, Junhui
    Yin, Dong
    Chen, Jie
    Wu, Yusheng
    Si, Huiping
    Lin, Kaiyan
    2020 4TH INTERNATIONAL CONFERENCE ON MACHINE VISION AND INFORMATION TECHNOLOGY (CMVIT 2020), 2020, 1518
  • [34] Geometry-based Distance Decomposition for Monocular 3D Object Detection
    Shi, Xuepeng
    Ye, Qi
    Chen, Xiaozhi
    Chen, Chuangrong
    Chen, Zhixiang
    Kim, Tae-Kyun
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 15152 - 15161
  • [35] Learning Auxiliary Monocular Contexts Helps Monocular 3D Object Detection
    Liu, Xianpeng
    Xue, Nan
    Wu, Tianfu
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 1810 - 1818
  • [36] Keypoint-GraspNet: Keypoint-based 6-DoF Grasp Generation from the Monocular RGB-D input
    Chen, Yiye
    Lin, Yunzhi
    Xu, Ruinian
    Vela, Patricio A.
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 7988 - 7995
  • [37] SCPNet: Self-constrained parallelism network for keypoint-based lightweight object detection?
    Zhong, Xian
    Wang, Mengdie
    Liu, Wenxuan
    Yuan, Jingling
    Huang, Wenxin
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 90
  • [38] Monocular 3D Object Detection with Bounding Box Denoising in 3D by Perceiver
    Liu, Xianpeng
    Zheng, Ce
    Cheng, Kelvin
    Xue, Nan
    Qi, Guo-Jun
    Wu, Tianfu
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 6413 - 6423
  • [39] Progressive Coordinate Transforms for Monocular 3D Object Detection
    Wang, Li
    Zhang, Li
    Zhu, Yi
    Zhang, Zhi
    He, Tong
    Li, Mu
    Xue, Xiangyang
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [40] Exploring Geometric Consistency for Monocular 3D Object Detection
    Lian, Qing
    Ye, Botao
    Xu, Ruijia
    Yao, Weilong
    Zhang, Tong
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 1675 - 1684