Lite-FPN for keypoint-based monocular 3D object detection

被引:9
|
作者
Yang, Lei [1 ,2 ]
Zhang, Xinyu [1 ,2 ]
Li, Jun [1 ,2 ]
Wang, Li [1 ,2 ]
Zhu, Minghan [3 ]
Zhu, Lei [4 ]
机构
[1] Tsinghua Univ, State Key Lab Automot Safety & Energy, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Sch Vehicle & Mobil, Beijing 100084, Peoples R China
[3] Univ Michigan, Ann Arbor, MI USA
[4] Mogo Auto Intelligence & Telemetics Informat Techn, Beijing 100084, Peoples R China
基金
国家高技术研究发展计划(863计划); 中国国家自然科学基金;
关键词
Monocular 3D object detection; Multi-scale feature fusion; Lite-FPN; Autonomous driving;
D O I
10.1016/j.knosys.2023.110517
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D object detection with a single image is an essential and challenging task for autonomous driving. Multi-scale feature fusion is effective for keypoint-based monocular 3D object detectors to boost performance within a large range of scales and distances. However, the existing FPN modules inevitably increase latency owing to the further extraction and merging operations on multi-scale feature maps. In this paper, we propose a lightweight feature pyramid network called Lite-FPN for keypoint-based monocular 3D object detectors that perform multi-scale feature fusion only at sparsely distributed keypoint locations. Besides, to alleviate the misalignment between classification score and localization precision, we propose an effective regression loss named attention loss, which assigns predictions with misaligned classification score and localization precision larger weights in the training stage. Extensive experiments based on several state-of-the-art keypoint-based detectors on the KITTI and nuScenes datasets show that our proposed methods manage to achieve significant accuracy improvements. Meanwhile, the enhanced SMOKE with our Lite-FPN module surpasses the baseline enhanced by the classic FPN over 19 FPS.(c) 2023 Elsevier B.V. All rights reserved.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Keypoint3D: Keypoint-Based and Anchor-Free 3D Object Detection for Autonomous Driving with Monocular Vision
    Li, Zhen
    Gao, Yuliang
    Hong, Qingqing
    Du, Yuren
    Serikawa, Seiichi
    Zhang, Lifeng
    REMOTE SENSING, 2023, 15 (05)
  • [2] KPDet: Keypoint-based 3D object detection with Parametric Radius Learning
    Huang, Yuhao
    Zhou, Sanping
    Yan, Xinrui
    Zheng, Nanning
    NEUROCOMPUTING, 2024, 572
  • [3] Monocular 3D Object Detection Based on Pseudo Multimodal Information Extraction and Keypoint Estimation
    Zhao, Dan
    Ji, Chaofeng
    Liu, Guizhong
    APPLIED SCIENCES-BASEL, 2023, 13 (03):
  • [4] Local Keypoint-Based Image Detector with Object Detection
    Grycuk, Rafal
    Scherer, Magdalena
    Voloshynovskiy, Sviatoslav
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2017, PT I, 2017, 10245 : 507 - 517
  • [5] Gradient Corner Pooling for Keypoint-Based Object Detection
    Li, Xuyang
    Xie, Xuemei
    Yu, Mingxuan
    Luo, Jiakai
    Rao, Chengwei
    Shi, Guangming
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 1460 - 1467
  • [6] Hubless keypoint-based 3D deformable groupwise registration
    Agier, R.
    Valette, S.
    Kechichian, R.
    Fanton, L.
    Prost, R.
    MEDICAL IMAGE ANALYSIS, 2020, 59
  • [7] SMOKE: Single-Stage Monocular 3D Object Detection via Keypoint Estimation
    Liu, Zechen
    Wu, Zizhang
    Toth, Roland
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 4289 - 4298
  • [8] CSP-Lite: Real-Time and Efficient Keypoint-Based Pedestrian Detection
    Jia, Yisong
    Pan, Huihui
    Wang, Jue
    Sun, Weichao
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, : 1627 - 1637
  • [9] A Monocular 3D Object Detection Algorithm with Multi-Keypoint Constraints and Depth Estimation Assistance
    Zheng, Jin
    Wang, Sen
    Li, Hang
    Zhou, Yu-Hai
    Jisuanji Xuebao/Chinese Journal of Computers, 2024, 47 (12): : 2803 - 2818
  • [10] DYNAMIC KEYPOINT-BASED ALGORITHM OF OBJECT TRACKING
    Morgacheva, A. I.
    Kulikov, V. A.
    Kosykh, V. P.
    INTERNATIONAL WORKSHOP PHOTOGRAMMETRIC AND COMPUTER VISION TECHNIQUES FOR VIDEO SURVEILLANCE, BIOMETRICS AND BIOMEDICINE, 2017, 42-2 (W4): : 79 - 82