Monocular 3D object detection for distant objects

Authors
Li, Jiahao [1]
Han, Xiaohong [1]
Affiliations
[1] Taiyuan Univ Technol, Coll Comp Sci & Technol Coll Data Sci, Taiyuan, Peoples R China
Keywords
autonomous driving; computer vision; monocular three-dimensional object detection
DOI
10.1117/1.JEI.33.3.033021
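Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract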
Autonomous driving represents the future of transportation, and the precise detection of three-dimensional (3D) objects is a fundamental requirement for achieving autonomous driving capabilities. Presently, 3D object detection primarily relies on sensors, such as monocular cameras, stereo cameras, and LiDAR technology. In comparison to stereo cameras and LiDAR, monocular 3D object detection offers the advantages of a wider field of view and reduced cost. However, the existing monocular 3D object detection techniques exhibit limitations in terms of accuracy, particularly when detecting distant objects. To tackle this challenge, we introduce an innovative approach for monocular 3D object detection, specifically tailored for distant objects. The proposed method classifies objects into distant and nearby categories based on the initial depth estimation, employing distinct feature enhancement and refinement modules for each category. Subsequently, it extracts 3D features and, ultimately, derives precise 3D detection bounding boxes. Experimental results using the KITTI dataset demonstrate that this approach substantially enhances the detection accuracy of distant objects while preserving the detection efficacy for nearby objects.
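The routing step described in the abstract (split candidates by an initial depth estimate, then send each group to its own refinement module) might be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the `Candidate` type, the function names, and the 40 m cutoff are all hypothetical, and the abstract does not say what threshold or refinement modules the paper actually uses.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple, TypeVar

T = TypeVar("T")

# Hypothetical cutoff in metres; the abstract does not state the value used.
DISTANCE_THRESHOLD_M = 40.0


@dataclass
class Candidate:
    """A detected object with a coarse depth from the initial estimation stage."""
    object_id: int
    initial_depth_m: float


def split_by_depth(
    candidates: List[Candidate],
    threshold_m: float = DISTANCE_THRESHOLD_M,
) -> Tuple[List[Candidate], List[Candidate]]:
    """Partition candidates into (distant, nearby) by their initial depth."""
    distant = [c for c in candidates if c.initial_depth_m >= threshold_m]
    nearby = [c for c in candidates if c.initial_depth_m < threshold_m]
    return distant, nearby


def route(
    candidates: List[Candidate],
    refine_distant: Callable[[Candidate], T],
    refine_nearby: Callable[[Candidate], T],
    threshold_m: float = DISTANCE_THRESHOLD_M,
) -> List[T]:
    """Apply a category-specific refinement callable to each candidate.

    The callables stand in for the per-category feature enhancement and
    refinement modules; distant results are returned first, then nearby.
    """
    distant, nearby = split_by_depth(candidates, threshold_m)
    return [refine_distant(c) for c in distant] + [refine_nearby(c) for c in nearby]
```

In a real detector the two callables would be learned network branches operating on image features rather than plain Python functions; the sketch only shows the depth-based dispatch.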
Pages: 16