Spatial Attention Frustum: A 3D Object Detection Method Focusing on Occluded Objects

Authors
He, Xinglei [1]
Zhang, Xiaohan [1]
Wang, Yichun [1]
Ji, Hongzeng [1]
Duan, Xiuhui [1]
Guo, Fen [1]
Affiliations
[1] Beijing Inst Technol, Sch Mech Engn, Beijing 100081, Peoples R China
Keywords
visual attention mechanism; occluded object detection; multi-sensor fusion; 3D object detection; autonomous vehicles; DEPTH
DOI
10.3390/s22062366
Chinese Library Classification number
O65 [Analytical Chemistry]
Discipline classification code
070302; 081704
Abstract
Accurately perceiving occluded objects is a challenging problem for autonomous vehicles. Human vision quickly locates the important object regions in a complex scene and analyses the remaining regions only roughly or ignores them; this behaviour is known as the visual attention mechanism. The perception system of an autonomous vehicle, by contrast, cannot tell which part of the point cloud lies in the region of interest. It is therefore worthwhile to explore how the visual attention mechanism can be used in autonomous-driving perception. In this paper, we propose the spatial attention frustum model to address object occlusion in 3D object detection. The spatial attention frustum suppresses unimportant features and allocates limited neural computing resources to the critical parts of the scene, thereby providing greater relevance and easier processing for higher-level perceptual reasoning tasks. To ensure that our method retains good reasoning ability for occluded objects of which only a partial structure is visible, we propose a local feature aggregation module that captures more complex local features of the point cloud. Finally, we discuss the projection constraint between the 3D bounding box and the 2D bounding box and propose a joint anchor box projection loss function, which helps improve the overall performance of our method. Results on the KITTI dataset show that the proposed method effectively improves the detection accuracy for occluded objects. For the car category, our method achieves 89.46%, 79.91% and 75.53% detection accuracy at the easy, moderate and hard difficulty levels, with a 6.97% improvement at the hard level, where the degree of occlusion is high. Our one-stage method does not rely on an additional refinement stage, yet its accuracy is comparable to that of two-stage methods.
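The projection constraint mentioned in the abstract rests on a simple geometric fact: the eight corners of a 3D box, projected into the image, should be enclosed by the 2D box of the same object. The sketch below illustrates only that relationship. It is not the paper's joint anchor box projection loss, whose exact form the abstract does not give. The function names, the pinhole intrinsics (`fx`, `fy`, `u0`, `v0`), the box parameterisation around its geometric centre, and the `1 - IoU` inconsistency score are all assumptions made for illustration.

```python
import math


def box3d_corners(center, dims, yaw):
    """Return the 8 corners of a 3D box in camera coordinates.

    Assumed convention (hypothetical, for illustration): center is the
    geometric centre (x, y, z), dims is (length, height, width), and yaw
    is the rotation about the camera's vertical (y) axis in radians.
    """
    cx, cy, cz = center
    length, height, width = dims
    c, s = math.cos(yaw), math.sin(yaw)
    corners = []
    for dx in (-length / 2, length / 2):
        for dy in (-height / 2, height / 2):
            for dz in (-width / 2, width / 2):
                # Rotate the corner offset about the y axis, then translate.
                x = cx + c * dx + s * dz
                z = cz - s * dx + c * dz
                corners.append((x, cy + dy, z))
    return corners


def project_to_2d_box(corners, fx, fy, u0, v0):
    """Project corners with a pinhole model; return (u_min, v_min, u_max, v_max)."""
    us, vs = [], []
    for x, y, z in corners:
        if z <= 0:
            raise ValueError("corner lies behind the camera")
        us.append(fx * x / z + u0)
        vs.append(fy * y / z + v0)
    return (min(us), min(vs), max(us), max(vs))


def iou_2d(a, b):
    """Intersection over union of two axis-aligned boxes (u_min, v_min, u_max, v_max)."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def projection_inconsistency(center, dims, yaw, box2d, fx, fy, u0, v0):
    """Illustrative score: 0 when the projected 3D box matches box2d exactly."""
    projected = project_to_2d_box(box3d_corners(center, dims, yaw), fx, fy, u0, v0)
    return 1.0 - iou_2d(projected, box2d)
```

For example, a box centred at `(0, 0, 10)` with dimensions `(4, 2, 2)` and zero yaw, viewed with `fx = fy = 90`, `u0 = 100`, `v0 = 50`, projects to the 2D box `(80, 40, 120, 60)`. A 2D detection with exactly that extent gives an inconsistency of 0, and the score grows as the two boxes drift apart.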
Pages: 18