Spatial Attention Frustum: A 3D Object Detection Method Focusing on Occluded Objects

被引:0
|
作者
He, Xinglei [1 ]
Zhang, Xiaohan [1 ]
Wang, Yichun [1 ]
Ji, Hongzeng [1 ]
Duan, Xiuhui [1 ]
Guo, Fen [1 ]
机构
[1] Beijing Inst Technol, Sch Mech Engn, Beijing 100081, Peoples R China
关键词
visual attention mechanism; occluded object detection; multi-sensor fusion; 3D object detection; autonomous vehicles; DEPTH;
D O I
10.3390/s22062366
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Achieving the accurate perception of occluded objects for autonomous vehicles is a challenging problem. Human vision can always quickly locate important object regions in complex external scenes, while other regions are only roughly analysed or ignored, defined as the visual attention mechanism. However, the perception system of autonomous vehicles cannot know which part of the point cloud is in the region of interest. Therefore, it is meaningful to explore how to use the visual attention mechanism in the perception system of autonomous driving. In this paper, we propose the model of the spatial attention frustum to solve object occlusion in 3D object detection. The spatial attention frustum can suppress unimportant features and allocate limited neural computing resources to critical parts of the scene, thereby providing greater relevance and easier processing for higher-level perceptual reasoning tasks. To ensure that our method maintains good reasoning ability when faced with occluded objects with only a partial structure, we propose a local feature aggregation module to capture more complex local features of the point cloud. Finally, we discuss the projection constraint relationship between the 3D bounding box and the 2D bounding box and propose a joint anchor box projection loss function, which will help to improve the overall performance of our method. The results of the KITTI dataset show that our proposed method can effectively improve the detection accuracy of occluded objects. Our method achieves 89.46%, 79.91% and 75.53% detection accuracy in the easy, moderate, and hard difficulty levels of the car category, and achieves a 6.97% performance improvement especially in the hard category with a high degree of occlusion. Our one-stage method does not need to rely on another refining stage, comparable to the accuracy of the two-stage method.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] FocalFormer3D: Focusing on Hard Instance for 3D Object Detection
    Chen, Yilun
    Yu, Zhiding
    Chen, Yukang
    Lan, Shiyi
    Anandkumar, Anima
    Jia, Jiaya
    Alvarez, Jose M.
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 8360 - 8371
  • [22] 3D Object Detection Algorithm Based on the Reconstruction of Sparse Point Clouds in the Viewing Frustum
    Xu, Xing
    Wu, Xiang
    Zhao, Yun
    Lue, Xiaoshu
    Aapaoja, Aki
    MOBILE INFORMATION SYSTEMS, 2022, 2022
  • [23] Frustum FusionNet: Amodal 3D Object Detection with Multi-Modal Feature Fusion
    Zuo, Liangyu
    Li, Yaochen
    Han, Mengtao
    Li, Qiao
    Liu, Yuehu
    2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 2746 - 2751
  • [24] Faraway-Frustum: Dealing with Lidar Sparsity for 3D Object Detection using Fusion
    Zhang, Haolin
    Yang, Dongfang
    Yurtsever, Ekim
    Redmill, Keith A.
    Ozguner, Umit
    2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 2646 - 2652
  • [25] High Dimensional Frustum PointNet for 3D Object Detection from Camera, LiDAR, and Radar
    Wang, Leichen
    Chen, Tianbai
    Anklam, Carsten
    Goldluecke, Bastian
    2020 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2020, : 1615 - 1622
  • [26] PointGAT: Graph attention networks for 3D object detection
    Zhou H.
    Wang W.
    Liu G.
    Zhou Q.
    Intelligent and Converged Networks, 2022, 3 (02): : 204 - 216
  • [27] A Taxonomy of 3D Occluded Objects Recognition Techniques
    Soleimanizadeh, Shiva
    Mohamad, Dzulkifli
    Saba, Tanzila
    Al-ghamdi, Jarallah Saleh
    3D RESEARCH, 2016, 7 (01) : 1 - 14
  • [28] VISTA: Boosting 3D Object Detection via Dual Cross-VIew SpaTial Attention
    Deng, Shengheng
    Liang, Zhihao
    Sun, Lin
    Jia, Kui
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8438 - 8447
  • [29] A Single-Stage 3D Object Detection Method Based on Sparse Attention Mechanism
    Jia, Songche
    Zhang, Zhenyu
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT III, 2024, 14427 : 414 - 425
  • [30] Robust 3D Object Detection for Moving Objects Based on PointPillars
    Nakamura, Ryota
    Enokida, Shuichi
    2022 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW 2022), 2022, : 611 - 617