Spatial Attention Frustum: A 3D Object Detection Method Focusing on Occluded Objects

被引:0
|
作者
He, Xinglei [1 ]
Zhang, Xiaohan [1 ]
Wang, Yichun [1 ]
Ji, Hongzeng [1 ]
Duan, Xiuhui [1 ]
Guo, Fen [1 ]
机构
[1] Beijing Inst Technol, Sch Mech Engn, Beijing 100081, Peoples R China
关键词
visual attention mechanism; occluded object detection; multi-sensor fusion; 3D object detection; autonomous vehicles; DEPTH;
D O I
10.3390/s22062366
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Achieving the accurate perception of occluded objects for autonomous vehicles is a challenging problem. Human vision can always quickly locate important object regions in complex external scenes, while other regions are only roughly analysed or ignored, defined as the visual attention mechanism. However, the perception system of autonomous vehicles cannot know which part of the point cloud is in the region of interest. Therefore, it is meaningful to explore how to use the visual attention mechanism in the perception system of autonomous driving. In this paper, we propose the model of the spatial attention frustum to solve object occlusion in 3D object detection. The spatial attention frustum can suppress unimportant features and allocate limited neural computing resources to critical parts of the scene, thereby providing greater relevance and easier processing for higher-level perceptual reasoning tasks. To ensure that our method maintains good reasoning ability when faced with occluded objects with only a partial structure, we propose a local feature aggregation module to capture more complex local features of the point cloud. Finally, we discuss the projection constraint relationship between the 3D bounding box and the 2D bounding box and propose a joint anchor box projection loss function, which will help to improve the overall performance of our method. The results of the KITTI dataset show that our proposed method can effectively improve the detection accuracy of occluded objects. Our method achieves 89.46%, 79.91% and 75.53% detection accuracy in the easy, moderate, and hard difficulty levels of the car category, and achieves a 6.97% performance improvement especially in the hard category with a high degree of occlusion. Our one-stage method does not need to rely on another refining stage, comparable to the accuracy of the two-stage method.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] A Deep Model of Visual Attention for Saliency Detection on 3D Objects
    Rouhafzay, Ghazal
    Cretu, Ana-Maria
    Payeur, Pierre
    NEURAL PROCESSING LETTERS, 2023, 55 (07) : 8847 - 8867
  • [32] A Deep Model of Visual Attention for Saliency Detection on 3D Objects
    Ghazal Rouhafzay
    Ana-Maria Cretu
    Pierre Payeur
    Neural Processing Letters, 2023, 55 : 8847 - 8867
  • [33] Semi-supervised 3D object detection based on frustum transformation and RGB voxel grid
    Wang, Yan
    Yuan, Tiantian
    Hu, Bin
    Li, Yao
    Hongwai yu Jiguang Gongcheng/Infrared and Laser Engineering, 2024, 53 (08):
  • [34] A Frustum-based probabilistic framework for 3D object detection by fusion of LiDAR and camera data
    Gong, Zheng
    Lin, Haojia
    Zhang, Dedong
    Luo, Zhipeng
    Zelek, John
    Chen, Yiping
    Nurunnabi, Abdul
    Wang, Cheng
    Li, Jonathan
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 159 : 90 - 100
  • [35] 3D-FFS: Faster 3D object detection with Focused Frustum Search in sensor fusion based networks
    Ganguly, Aniruddha
    Ishmam, Tasin
    Islam, Khandker Aftarul
    Rahman, Md Zahidur
    Bayzid, Md Shamsuzzoha
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 6848 - 6853
  • [36] Investigating Attention Mechanism in 3D Point Cloud Object Detection
    Qiu, Shi
    Wu, Yunfan
    Anwar, Saeed
    Li, Chongyi
    2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 403 - 412
  • [37] Design of Class in Unknown Object Segmentation Focusing on 3D Object Detection in Depth Image
    Amemiya, Tatsuya
    Tasaki, Tsuyoshi
    2021 IEEE/SICE INTERNATIONAL SYMPOSIUM ON SYSTEM INTEGRATION (SII), 2021, : 706 - 707
  • [38] Attention-based Proposals Refinement for 3D Object Detection
    Minh-Quan Dao
    Hery, Elwan
    Fremont, Vincent
    2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 197 - 205
  • [39] ARPNET: attention region proposal network for 3D object detection
    Yangyang Ye
    Chi Zhang
    Xiaoli Hao
    Science China Information Sciences, 2019, 62
  • [40] 3D Object Detection with Attention: Shell-Based Modeling
    Zhang X.
    Zhao Z.
    Sun W.
    Cui Q.
    Computer Systems Science and Engineering, 2023, 46 (01): : 537 - 550