CDAF3D: Cross-Dimensional Attention Fusion for Indoor 3D Object Detection

被引:0
|
作者
Wang, Shilin [1 ]
Huang, Hai [1 ]
Zhu, Yueyan [1 ]
Tang, Zhenqi [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing 100876, Peoples R China
基金
国家重点研发计划;
关键词
Indoor 3D Object Detection; Fusion Features; Point Cloud;
D O I
10.1007/978-981-97-8493-6_12
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D object detection is a crucial task in computer vision and autonomous systems, which is widely utilized in robotics, autonomous driving, and augmented reality. With the advancement of input devices, researchers propose to use multimodal information to improve the detection accuracy. However, integrating 2D and 3D features effectively to harness their complementary nature for detection tasks is still a challenge. In this paper, we note that the complementary nature of geometric and visual texture information can effectively strengthen feature fusion, which plays a key role in detection. To this end, we propose the Cross-Dimensional Attention Fusion-based indoor 3D object detection method (CDAF3D). This method dynamically learns geometric information with corresponding 2D image texture details through a cross-dimensional attention mechanism, enabling the model to capture and integrate spatial and textural information effectively. Additionally, due to the nature of 3D object detection, where intersecting entities with different specific labels are unrealistic, we further propose Preventive 3D Intersect Loss (P3DIL). This loss enhances detection accuracy by addressing intersections between objects of different labels. We evaluate the proposed CDAF3D on the SUN RGB-D and Scannet v2 datasets. Our results achieve 78.2 mAP@0.25 and 66.5 mAP@0.50 on ScanNetV2 and 70.3 mAP@0.25 and 54.1 mAP@0.50 on SUN RGB-D. The proposed CDAF3D outperforms all the multi-sensor-based methods with 3D IoU thresholds of 0.25 and 0.5.
引用
收藏
页码:165 / 177
页数:13
相关论文
共 50 条
  • [31] SPGroup3D: Superpoint Grouping Network for Indoor 3D Object Detection
    Zhu, Yun
    Hui, Le
    Shen, Yaqi
    Xie, Jin
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 7811 - 7819
  • [32] BCAF-3D: Bilateral Content Awareness Fusion for cross-modal 3D object detection
    Chen, Mu
    Liu, Pengfei
    Zhao, Huaici
    KNOWLEDGE-BASED SYSTEMS, 2023, 279
  • [33] Improving 3D Object Detection with Context-Aware and Dimensional Interaction Attention
    Jing Zhou
    Zixin Gong
    Junchi Zhang
    Neural Processing Letters, 56
  • [34] Improving 3D Object Detection with Context-Aware and Dimensional Interaction Attention
    Zhou, Jing
    Gong, Zixin
    Zhang, Junchi
    NEURAL PROCESSING LETTERS, 2024, 56 (01)
  • [35] CAF-RCNN: multimodal 3D object detection with cross-attention
    Liu, Junting
    Liu, Deer
    Zhu, Lei
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (19) : 6131 - 6146
  • [36] Cross-Modality 3D Object Detection
    Zhu, Ming
    Ma, Chao
    Ji, Pan
    Yang, Xiaokang
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 3771 - 3780
  • [37] Cross-Supervised LiDAR-Camera Fusion for 3D Object Detection
    Zuo, Chao Jie
    Gu, Cao Yu
    Guo, Yi Kun
    Miao, Xiao Dong
    IEEE ACCESS, 2025, 13 : 10447 - 10458
  • [38] PointGAT: Graph attention networks for 3D object detection
    Zhou H.
    Wang W.
    Liu G.
    Zhou Q.
    Intelligent and Converged Networks, 2022, 3 (02): : 204 - 216
  • [39] Point-Level Fusion and Channel Attention for 3D Object Detection in Autonomous Driving
    Shen, Juntao
    Fang, Zheng
    Huang, Jin
    SENSORS, 2025, 25 (04)
  • [40] SRFDet3D: Sparse Region Fusion based 3D Object Detection
    Erabati, Gopi Krishna
    Araujo, Helder
    NEUROCOMPUTING, 2024, 593