ARM3D: Attention-based relation module for indoor 3D object detection

被引:0
|
作者
Yuqing Lan [1 ]
Yao Duan [1 ]
Chenyi Liu [1 ]
Chenyang Zhu [1 ]
Yueshan Xiong [1 ]
Hui Huang [2 ]
Kai Xu [1 ]
机构
[1] College of Computer, National University of Defense Technology
[2] Shenzhen University
基金
国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP391.41 [];
学科分类号
080203 ;
摘要
Relation contexts have been proved to be useful for many challenging vision tasks. In the field of3D object detection, previous methods have been taking the advantage of context encoding, graph embedding, or explicit relation reasoning to extract relation contexts.However, there exist inevitably redundant relation contexts due to noisy or low-quality proposals. In fact,invalid relation contexts usually indicate underlying scene misunderstanding and ambiguity, which may,on the contrary, reduce the performance in complex scenes. Inspired by recent attention mechanism like Transformer, we propose a novel 3D attention-based relation module(ARM3D). It encompasses objectaware relation reasoning to extract pair-wise relation contexts among qualified proposals and an attention module to distribute attention weights towards different relation contexts. In this way, ARM3D can take full advantage of the useful relation contexts and filter those less relevant or even confusing contexts,which mitigates the ambiguity in detection. We have evaluated the effectiveness of ARM3D by plugging it into several state-of-the-art 3D object detectors and showing more accurate and robust detection results. Extensive experiments show the capability and generalization of ARM3D on 3D object detection. Our source code is available at https://github.com/lanlan96/ARM3D.
引用
收藏
页码:395 / 414
页数:20
相关论文
共 50 条
  • [41] 3D Object Detection Based on Voxel Self-Attention Auxiliary Networks
    Cao, Jie
    Peng, Yiqiang
    Fan, Likang
    Wang, Longfei
    LASER & OPTOELECTRONICS PROGRESS, 2024, 61 (24)
  • [42] BAFusion: Bidirectional Attention Fusion for 3D Object Detection Based on LiDAR and Camera
    Liu, Min
    Jia, Yuanjun
    Lyu, Youhao
    Dong, Qi
    Yang, Yanyu
    SENSORS, 2024, 24 (14)
  • [43] 3D Object Detection Based on Attention and Multi-Scale Feature Fusion
    Liu, Minghui
    Ma, Jinming
    Zheng, Qiuping
    Liu, Yuchen
    Shi, Gang
    SENSORS, 2022, 22 (10)
  • [44] MEAN: An attention-based approach for 3D mesh shape classification
    Jicheng Dai
    Rubin Fan
    Yupeng Song
    Qing Guo
    Fazhi He
    The Visual Computer, 2024, 40 : 2987 - 3000
  • [45] A Multi-Modal Attention-Based Approach for Points of Interest Detection on 3D Shapes
    Shu, Zhenyu
    Yu, Junlong
    Chao, Kai
    Xin, Shiqing
    Liu, Ligang
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2025, 31 (03) : 1698 - 1712
  • [46] Spatio-Temporal Attention-Based LSTM Networks for 3D Action Recognition and Detection
    Song, Sijie
    Lan, Cuiling
    Xing, Junliang
    Zeng, Wenjun
    Liu, Jiaying
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (07) : 3459 - 3471
  • [47] MEAN: An attention-based approach for 3D mesh shape classification
    Dai, Jicheng
    Fan, Rubin
    Song, Yupeng
    Guo, Qing
    He, Fazhi
    VISUAL COMPUTER, 2024, 40 (04): : 2987 - 3000
  • [48] Attention-Based 3D Human Pose Sequence Refinement Network
    Kim, Do-Yeop
    Chang, Ju-Yong
    SENSORS, 2021, 21 (13)
  • [49] Attention-Based 3D Neural Architectures for Predicting Cracks in Designs
    Iyer, Naresh
    Raghavan, Sathyanarayanan
    Zhang, Yiming
    Jiao, Yang
    Robinson, Dean
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT I, 2021, 12891 : 179 - 190
  • [50] A robust 3D unique descriptor for 3D object detection
    Joshi, Piyush
    Rastegarpanah, Alireza
    Stolkin, Rustam
    PATTERN ANALYSIS AND APPLICATIONS, 2024, 27 (03)