MNAT-Net: Multi-Scale Neighborhood Aggregation Transformer Network for Point Cloud Classification and Segmentation

被引:4
|
作者
Wang, Xuchu [1 ]
Yuan, Yue [2 ]
机构
[1] Chongqing Univ, Coll Optoelect Engn, Minist Educ, Key Lab Optoelect Technol & Syst, Chongqing 400044, Peoples R China
[2] Chongqing Univ, Coll Optoelect Engn, Chongqing 400044, Peoples R China
关键词
Point cloud; classification; segmentation; multi-scale neighborhood feature aggregation; transformer;
D O I
10.1109/TITS.2024.3373507
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Accurate understanding of 3D objects in complex scenes plays essential roles in the fields of intelligent transportation and autonomous driving technology. Recent deep neural networks have made significant progress in 3D visual tasks by using point cloud data. However, the acquisition of geometric features and the expression of local fine-grained features in point clouds are still not sufficient for the classification and segmentation tasks. Inspired by the application of transformer structures in 2D and 3D computer vision tasks, in this paper, a multi-scale neighborhood aggregation transformer network (MNAT-Net) is proposed for point cloud classification and segmentation, which captures the global semantic information and local geometric structure features of point clouds by aggregating the receptive field and node weights. MNAT-Net consists of three key components, namely the multi-scale neighborhood feature aggregation module, the global transformer module and the category-weighted focal loss. The neighborhood features learned by the MNAT-Net network is sent to the global transformer module to fully enrich the contextual representation. Experimental results show that MNAT-Net achieves competitive performance on publicly available ModelNet40, ShapeNet, S3DIS and SemanticKITTI data sets in comparison to related methods.
引用
收藏
页码:9153 / 9167
页数:15
相关论文
共 50 条
  • [1] Multi-Scale Neighborhood Feature Extraction and Aggregation for Point Cloud Segmentation
    Li, Dawei
    Shi, Guoliang
    Wu, Yuhao
    Yang, Yanping
    Zhao, Mingbo
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (06) : 2175 - 2191
  • [2] MSP-Net: Multi-Scale Point Cloud Classification Network
    Bai J.
    Xu H.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2019, 31 (11): : 1917 - 1924
  • [3] Multi-Scale Attentive Aggregation for LiDAR Point Cloud Segmentation
    Geng, Xiaoxiao
    Ji, Shunping
    Lu, Meng
    Zhao, Lingli
    REMOTE SENSING, 2021, 13 (04) : 1 - 12
  • [4] MLMS-Net: A Point Cloud Classification Network with Multi-Level and Multi-Scale
    Xue D.
    Cheng Y.
    Wen P.
    Yu W.
    Qin X.
    Cheng, Yinglei, 1600, Xi'an Jiaotong University (54): : 70 - 78
  • [5] Dilated Multi-scale Fusion for Point Cloud Classification and Segmentation
    Fan Guo
    Qingquan Ren
    Jin Tang
    Zhiyong Li
    Multimedia Tools and Applications, 2022, 81 : 6069 - 6090
  • [6] Dilated Multi-scale Fusion for Point Cloud Classification and Segmentation
    Guo, Fan
    Ren, Qingquan
    Tang, Jin
    Li, Zhiyong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (05) : 6069 - 6090
  • [7] Micro-Gear Point Cloud Segmentation Based on Multi-Scale Point Transformer
    Su, Yizhou
    Wang, Xunwei
    Qi, Guanghao
    Lei, Baozhen
    APPLIED SCIENCES-BASEL, 2024, 14 (10):
  • [8] MSNet: Multi-Scale Convolutional Network for Point Cloud Classification
    Wang, Lei
    Huang, Yuchun
    Shan, Jie
    He, Liu
    REMOTE SENSING, 2018, 10 (04)
  • [9] Multi-scale learnable key-channel attention network for point cloud classification and segmentation
    Zhao, Jie
    Liu, Yian
    Wu, Bin
    APPLIED SOFT COMPUTING, 2024, 159
  • [10] Multi-scale strip pooling feature aggregation network for cloud and cloud shadow segmentation
    Chen Lu
    Min Xia
    Haifeng Lin
    Neural Computing and Applications, 2022, 34 : 6149 - 6162