MNAT-Net: Multi-Scale Neighborhood Aggregation Transformer Network for Point Cloud Classification and Segmentation

Cited by: 4
Authors
Wang, Xuchu [1]
Yuan, Yue [2]
Affiliations
[1] Chongqing Univ, Coll Optoelect Engn, Minist Educ, Key Lab Optoelect Technol & Syst, Chongqing 400044, Peoples R China
[2] Chongqing Univ, Coll Optoelect Engn, Chongqing 400044, Peoples R China
Keywords
Point cloud; classification; segmentation; multi-scale neighborhood feature aggregation; transformer
DOI
10.1109/TITS.2024.3373507
Chinese Library Classification (CLC) number
TU [Building Science]
Discipline classification code
0813
Abstract
Accurate understanding of 3D objects in complex scenes plays an essential role in intelligent transportation and autonomous driving. Recent deep neural networks have made significant progress on 3D visual tasks using point cloud data. However, the acquisition of geometric features and the expression of local fine-grained features in point clouds are still insufficient for classification and segmentation. Inspired by the application of transformer structures to 2D and 3D computer vision tasks, this paper proposes a multi-scale neighborhood aggregation transformer network (MNAT-Net) for point cloud classification and segmentation, which captures the global semantic information and local geometric structure features of point clouds by aggregating the receptive field and node weights. MNAT-Net consists of three key components: the multi-scale neighborhood feature aggregation module, the global transformer module, and the category-weighted focal loss. The neighborhood features learned by the network are sent to the global transformer module to enrich the contextual representation. Experimental results show that MNAT-Net achieves competitive performance on the publicly available ModelNet40, ShapeNet, S3DIS, and SemanticKITTI datasets in comparison with related methods.
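The abstract names a category-weighted focal loss but does not give its formula. As a rough illustration only, the sketch below shows the commonly used class-weighted form of focal loss for a single sample, -w_c (1 - p_t)^gamma log(p_t). The function names, the per-class weight list, and the default gamma = 2.0 are assumptions made for this example and are not taken from the paper; the authors' actual loss may differ.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of raw class scores."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def class_weighted_focal_loss(logits, target, class_weights, gamma=2.0):
    """Generic class-weighted focal loss for one sample (hypothetical helper).

    logits        : raw scores, one per class
    target        : index of the true class
    class_weights : assumed per-class weights, e.g. larger for rare classes
    gamma         : focusing parameter; gamma = 0 gives weighted cross-entropy
    """
    p_t = softmax(logits)[target]  # predicted probability of the true class
    return -class_weights[target] * (1.0 - p_t) ** gamma * math.log(p_t)
```

With gamma set to 0 and all weights equal to 1, the expression reduces to ordinary cross-entropy. Increasing gamma shrinks the contribution of samples that are already classified with high confidence, and the per-class weight scales the loss of under-represented categories.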
Pages: 9153-9167
Number of pages: 15
Related papers
50 in total
  • [31] CGMA-Net: Cross-Level Guidance and Multi-Scale Aggregation Network for Polyp Segmentation
    Zheng, Jianwei
    Yan, Yidong
    Zhao, Liang
    Pan, Xiang
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (03) : 1424 - 1435
  • [32] MFCTrans-net: a Multi-scale Fusion and Channel Transformer Net for Retinal Vessel Segmentation
    Li, Zhuo
    Li, Biyuan
    Zhang, Jun
    Mei, Jianqiang
    Li, Binghui
    FOURTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING, ICGIP 2022, 2022, 12705
  • [33] LPMSNet: Location Pooling Multi-Scale Network for Cloud and Cloud Shadow Segmentation
    Dai, Xin
    Chen, Kai
    Xia, Min
    Weng, Liguo
    Lin, Haifeng
    REMOTE SENSING, 2023, 15 (16)
  • [34] SSPU-Net: A Structure Sensitive Point Cloud Upsampling Network with Multi-Scale Spatial Refinement
    Wang, Jin
    Chen, Jiade
    Shi, Yunhui
    Ling, Nam
    Yin, Baocai
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 1546 - 1555
  • [35] Point Set Multi-Level Aggregation Feature Extraction Based on Multi-Scale Max Pooling and LDA for Point Cloud Classification
    Tong, Guofeng
    Li, Yong
    Zhang, Weilong
    Chen, Dong
    Zhang, Zhenxin
    Yang, Jingchao
    Zhang, Jianjun
    REMOTE SENSING, 2019, 11 (23)
  • [36] BMCS-Net: A Bi-directional multi-scale cascaded segmentation network based on transformer-guided feature Aggregation for medical images
    Li, Bicao
    Wang, Jing
    Wang, Bei
    Shao, Zhuhong
    Li, Wei
    Huang, Jie
    Li, Panpan
    COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 180
  • [37] EMS-Net: Enhanced Multi-Scale Network for Polyp Segmentation
    Wang, Miao
    An, Xingwei
    Li, Yuhao
    Li, Ning
    Hang, Wei
    Liu, Gang
    2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 2936 - 2939
  • [38] RETINAL VESSEL SEGMENTATION VIA A SEMANTICS AND MULTI-SCALE AGGREGATION NETWORK
    Xu, Rui
    Ye, Xinchen
    Jiang, Guiliang
    Liu, Tiantian
    Li, Liang
    Tanaka, Satoshi
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 1085 - 1089
  • [39] MSMANet: A multi-scale mesh aggregation network for brain tumor segmentation
    Zhang, Yan
    Lu, Yao
    Chen, Wankun
    Chang, Yankang
    Gu, Haiming
    Yu, Bin
    APPLIED SOFT COMPUTING, 2021, 110
  • [40] Multi-Scale Feature Aggregation Network for Semantic Segmentation of Land Cover
    Shen, Xu
    Weng, Liguo
    Xia, Min
    Lin, Haifeng
    REMOTE SENSING, 2022, 14 (23)