MNAT-Net: Multi-Scale Neighborhood Aggregation Transformer Network for Point Cloud Classification and Segmentation

被引:4
|
作者
Wang, Xuchu [1 ]
Yuan, Yue [2 ]
机构
[1] Chongqing Univ, Coll Optoelect Engn, Minist Educ, Key Lab Optoelect Technol & Syst, Chongqing 400044, Peoples R China
[2] Chongqing Univ, Coll Optoelect Engn, Chongqing 400044, Peoples R China
关键词
Point cloud; classification; segmentation; multi-scale neighborhood feature aggregation; transformer;
D O I
10.1109/TITS.2024.3373507
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Accurate understanding of 3D objects in complex scenes plays essential roles in the fields of intelligent transportation and autonomous driving technology. Recent deep neural networks have made significant progress in 3D visual tasks by using point cloud data. However, the acquisition of geometric features and the expression of local fine-grained features in point clouds are still not sufficient for the classification and segmentation tasks. Inspired by the application of transformer structures in 2D and 3D computer vision tasks, in this paper, a multi-scale neighborhood aggregation transformer network (MNAT-Net) is proposed for point cloud classification and segmentation, which captures the global semantic information and local geometric structure features of point clouds by aggregating the receptive field and node weights. MNAT-Net consists of three key components, namely the multi-scale neighborhood feature aggregation module, the global transformer module and the category-weighted focal loss. The neighborhood features learned by the MNAT-Net network is sent to the global transformer module to fully enrich the contextual representation. Experimental results show that MNAT-Net achieves competitive performance on publicly available ModelNet40, ShapeNet, S3DIS and SemanticKITTI data sets in comparison to related methods.
引用
收藏
页码:9153 / 9167
页数:15
相关论文
共 50 条
  • [41] MLGTM: Multi-Scale Local Geometric Transformer-Mamba Application in Terracotta Warriors Point Cloud Classification
    Zhou, Pengbo
    An, Li
    Wang, Yong
    Geng, Guohua
    REMOTE SENSING, 2024, 16 (16)
  • [42] MLFNet- Point Cloud Semantic Segmentation Convolution Network Based on Multi-Scale Feature Fusion
    Yang, Jingfang
    Zou, Bochang
    Qiu, Huadong
    Li, Zhi
    IEEE ACCESS, 2021, 9 : 44950 - 44962
  • [43] MVPNet: A multi-scale voxel-point adaptive fusion network for point cloud semantic segmentation in urban scenes
    Li, Huchen
    Guan, Haiyan
    Ma, Lingfei
    Lei, Xiangda
    Yu, Yongtao
    Wang, Hanyun
    Delavar, Mahmoud Reza
    Li, Jonathan
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2023, 122
  • [44] Multi-Scale Local Context Embedding for LiDAR Point Cloud Classification
    Huang, Rong
    Hong, Danfeng
    Xu, Yusheng
    Yao, Wei
    Stilla, Uwe
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (04) : 721 - 725
  • [45] Classification of LiDAR Point Cloud based on Multi-scale Features and PointNet
    Zhao Zhongyang
    Cheng Yinglei
    Shi Xiaosong
    Qin Xianxiang
    Sun Li
    2018 EIGHTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA), 2018, : 57 - 63
  • [46] MSATNet: multi-scale adaptive transformer network for motor imagery classification
    Hu, Lingyan
    Hong, Weijie
    Liu, Lingyu
    FRONTIERS IN NEUROSCIENCE, 2023, 17
  • [47] Mix-layers semantic extraction and multi-scale aggregation transformer for semantic segmentation
    Li, Tianping
    Yang, Xiaolong
    Zhang, Zhenyi
    Cui, Zhaotong
    Maoxia, Zhou
    COMPLEX & INTELLIGENT SYSTEMS, 2025, 11 (01)
  • [48] MSFANet: Multi-Scale Strip Feature Attention Network for Cloud and Cloud Shadow Segmentation
    Chen, Kai
    Dai, Xin
    Xia, Min
    Weng, Liguo
    Hu, Kai
    Lin, Haifeng
    REMOTE SENSING, 2023, 15 (19)
  • [49] MpMsCFMA-Net: Multi-path Multi-scale Context Feature Mixup and Aggregation Network for medical image segmentation
    Che, Miao
    Wu, Zongfei
    Zhang, Jiahao
    Liu, Xilin
    Zhang, Shuai
    Liu, Yifei
    Feng, Shu
    Wu, Yongfei
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [50] P-Swin: Parallel Swin transformer multi-scale semantic segmentation network for land cover classification
    Wang, Di
    Yang, Ronghao
    Zhang, Zhenxin
    Liu, Hanhu
    Tan, Junxiang
    Li, Shaoda
    Yang, Xiaoxia
    Wang, Xiao
    Tang, Kangqi
    Qiao, Yichun
    Su, Po
    COMPUTERS & GEOSCIENCES, 2023, 175