MATNet: Semantic segmentation of 3D point clouds with multiscale adaptive transformer

被引:1
|
作者
Zheng, Yufei [1 ]
Lu, Jian [1 ]
Chen, Xiaogai [1 ]
Zhang, Kaibing [1 ]
Zhou, Jian [1 ]
机构
[1] Xian Polytech Univ, Sch Elect & Informat, Xian 710600, Peoples R China
基金
中国国家自然科学基金;
关键词
3D point cloud; Semantic segmentation; Multiscale; Selfattention; Transformer;
D O I
10.1016/j.compeleceng.2024.109526
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, the Transformer model has made significant progress in semantic segmentation tasks. However, existing self-attention mechanisms perform well in capturing remote dependencies and global features, but ignore local area information in point cloud data and have limitations in dealing with multi-scale features. To address this problem, this paper introduces a multiscale self-attention fusion (MSA) module, which adaptively fuses features within different scale neighborhoods and learns global contextual features by connecting local neighborhoods. Then, the multiscale channel aggregation module (MCA)is used to perform deep point-by-point and point-by-point convolution of the point cloud channel, aggregating channel features at multiple scales to extract more accurate local feature information. Finally, in this study, the multiscale adaptive fusion (MSA) module and the multiscale channel aggregation (MCA) module form a sequential network structure that adaptively and dynamically adjusts different scales of point cloud objects to enhance the perception of objects of different sizes for better segmentation performance. By testing and validating the model on the publicly available S3DIS Area 5 dataset and the ScantNetV2 dataset, the model achieves mIoU index values of 71.9% and 72.3%, respectively, which demonstrates the effectiveness and superiority of the proposed method. Code will be made publicly available at https://github.com/Cocoyufei/MAT/tree/master.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Graph Transformer for 3D point clouds classification and semantic segmentation
    Zhou, Wei
    Wang, Qian
    Jin, Weiwei
    Shi, Xinzhe
    He, Ying
    COMPUTERS & GRAPHICS-UK, 2024, 124
  • [2] SEGCloud: Semantic Segmentation of 3D Point Clouds
    Tchapmi, Lyne P.
    Choy, Christopher B.
    Armeni, Iro
    Gwak, JunYoung
    Savarese, Silvio
    PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2017, : 537 - 547
  • [3] U-shaped network based on Transformer for 3D point clouds semantic segmentation
    Zhang, Jiazhe
    Li, Xingwei
    Zhao, Xianfa
    Ge, Yizhi
    Zhang, Zheng
    2021 THE 5TH INTERNATIONAL CONFERENCE ON VIDEO AND IMAGE PROCESSING, ICVIP 2021, 2021, : 170 - 176
  • [4] Point attention network for semantic segmentation of 3D point clouds
    Feng, Mingtao
    Zhang, Liang
    Lin, Xuefei
    Gilani, Syed Zulqarnain
    Mian, Ajmal
    PATTERN RECOGNITION, 2020, 107 (107)
  • [5] GrowSP: Unsupervised Semantic Segmentation of 3D Point Clouds
    Zhang, Zihui
    Yang, Bo
    Wang, Bing
    Li, Bo
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17619 - 17629
  • [6] Semantic Classification of 3D Point Clouds with Multiscale Spherical Neighborhoods
    Thomas, Hugues
    Deschaud, Jean-Emmanuel
    Marcotegui, Beatriz
    Goulette, Francois
    Le Gall, Yann
    2018 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2018, : 390 - 398
  • [7] Transformer for 3D Point Clouds
    Wang, Jiayun
    Chakraborty, Rudrasis
    Yu, Stella X.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (08) : 4419 - 4431
  • [8] Exploring Spatial Context for 3D Semantic Segmentation of Point Clouds
    Engelmann, Francis
    Kontogianni, Theodora
    Hermans, Alexander
    Leibe, Bastian
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 716 - 724
  • [9] GECNN for Weakly Supervised Semantic Segmentation of 3D Point Clouds
    He, Zifen
    Zhu, Shouye
    Huang, Ying
    Zhang, Yinhui
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2021, E104D (12) : 2237 - 2243
  • [10] Global Context Reasoning for Semantic Segmentation of 3D Point Clouds
    Ma, Yanni
    Guo, Yulan
    Liu, Hao
    Lei, Yinjie
    Wen, Gongjian
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 2920 - 2929