MATNet: Semantic segmentation of 3D point clouds with multiscale adaptive transformer

被引:1
|
作者
Zheng, Yufei [1 ]
Lu, Jian [1 ]
Chen, Xiaogai [1 ]
Zhang, Kaibing [1 ]
Zhou, Jian [1 ]
机构
[1] Xian Polytech Univ, Sch Elect & Informat, Xian 710600, Peoples R China
基金
中国国家自然科学基金;
关键词
3D point cloud; Semantic segmentation; Multiscale; Selfattention; Transformer;
D O I
10.1016/j.compeleceng.2024.109526
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, the Transformer model has made significant progress in semantic segmentation tasks. However, existing self-attention mechanisms perform well in capturing remote dependencies and global features, but ignore local area information in point cloud data and have limitations in dealing with multi-scale features. To address this problem, this paper introduces a multiscale self-attention fusion (MSA) module, which adaptively fuses features within different scale neighborhoods and learns global contextual features by connecting local neighborhoods. Then, the multiscale channel aggregation module (MCA)is used to perform deep point-by-point and point-by-point convolution of the point cloud channel, aggregating channel features at multiple scales to extract more accurate local feature information. Finally, in this study, the multiscale adaptive fusion (MSA) module and the multiscale channel aggregation (MCA) module form a sequential network structure that adaptively and dynamically adjusts different scales of point cloud objects to enhance the perception of objects of different sizes for better segmentation performance. By testing and validating the model on the publicly available S3DIS Area 5 dataset and the ScantNetV2 dataset, the model achieves mIoU index values of 71.9% and 72.3%, respectively, which demonstrates the effectiveness and superiority of the proposed method. Code will be made publicly available at https://github.com/Cocoyufei/MAT/tree/master.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Efficient Convolutions for Real-Time Semantic Segmentation of 3D Point Clouds
    Zhang, Chris
    Luo, Wenjie
    Urtasun, Raquel
    2018 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2018, : 399 - 408
  • [32] Know What Your Neighbors Do: 3D Semantic Segmentation of Point Clouds
    Engelmann, Francis
    Kontogianni, Theodora
    Schult, Jonas
    Leibe, Bastian
    COMPUTER VISION - ECCV 2018 WORKSHOPS, PT III, 2019, 11131 : 395 - 409
  • [33] Semantic segmentation of sparsely annotated 3D point clouds by pseudo-labelling
    Xu, Katie
    Yao, Yasuhiro
    Murasaki, Kazuhiko
    Ando, Shingo
    Sagata, Atsushi
    2019 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2019), 2019, : 463 - 471
  • [34] Dense Supervision Propagation for Weakly Supervised Semantic Segmentation on 3D Point Clouds
    Wei, Jiacheng
    Lin, Guosheng
    Yap, Kim-Hui
    Liu, Fayao
    Hung, Tzu-Yi
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (06) : 4367 - 4377
  • [35] Generative Zero-Shot Learning for Semantic Segmentation of 3D Point Clouds
    Michele, Bjorn
    Boulch, Alexandre
    Puy, Gilles
    Bucher, Maxime
    Marlet, Renaud
    2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 992 - 1002
  • [36] Augmented Edge Graph Convolutional Networks for Semantic Segmentation of 3D Point Clouds
    Zhang Lujian
    Bi Yuanwei
    Liu Yaowen
    Huang Yansen
    LASER & OPTOELECTRONICS PROGRESS, 2024, 61 (08)
  • [37] Joint Semantic-Instance Segmentation of 3D Point Clouds: Instance Separation and Semantic Fusion
    Zhong, Min
    Zeng, Gang
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 6616 - 6623
  • [38] FAT: FIELD-AWARE TRANSFORMER FOR 3D POINT CLOUD SEMANTIC SEGMENTATION
    Zhou, Junjie
    Xiong, Yongping
    Chiu, Chinwai
    Liu, Fangyu
    Gong, Xiangyang
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 660 - 664
  • [39] Efficient 3D Semantic Segmentation with Superpoint Transformer
    Robert, Damien
    Raguet, Hugo
    Landrieu, Loic
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 17149 - 17158
  • [40] DAPS3D: Domain Adaptive Projective Segmentation of 3D LiDAR Point Clouds
    Klokov, Alexey A.
    Pak, Di Un
    Khorin, Aleksandr
    Yudin, Dmitry A.
    Kochiev, Leon
    Luchinskiy, Vladimir D.
    Bezuglyj, Vitaly D.
    IEEE ACCESS, 2023, 11 : 79341 - 79356