PCT: Point cloud transformer

被引:22
|
作者
Meng-Hao Guo [1 ]
Jun-Xiong Cai [1 ]
Zheng-Ning Liu [1 ]
Tai-Jiang Mu [1 ]
Ralph R.Martin [2 ]
Shi-Min Hu [1 ]
机构
[1] BNRist, Department of Computer Science and Technology, Tsinghua University
[2] Cardiff University
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP391.41 [];
学科分类号
080203 ;
摘要
The irregular domain and lack of ordering make it challenging to design deep neural networks for point cloud processing. This paper presents a novel framework named Point Cloud Transformer(PCT) for point cloud learning. PCT is based on Transformer,which achieves huge success in natural language processing and displays great potential in image processing. It is inherently permutation invariant for processing a sequence of points, making it well-suited for point cloud learning. To better capture local context within the point cloud, we enhance input embedding with the support of farthest point sampling and nearest neighbor search. Extensive experiments demonstrate that the PCT achieves the state-of-the-art performance on shape classification, part segmentation, semantic segmentation,and normal estimation tasks.
引用
收藏
页码:187 / 199
页数:13
相关论文
共 50 条
  • [41] TDNet: transformer-based network for point cloud denoising
    Xu, Xueli
    Geng, Guohua
    Cao, Xin
    Li, Kang
    Zhou, Mingquan
    APPLIED OPTICS, 2022, 61 (06) : C80 - C88
  • [42] SWPT: Spherical Window-Based Point Cloud Transformer
    Guo, Xindong
    Sun, Yu
    Zhao, Rong
    Kuang, Liqun
    Han, Xie
    COMPUTER VISION - ACCV 2022, PT I, 2023, 13841 : 396 - 412
  • [43] MULTISCALE REPRESENTATIONS LEARNING TRANSFORMER FRAMEWORK FOR POINT CLOUD CLASSIFICATION
    Sun, Yajie
    Zia, Ali
    Zhou, Jun
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 3354 - 3358
  • [44] Neighborhood Multi-Compound Transformer for Point Cloud Registration
    Wang, Yong
    Zhou, Pengbo
    Geng, Guohua
    An, Li
    Li, Kang
    Li, Ruoxue
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (09) : 8469 - 8480
  • [45] An Enhanced Downsampling Transformer Network for Point Cloud Semantic Segmentation
    Wang, Yang
    Wei, Zixuan
    Wan, Zhibo
    ARTIFICIAL INTELLIGENCE AND ROBOTICS, ISAIR 2023, 2024, 1998 : 262 - 269
  • [46] Hybrid Cross-Transformer-KPConv for Point Cloud Segmentation
    Wen, Shuhuan
    Li, Pengjiang
    Zhang, Hong
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 126 - 130
  • [47] GSFORMER: GEOMETRIC-SPATIAL TRANSFORMER ON POINT CLOUD COMPLETION
    Long, Yijun
    Chen, Zhaoyu
    Lu, Hong
    Zhang, Wenqiang
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1175 - 1180
  • [48] GTPCR: Graph-Enhanced Transformer for Point Cloud Registration
    Chen, Kai
    Yao, Junfeng
    Li, Yuanhang
    Zhang, Han
    Shen, Huabo
    Qian, Quan
    Wu, Xing
    PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 1304 - 1309
  • [49] Position-Guided Point Cloud Panoptic Segmentation Transformer
    Xiao, Zeqi
    Zhang, Wenwei
    Wang, Tai
    Loy, Chen Change
    Lin, Dahua
    Pang, Jiangmiao
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2025, 133 (01) : 275 - 290
  • [50] Learning Key Features Transformer Network for Point Cloud Processing
    You, Guobang
    Hu, Yikun
    Liu, Yimei
    Liu, Haoyan
    Fan, Hao
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT II, 2024, 14426 : 295 - 306