Point Transformer-Based Salient Object Detection Network for 3-D Measurement Point Clouds

被引:3
|
作者
Wei, Zeyong [1 ,2 ]
Chen, Baian [1 ,2 ]
Wang, Weiming [3 ]
Chen, Honghua [1 ,2 ]
Wei, Mingqiang [1 ,2 ]
Li, Jonathan [4 ,5 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Sch Comp Sci & Technol, Nanjing 210016, Peoples R China
[2] Nanjing Univ Aeronaut & Astronaut, Shenzhen Inst Res, Shenzhen 518038, Peoples R China
[3] Hong Kong Metropolitan Univ, Sch Sci & Technol, Hong Kong, Peoples R China
[4] Univ Waterloo, Dept Geog & Environm Management, Waterloo, ON N2L 3G1, Canada
[5] Univ Waterloo, Dept Syst Design Engn, Waterloo, ON N2L 3G1, Canada
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
基金
中国国家自然科学基金;
关键词
Transformers; Feature extraction; Point cloud compression; Three-dimensional displays; Object detection; Task analysis; Semantics; 3-D measurement point cloud; 3-D salient object detection (SOD); point transformer; PSOD-Net; IMAGE;
D O I
10.1109/TGRS.2024.3355968
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
While salient object detection (SOD) on 2-D images has been extensively studied, there is very little SOD work on 3-D measurement surfaces. We propose an effective point transformer-based SOD network for 3-D measurement point clouds, termed PSOD-Net. PSOD-Net is an encoder-decoder network that takes full advantage of transformers to model the contextual information in both multiscale point- and scenewise manners. In the encoder, we develop a point context transformer (PCT) module to capture region contextual features at the point level; PCT contains two different transformers to excavate the relationship among points. In the decoder, we develop a scene context transformer (SCT) module to learn context representations at the scene level; SCT contains both upsampling-and-transformer (UT) blocks and multicontext aggregation (MCA) units to integrate the global semantic and multilevel features from the encoder into the global scene context. Experiments show clear improvements of PSOD-Net over its competitors and validate that PSOD-Net is more robust to challenging cases such as small objects, multiple objects, and objects with complex structures.
引用
收藏
页码:1 / 11
页数:11
相关论文
共 50 条
  • [41] TransNet: Transformer-Based Point Cloud Sampling Network
    Lee, Hookyung
    Jeon, Jaeseung
    Hong, Seokjin
    Kim, Jeesu
    Yoo, Jinwoo
    SENSORS, 2023, 23 (10)
  • [42] TDNet: transformer-based network for point cloud denoising
    Xu, Xueli
    Geng, Guohua
    Cao, Xin
    Li, Kang
    Zhou, Mingquan
    APPLIED OPTICS, 2022, 61 (06) : C80 - C88
  • [43] Attentional PointNet for 3D-Object Detection in Point Clouds
    Paigwar, Anshul
    Erkent, Ozgur
    Wolf, Christian
    Laugier, Christian
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 1297 - 1306
  • [44] 3-D Object Detection With Balanced Prediction Based on Contrastive Point Loss
    Tong, Jiaxun
    Liu, Kaiqi
    Bai, Xia
    Li, Wei
    IEEE SENSORS JOURNAL, 2024, 24 (04) : 4969 - 4977
  • [45] PTT: Point-Track-Transformer Module for 3D Single Object Tracking in Point Clouds
    Shan, Jiayao
    Zhou, Sifan
    Fang, Zheng
    Cui, Yubo
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 1310 - 1316
  • [46] Fast 3-D object symmetry detection for point cloud
    Ruchay, Alexey
    Kalschikov, Vsevolod
    Gridnev, Alexey
    Guo, Hao
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XLIV, 2021, 11842
  • [47] Boundary-Aware and Semiautomatic Segmentation of 3-D Object in Point Clouds
    Luo, Huan
    Zheng, Quan
    Wang, Cheng
    Guo, Wenzhong
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2021, 18 (05) : 910 - 914
  • [48] An Integrated Framework for 3-D Modeling, Object Detection, and Pose Estimation From Point-Clouds
    Guo, Yulan
    Bennamoun, Mohammed
    Sohel, Ferdous
    Lu, Min
    Wan, Jianwei
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2015, 64 (03) : 683 - 693
  • [49] A 3-D point clouds scanning and registration methodology for automatic object digitization
    Chen, Liang-Chia
    Dinh-Cuong Hoang
    Lin, Hsien-, I
    Thanh-Hung Nguyen
    SMART SCIENCE, 2016, 4 (01) : 1 - 7
  • [50] RGB-D Point Cloud Registration Based on Salient Object Detection
    Wan, Teng
    Du, Shaoyi
    Cui, Wenting
    Yao, Runzhao
    Ge, Yuyan
    Li, Ce
    Gao, Yue
    Zheng, Nanning
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (08) : 3547 - 3559