Point Transformer-Based Salient Object Detection Network for 3-D Measurement Point Clouds

被引:3
|
作者
Wei, Zeyong [1 ,2 ]
Chen, Baian [1 ,2 ]
Wang, Weiming [3 ]
Chen, Honghua [1 ,2 ]
Wei, Mingqiang [1 ,2 ]
Li, Jonathan [4 ,5 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Sch Comp Sci & Technol, Nanjing 210016, Peoples R China
[2] Nanjing Univ Aeronaut & Astronaut, Shenzhen Inst Res, Shenzhen 518038, Peoples R China
[3] Hong Kong Metropolitan Univ, Sch Sci & Technol, Hong Kong, Peoples R China
[4] Univ Waterloo, Dept Geog & Environm Management, Waterloo, ON N2L 3G1, Canada
[5] Univ Waterloo, Dept Syst Design Engn, Waterloo, ON N2L 3G1, Canada
基金
中国国家自然科学基金;
关键词
Transformers; Feature extraction; Point cloud compression; Three-dimensional displays; Object detection; Task analysis; Semantics; 3-D measurement point cloud; 3-D salient object detection (SOD); point transformer; PSOD-Net; IMAGE;
D O I
10.1109/TGRS.2024.3355968
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
While salient object detection (SOD) on 2-D images has been extensively studied, there is very little SOD work on 3-D measurement surfaces. We propose an effective point transformer-based SOD network for 3-D measurement point clouds, termed PSOD-Net. PSOD-Net is an encoder-decoder network that takes full advantage of transformers to model the contextual information in both multiscale point- and scenewise manners. In the encoder, we develop a point context transformer (PCT) module to capture region contextual features at the point level; PCT contains two different transformers to excavate the relationship among points. In the decoder, we develop a scene context transformer (SCT) module to learn context representations at the scene level; SCT contains both upsampling-and-transformer (UT) blocks and multicontext aggregation (MCA) units to integrate the global semantic and multilevel features from the encoder into the global scene context. Experiments show clear improvements of PSOD-Net over its competitors and validate that PSOD-Net is more robust to challenging cases such as small objects, multiple objects, and objects with complex structures.
引用
收藏
页码:1 / 11
页数:11
相关论文
共 50 条
  • [1] Salient Object Detection for Point Clouds
    Fan, Songlin
    Gao, Wei
    Li, Ge
    COMPUTER VISION - ECCV 2022, PT XXVIII, 2022, 13688 : 1 - 19
  • [2] DTSSD: Dual-Channel Transformer-Based Network for Point-Based 3D Object Detection
    Zheng, Zhijie
    Huang, Zhicong
    Zhao, Jingwen
    Hu, Haifeng
    Chen, Dihu
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 798 - 802
  • [3] Transformer-based difference fusion network for RGB-D salient object detection
    Cui, Zhi-Qiang
    Wang, Feng
    Feng, Zheng-Yong
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (06)
  • [4] TANet: Transformer-based asymmetric network for RGB-D salient object detection
    Liu, Chang
    Yang, Gang
    Wang, Shuo
    Wang, Hangxu
    Zhang, Yunhua
    Wang, Yutao
    IET COMPUTER VISION, 2023, 17 (04) : 415 - 430
  • [5] Semantic Consistency Reasoning for 3-D Object Detection in Point Clouds
    Wei, Wenwen
    Wei, Ping
    Liao, Zhimin
    Qin, Jialu
    Cheng, Xiang
    Liu, Meiqin
    Zheng, Nanning
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 14
  • [6] Fast 3-D Urban Object Detection on Streaming Point Clouds
    Boercs, Attila
    Nagy, Balazs
    Benedek, Csaba
    COMPUTER VISION - ECCV 2014 WORKSHOPS, PT II, 2015, 8926 : 628 - 639
  • [7] Weakly Supervised Point Clouds Transformer for 3D Object Detection
    Tang, Zuojin
    Sun, Bo
    Ma, Tongwei
    Li, Daosheng
    Xu, Zhenhui
    2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2022, : 3948 - 3955
  • [8] Transformer-based Cross Reference Network for video salient object detection
    Huang, Kan
    Tian, Chunwei
    Su, Jingyong
    Lin, Jerry Chun-Wei
    PATTERN RECOGNITION LETTERS, 2022, 160 : 122 - 127
  • [9] Clusterformer: Cluster-based Transformer for 3D Object Detection in Point Clouds
    Pei, Yu
    Zhao, Xian
    Li, Hao
    Ma, Jingyuan
    Zhang, Jingwei
    Pu, Shiliang
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 6641 - 6650
  • [10] CasA: A Cascade Attention Network for 3-D Object Detection From LiDAR Point Clouds
    Wu, Hai
    Deng, Jinhao
    Wen, Chenglu
    Li, Xin
    Wang, Cheng
    Li, Jonathan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60