Point Transformer-Based Salient Object Detection Network for 3-D Measurement Point Clouds

被引：3

作者：

Wei, Zeyong ^{[1
,2
]}

Chen, Baian ^{[1
,2
]}

Wang, Weiming ^{[3
]}

Chen, Honghua ^{[1
,2
]}

Wei, Mingqiang ^{[1
,2
]}

Li, Jonathan ^{[4
,5
]}

机构：

[1] Nanjing Univ Aeronaut & Astronaut, Sch Comp Sci & Technol, Nanjing 210016, Peoples R China

[2] Nanjing Univ Aeronaut & Astronaut, Shenzhen Inst Res, Shenzhen 518038, Peoples R China

[3] Hong Kong Metropolitan Univ, Sch Sci & Technol, Hong Kong, Peoples R China

[4] Univ Waterloo, Dept Geog & Environm Management, Waterloo, ON N2L 3G1, Canada

[5] Univ Waterloo, Dept Syst Design Engn, Waterloo, ON N2L 3G1, Canada

来源：

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷

基金：

中国国家自然科学基金;

关键词：

Transformers; Feature extraction; Point cloud compression; Three-dimensional displays; Object detection; Task analysis; Semantics; 3-D measurement point cloud; 3-D salient object detection (SOD); point transformer; PSOD-Net; IMAGE;

D O I：

10.1109/TGRS.2024.3355968

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

While salient object detection (SOD) on 2-D images has been extensively studied, there is very little SOD work on 3-D measurement surfaces. We propose an effective point transformer-based SOD network for 3-D measurement point clouds, termed PSOD-Net. PSOD-Net is an encoder-decoder network that takes full advantage of transformers to model the contextual information in both multiscale point- and scenewise manners. In the encoder, we develop a point context transformer (PCT) module to capture region contextual features at the point level; PCT contains two different transformers to excavate the relationship among points. In the decoder, we develop a scene context transformer (SCT) module to learn context representations at the scene level; SCT contains both upsampling-and-transformer (UT) blocks and multicontext aggregation (MCA) units to integrate the global semantic and multilevel features from the encoder into the global scene context. Experiments show clear improvements of PSOD-Net over its competitors and validate that PSOD-Net is more robust to challenging cases such as small objects, multiple objects, and objects with complex structures.

引用

页码：1 / 11

页数：11

共 50 条

[1] Salient Object Detection for Point Clouds
Fan, Songlin
Gao, Wei
Li, Ge
COMPUTER VISION - ECCV 2022, PT XXVIII, 2022, 13688 : 1 - 19
[2] DTSSD: Dual-Channel Transformer-Based Network for Point-Based 3D Object Detection
Zheng, Zhijie
Huang, Zhicong
Zhao, Jingwen
Hu, Haifeng
Chen, Dihu
IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 798 - 802
[3] Transformer-based difference fusion network for RGB-D salient object detection
Cui, Zhi-Qiang
Wang, Feng
Feng, Zheng-Yong
JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (06)
[4] TANet: Transformer-based asymmetric network for RGB-D salient object detection
Liu, Chang
Yang, Gang
Wang, Shuo
Wang, Hangxu
Zhang, Yunhua
Wang, Yutao
IET COMPUTER VISION, 2023, 17 (04) : 415 - 430
[5] Semantic Consistency Reasoning for 3-D Object Detection in Point Clouds
Wei, Wenwen
Wei, Ping
Liao, Zhimin
Qin, Jialu
Cheng, Xiang
Liu, Meiqin
Zheng, Nanning
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 14
[6] Fast 3-D Urban Object Detection on Streaming Point Clouds
Boercs, Attila
Nagy, Balazs
Benedek, Csaba
COMPUTER VISION - ECCV 2014 WORKSHOPS, PT II, 2015, 8926 : 628 - 639
[7] Weakly Supervised Point Clouds Transformer for 3D Object Detection
Tang, Zuojin
Sun, Bo
Ma, Tongwei
Li, Daosheng
Xu, Zhenhui
2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2022, : 3948 - 3955
[8] Transformer-based Cross Reference Network for video salient object detection
Huang, Kan
Tian, Chunwei
Su, Jingyong
Lin, Jerry Chun-Wei
PATTERN RECOGNITION LETTERS, 2022, 160 : 122 - 127
[9] Clusterformer: Cluster-based Transformer for 3D Object Detection in Point Clouds
Pei, Yu
Zhao, Xian
Li, Hao
Ma, Jingyuan
Zhang, Jingwei
Pu, Shiliang
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 6641 - 6650
[10] CasA: A Cascade Attention Network for 3-D Object Detection From LiDAR Point Clouds
Wu, Hai
Deng, Jinhao
Wen, Chenglu
Li, Xin
Wang, Cheng
Li, Jonathan
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60

← 1 2 3 4 5 →