MHSAN: Multi-view hierarchical self-attention network for 3D shape recognition

Cited by: 4
Authors
Cao, Jiangzhong [1]
Yu, Lianggeng [1]
Ling, Bingo Wing-Kuen [1]
Yao, Zijie [1]
Dai, Qingyun [2, 3]
Affiliations
[1] Guangdong Univ Technol, Sch Informat Engn, Guangzhou 510006, Peoples R China
[2] Guangdong Polytech Normal Univ, Guangzhou 510665, Peoples R China
[3] Guangdong Prov Key Lab Intellectual Property & Big, Guangzhou 510665, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
3D shape recognition; Self-attention; Multi-view learning; View aggregation;
DOI
10.1016/j.patcog.2024.110315
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Multi-view learning has demonstrated promising performance for 3D shape recognition. However, existing multi-view methods usually focus on fusing multiple views and ignore the structural and discriminative information carried by 2D views. In this paper, we propose a multi-view hierarchical self-attention network (MHSAN) to explore the geometric and discriminative information from complex 2D views. Specifically, MHSAN consists of two self-attention networks. First, a global self-attention network is adopted to exploit the structure information by embedding position information of views. Then, the discriminative self-attention network learns discriminative information from the views with high classification scores. Through the proposed MHSAN, the geometric and discriminative information is condensed as the novel representation of 3D shapes. To validate the effectiveness of our proposed method, extensive experiments have been conducted on three 3D shape benchmarks. Experimental results demonstrate that our method is generally superior to the state-of-the-art methods in 3D shape classification and retrieval tasks.
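The two-stage design described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the single-head attention, the random weights, the linear view classifier `w_cls`, the `top_k` cutoff, the choice to feed the first stage's outputs into the second stage, and the mean-pool-and-concatenate descriptor are all assumptions made for illustration, as are every function and parameter name.

```python
import math
import random

def softmax(row):
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]

def matmul(a, b):
    # (n x k) @ (k x m) -> (n x m), with matrices stored as lists of rows.
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def self_attention(x, wq, wk, wv):
    # Single-head scaled dot-product self-attention over the rows of x.
    q, k, v = matmul(x, wq), matmul(x, wk), matmul(x, wv)
    scale = math.sqrt(len(q[0]))
    k_t = [list(col) for col in zip(*k)]
    weights = [softmax([s / scale for s in row]) for row in matmul(q, k_t)]
    return matmul(weights, v)

def mean_pool(x):
    return [sum(col) / len(x) for col in zip(*x)]

def rand_matrix(rows, cols, rng):
    return [[rng.gauss(0.0, 0.1) for _ in range(cols)] for _ in range(rows)]

def mhsan_sketch(views, pos_emb, w_global, w_disc, w_cls, top_k):
    # Stage 1 (global): add a position embedding to each view feature, then
    # let every view attend to every other view.
    x = [[f + p for f, p in zip(v, e)] for v, e in zip(views, pos_emb)]
    g = self_attention(x, *w_global)
    # Score each view by its highest class probability under a hypothetical
    # linear classifier (assumption: the abstract does not specify the scorer).
    view_scores = [max(softmax(row)) for row in matmul(g, w_cls)]
    # Stage 2 (discriminative): attend only over the top_k highest-scoring views.
    kept = sorted(range(len(g)), key=lambda i: view_scores[i], reverse=True)[:top_k]
    d = self_attention([g[i] for i in kept], *w_disc)
    # Assumed aggregation: concatenate the mean-pooled outputs of both stages.
    return mean_pool(g) + mean_pool(d), kept

rng = random.Random(0)
n_views, dim, n_classes, top_k = 12, 8, 4, 6
views = rand_matrix(n_views, dim, rng)      # stand-in for per-view CNN features
pos_emb = rand_matrix(n_views, dim, rng)    # stand-in for learned view positions
w_global = tuple(rand_matrix(dim, dim, rng) for _ in range(3))
w_disc = tuple(rand_matrix(dim, dim, rng) for _ in range(3))
w_cls = rand_matrix(dim, n_classes, rng)

descriptor, kept = mhsan_sketch(views, pos_emb, w_global, w_disc, w_cls, top_k)
print(len(descriptor), kept)
```

With trained weights in place of the random ones, `descriptor` would play the role of the shape representation used for classification or retrieval; here it only shows how the tensors flow through the two stages.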
Pages: 12
Related papers
50 in total
  • [1] Multi-view 3D Reconstruction with Self-attention
    Qian, Qiuting
    2021 14TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER THEORY AND ENGINEERING (ICACTE 2021), 2021, : 20 - 26
  • [2] Multi-View 3D Reconstruction Method Based on Self-Attention Mechanism
    Zhu, Guangzhao
    Bo, Wei
    Yang, Afeng
    Xin, Xu
    LASER & OPTOELECTRONICS PROGRESS, 2023, 60 (16)
  • [3] Multi-view dual attention network for 3D object recognition
    Wenju Wang
    Yu Cai
    Tao Wang
    Neural Computing and Applications, 2022, 34 : 3201 - 3212
  • [4] Multi-view dual attention network for 3D object recognition
    Wang, Wenju
    Cai, Yu
    Wang, Tao
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (04): : 3201 - 3212
  • [5] MVPN: Multi-View Prototype Network for 3D Shape Recognition
    Wu, Zizhao
    Yang, Ping
    Wang, Yigang
    IEEE ACCESS, 2019, 7 : 130363 - 130372
  • [6] MVTN: Multi-View Transformation Network for 3D Shape Recognition
    Hamdi, Abdullah
    Giancola, Silvio
    Ghanem, Bernard
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 1 - 11
  • [7] Multi-view Moments Embedding Network for 3D Shape Recognition
    Xiao, Jun
    Zhang, Yuanxing
    Zhao, Pengyu
    Xiao, Kecheng
    Bian, Kaigui
    Zhang, Chunli
    Yan, Wei
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 2257 - 2260
  • [8] Hierarchical Graph Attention Based Multi-View Convolutional Neural Network for 3D Object Recognition
    Zeng, Hui
    Zhao, Tianmeng
    Cheng, Ruting
    Wang, Fuzhou
    Liu, Jiwei
    IEEE ACCESS, 2021, 9 (09): : 33323 - 33335
  • [9] MULTI-VIEW SELF-ATTENTION BASED TRANSFORMER FOR SPEAKER RECOGNITION
    Wang, Rui
    Ao, Junyi
    Zhou, Long
    Liu, Shujie
    Wei, Zhihua
    Ko, Tom
    Li, Qing
    Zhang, Yu
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6732 - 6736
  • [10] Multi-view self-attention networks
    Xu, Mingzhou
    Yang, Baosong
    Wong, Derek F.
    Chao, Lidia S.
    KNOWLEDGE-BASED SYSTEMS, 2022, 241