MHSAN: Multi-view hierarchical self-attention network for 3D shape recognition

被引:4
|
作者
Cao, Jiangzhong [1 ]
Yu, Lianggeng [1 ]
Ling, Bingo Wing-Kuen [1 ]
Yao, Zijie [1 ]
Dai, Qingyun [2 ,3 ]
机构
[1] Guangdong Univ Technol, Sch Informat Engn, Guangzhou 510006, Peoples R China
[2] Guangdong Polytech Normal Univ, Guangzhou 510665, Peoples R China
[3] Guangdong Prov Key Lab Intellectual Property & Big, Guangzhou 510665, Peoples R China
基金
中国国家自然科学基金;
关键词
3D shape recognition; Self-attention; Multi-view learning; View aggregation;
D O I
10.1016/j.patcog.2024.110315
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-view learning has demonstrated promising performance for 3D shape recognition. However, existing multi-view methods usually focus on fusing multiple views and ignore the structural and discriminative information carried by 2D views. In this paper, we propose a multi-view hierarchical self-attention network (MHSAN) to explore the geometric and discriminative information from complex 2D views. Specifically, MHSAN consists of two self-attention networks. First, a global self-attention network is adopted to exploit the structure information by embedding position information of views. Then, the discriminative self-attention network learns discriminative information from the views with high classification scores. Through the proposed MHSAN, the geometric and discriminative information is condensed as the novel representation of 3D shapes. To validate the effectiveness of our proposed method, extensive experiments have been conducted on three 3D shape benchmarks. Experimental results demonstrate that our method is generally superior to the state-of-the-art methods in 3D shape classification and retrieval tasks.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Cross self-attention network for 3D point cloud
    Wang, Gaihua
    Zhai, Qianyu
    Liu, Hong
    KNOWLEDGE-BASED SYSTEMS, 2022, 247
  • [32] Multi-View Group Recommendation Integrating Self-Attention and Graph Convolution
    Wang, Yonggui
    Wang, Xinru
    Computer Engineering and Applications, 60 (08): : 287 - 295
  • [33] Contrastive Multi-View Learning for 3D Shape Clustering
    Peng, Bo
    Lin, Guoting
    Lei, Jianjun
    Qin, Tianyi
    Cao, Xiaochun
    Ling, Nam
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 6262 - 6272
  • [34] Learning View-Based Graph Convolutional Network for Multi-View 3D Shape Analysis
    Wei, Xin
    Yu, Ruixuan
    Sun, Jian
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) : 7525 - 7541
  • [35] 3D Point Cloud Recognition Based on a Multi-View Convolutional Neural Network
    Zhang, Le
    Sun, Jian
    Zheng, Qiang
    SENSORS, 2018, 18 (11)
  • [36] Multi-View 3D Shape Recognition via Correspondence-Aware Deep Learning
    Xu, Yong
    Zheng, Chaoda
    Xu, Ruotao
    Quan, Yuhui
    Ling, Haibin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 5299 - 5312
  • [37] A METHOD FOR COMPLETING MISSING 3D POINT CLOUD RECONSTRUCTED FROM AERIAL MULTI-VIEW IMAGES USING SELF-ATTENTION MECHANISM
    Kiyama, Takenobu
    Xie, Chun
    Shishido, Hidehiko
    Toriya, Hisatoshi
    Kitahara, Itaru
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 927 - 930
  • [38] Learning Relationships for Multi-View 3D Object Recognition
    Yang, Ze
    Wang, Liwei
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 7504 - 7513
  • [39] Multi-view Manhole Detection, Recognition, and 3D Localisation
    Timofte, Radu
    Van Gool, Luc
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCV WORKSHOPS), 2011,
  • [40] Multi-View Hierarchical Attention Graph Convolutional Network with Domain Adaptation for EEG Emotion Recognition
    Li, Chao
    Wang, Feng
    Bian, Ning
    PROCEEDINGS OF 2024 3RD INTERNATIONAL CONFERENCE ON CRYPTOGRAPHY, NETWORK SECURITY AND COMMUNICATION TECHNOLOGY, CNSCT 2024, 2024, : 624 - 630