MHSAN: Multi-view hierarchical self-attention network for 3D shape recognition

被引:4
|
作者
Cao, Jiangzhong [1 ]
Yu, Lianggeng [1 ]
Ling, Bingo Wing-Kuen [1 ]
Yao, Zijie [1 ]
Dai, Qingyun [2 ,3 ]
机构
[1] Guangdong Univ Technol, Sch Informat Engn, Guangzhou 510006, Peoples R China
[2] Guangdong Polytech Normal Univ, Guangzhou 510665, Peoples R China
[3] Guangdong Prov Key Lab Intellectual Property & Big, Guangzhou 510665, Peoples R China
基金
中国国家自然科学基金;
关键词
3D shape recognition; Self-attention; Multi-view learning; View aggregation;
D O I
10.1016/j.patcog.2024.110315
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-view learning has demonstrated promising performance for 3D shape recognition. However, existing multi-view methods usually focus on fusing multiple views and ignore the structural and discriminative information carried by 2D views. In this paper, we propose a multi-view hierarchical self-attention network (MHSAN) to explore the geometric and discriminative information from complex 2D views. Specifically, MHSAN consists of two self-attention networks. First, a global self-attention network is adopted to exploit the structure information by embedding position information of views. Then, the discriminative self-attention network learns discriminative information from the views with high classification scores. Through the proposed MHSAN, the geometric and discriminative information is condensed as the novel representation of 3D shapes. To validate the effectiveness of our proposed method, extensive experiments have been conducted on three 3D shape benchmarks. Experimental results demonstrate that our method is generally superior to the state-of-the-art methods in 3D shape classification and retrieval tasks.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Spatio-Temporal 3D Action Recognition with Hierarchical Self-Attention Mechanism
    Araei, Soheil
    Nadian-Ghomsheh, Ali
    2021 26TH INTERNATIONAL COMPUTER CONFERENCE, COMPUTER SOCIETY OF IRAN (CSICC), 2021,
  • [22] MVCLN: Multi-View Convolutional LSTM Network for Cross-Media 3D Shape Recognition
    Liang, Qi
    Wang, Yixin
    Nie, Weizhi
    Li, Qiang
    IEEE ACCESS, 2020, 8 : 139792 - 139802
  • [23] Multi-View Hierarchical Fusion Network for 3D Object Retrieval and Classification
    Liu, An-An
    Hu, Nian
    Song, Dan
    Guo, Fu-Bin
    Zhou, He-Yu
    Hao, Tong
    IEEE ACCESS, 2019, 7 : 153021 - 153030
  • [24] Multi-view 3D shape style transformation
    Xiuping Liu
    Hua Huang
    Weiming Wang
    Jun Zhou
    The Visual Computer, 2022, 38 : 669 - 684
  • [25] Multi-view 3D shape style transformation
    Liu, Xiuping
    Huang, Hua
    Wang, Weiming
    Zhou, Jun
    VISUAL COMPUTER, 2022, 38 (02): : 669 - 684
  • [26] MV-LFN: Multi-view based local information fusion network for 3D shape recognition
    Zhang, Jing
    Zhou, Dangdang
    Zhao, Yue
    Nie, Weizhi
    Su, Yuting
    VISUAL INFORMATICS, 2021, 5 (03) : 114 - 119
  • [27] Action Recognition with a Multi-View Temporal Attention Network
    Dengdi Sun
    Zhixiang Su
    Zhuanlian Ding
    Bin Luo
    Cognitive Computation, 2022, 14 : 1082 - 1095
  • [28] Action Recognition with a Multi-View Temporal Attention Network
    Sun, Dengdi
    Su, Zhixiang
    Ding, Zhuanlian
    Luo, Bin
    COGNITIVE COMPUTATION, 2022, 14 (03) : 1082 - 1095
  • [29] PVFAN: Point-view fusion attention network for 3D shape recognition
    Cao, Jiangzhong
    Liao, Siyi
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (05) : 8119 - 8133
  • [30] 3D Shape Completion with Multi-View Consistent Inference
    Hu, Tao
    Han, Zhizhong
    Zwicker, Matthias
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 10997 - 11004