A novel 3D shape recognition method based on double-channel attention residual network

被引:6
|
作者
Ma, Ziping [1 ]
Zhou, Jie [2 ]
Ma, Jinlin [2 ]
Li, Tingting [2 ]
机构
[1] North Minzu Univ, Coll Math & Informat Sci, Yinchuan 750021, Ningxia, Peoples R China
[2] North Minzu Univ, Coll Comp Sci & Engn, Yinchuan 750021, Ningxia, Peoples R China
基金
中国国家自然科学基金;
关键词
3D shape recognition; Residual; Multi-head self-attention; Weighted loss function; CONVOLUTIONAL NEURAL-NETWORK; POINT CLOUD; RETRIEVAL; CLASSIFICATION;
D O I
10.1007/s11042-022-12041-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Learning 3D features by deep networks has achieved a successful performance up to now. However, data imbalance and low-resolution voxels still remain and influence the performance of 3D shape recognition. To resolve these issues, we propose double-channel attention residual network (double-RVCNN) as a novel deep network model with residual structure based on multi-head self-attention mechanism. Double-channel structure adopts double channels to input data including voxels and 3D Radon feature matrices, aiming to fully utilize the local and global features. The multi-head self-attention mechanism can integrate the relatively important contents of the input data through multiple heads structure, which can enrich the information processing ability and stabilize the training process of our network. Residual structure with cross-entropy loss and center loss as weighted loss function can avoid information loss to a great extent. Experimental results show that the values of mean average precision (MAP) are 83.31% and 74.04%, the values of classification accuracy are 90.53% and 85.09% on ModelNet10 and ModelNet40 datasets respectively, which demonstrates that our method performs a better 3D shape recognition accuracy than compared methods on test datasets.
引用
收藏
页码:32519 / 32548
页数:30
相关论文
共 50 条
  • [31] Res3ATN-Deep 3D Residual Attention Network for Hand Gesture Recognition in Videos
    Dhingra, Naina
    Kunz, Andreas
    2019 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2019), 2019, : 491 - 501
  • [32] Panorama based on multi-channel-attention CNN for 3D model recognition
    Weizhi Nie
    Kun Wang
    Qi Liang
    Roubing He
    Multimedia Systems, 2019, 25 : 655 - 662
  • [33] Panorama based on multi-channel-attention CNN for 3D model recognition
    Nie, Weizhi
    Wang, Kun
    Liang, Qi
    He, Roubing
    MULTIMEDIA SYSTEMS, 2019, 25 (06) : 655 - 662
  • [34] Hand-drawn sketch recognition with a double-channel convolutional neural network
    Zhang, Lei
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2021, 2021 (01)
  • [35] Hand-drawn sketch recognition with a double-channel convolutional neural network
    Lei Zhang
    EURASIP Journal on Advances in Signal Processing, 2021
  • [36] Load Prediction in Double-Channel Residual Self-Attention Temporal Convolutional Network with Weight Adaptive Updating in Cloud Computing
    Lin, Jiang
    Guan, Yepeng
    SENSORS, 2024, 24 (10)
  • [37] Action recognition method based on a novel keyframe extraction method and enhanced 3D convolutional neural network
    Tian, Qiuhong
    Li, Saiwei
    Zhang, Yuankui
    Lu, Hongyi
    Pan, Hao
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2025, 16 (01) : 475 - 491
  • [38] MHSAN: Multi-view hierarchical self-attention network for 3D shape recognition
    Cao, Jiangzhong
    Yu, Lianggeng
    Ling, Bingo Wing-Kuen
    Yao, Zijie
    Dai, Qingyun
    PATTERN RECOGNITION, 2024, 150
  • [39] Attention-Guided Fusion Network of Point Cloud and Multiple Views for 3D Shape Recognition
    Peng, Bo
    Yu, Zengrui
    Lei, Jianjun
    Song, Jiahui
    2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 185 - 188
  • [40] Sparse attention double-channel FCN network for numerical analysis tracheid features in larch
    Li, Chao
    Zhang, Lixin
    Wang, Saipeng
    Chen, Xun
    Jing, Weipeng
    FRONTIERS IN PLANT SCIENCE, 2022, 13