A novel 3D shape recognition method based on double-channel attention residual network

被引：6

作者：

Ma, Ziping ^{[1
]}

Zhou, Jie ^{[2
]}

Ma, Jinlin ^{[2
]}

Li, Tingting ^{[2
]}

机构：

[1] North Minzu Univ, Coll Math & Informat Sci, Yinchuan 750021, Ningxia, Peoples R China

[2] North Minzu Univ, Coll Comp Sci & Engn, Yinchuan 750021, Ningxia, Peoples R China

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2022年 / 81卷 / 22期

基金：

中国国家自然科学基金;

关键词：

3D shape recognition; Residual; Multi-head self-attention; Weighted loss function; CONVOLUTIONAL NEURAL-NETWORK; POINT CLOUD; RETRIEVAL; CLASSIFICATION;

D O I：

10.1007/s11042-022-12041-9

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Learning 3D features by deep networks has achieved a successful performance up to now. However, data imbalance and low-resolution voxels still remain and influence the performance of 3D shape recognition. To resolve these issues, we propose double-channel attention residual network (double-RVCNN) as a novel deep network model with residual structure based on multi-head self-attention mechanism. Double-channel structure adopts double channels to input data including voxels and 3D Radon feature matrices, aiming to fully utilize the local and global features. The multi-head self-attention mechanism can integrate the relatively important contents of the input data through multiple heads structure, which can enrich the information processing ability and stabilize the training process of our network. Residual structure with cross-entropy loss and center loss as weighted loss function can avoid information loss to a great extent. Experimental results show that the values of mean average precision (MAP) are 83.31% and 74.04%, the values of classification accuracy are 90.53% and 85.09% on ModelNet10 and ModelNet40 datasets respectively, which demonstrates that our method performs a better 3D shape recognition accuracy than compared methods on test datasets.

引用

页码：32519 / 32548

页数：30

共 50 条

[31] Res3ATN-Deep 3D Residual Attention Network for Hand Gesture Recognition in Videos
Dhingra, Naina
Kunz, Andreas
2019 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2019), 2019, : 491 - 501
[32] Panorama based on multi-channel-attention CNN for 3D model recognition
Weizhi Nie
Kun Wang
Qi Liang
Roubing He
Multimedia Systems, 2019, 25 : 655 - 662
[33] Panorama based on multi-channel-attention CNN for 3D model recognition
Nie, Weizhi
Wang, Kun
Liang, Qi
He, Roubing
MULTIMEDIA SYSTEMS, 2019, 25 (06) : 655 - 662
[34] Hand-drawn sketch recognition with a double-channel convolutional neural network
Zhang, Lei
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2021, 2021 (01)
[35] Hand-drawn sketch recognition with a double-channel convolutional neural network
Lei Zhang
EURASIP Journal on Advances in Signal Processing, 2021
[36] Load Prediction in Double-Channel Residual Self-Attention Temporal Convolutional Network with Weight Adaptive Updating in Cloud Computing
Lin, Jiang
Guan, Yepeng
SENSORS, 2024, 24 (10)
[37] Action recognition method based on a novel keyframe extraction method and enhanced 3D convolutional neural network
Tian, Qiuhong
Li, Saiwei
Zhang, Yuankui
Lu, Hongyi
Pan, Hao
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2025, 16 (01) : 475 - 491
[38] MHSAN: Multi-view hierarchical self-attention network for 3D shape recognition
Cao, Jiangzhong
Yu, Lianggeng
Ling, Bingo Wing-Kuen
Yao, Zijie
Dai, Qingyun
PATTERN RECOGNITION, 2024, 150
[39] Attention-Guided Fusion Network of Point Cloud and Multiple Views for 3D Shape Recognition
Peng, Bo
Yu, Zengrui
Lei, Jianjun
Song, Jiahui
2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 185 - 188
[40] Sparse attention double-channel FCN network for numerical analysis tracheid features in larch
Li, Chao
Zhang, Lixin
Wang, Saipeng
Chen, Xun
Jing, Weipeng
FRONTIERS IN PLANT SCIENCE, 2022, 13

← 1 2 3 4 5 →