3D Human Pose Estimation Using Improved Semantic Graph Convolutional Based on Fusing Non-local Neural Network and Multi-Head Attention

Cited by: 0
Authors
Gui W. [1]
Luo Y. [1]
Affiliations
[1] School of Electrical and Information Engineering, Zhengzhou University, 100 Avenue of Science, Zhengzhou
Keywords
3D human pose estimation; Multi-head attention mechanism; Non-local neural networks; Semantic graph convolutional networks
DOI
10.1007/s40031-024-01050-x
Abstract
Although semantic graph convolutional networks can effectively learn the dependencies between joints and bones, their accuracy in estimating human body coordinates is limited. To address this problem, this paper studies semantic graph convolutional networks and identifies two limitations: they struggle to capture complex long-range dependencies, and they do not assign appropriate importance weights across graph nodes. To overcome these issues, a novel module, NMHA, is built by fusing multi-head attention with non-local neural networks to strengthen the relational modeling capability of semantic graph convolutional networks. The paper further proposes a new 3D human pose estimation model, NMHA-SemGCN, which incorporates NMHA. Detailed experiments on the Human3.6M and HumanEva-I datasets show that NMHA-SemGCN achieves significant improvements in accuracy over the previous approach, demonstrating the effectiveness of the method. The paper also presents a complete approach for estimating 3D skeletal coordinates from monocular images using the NMHA-SemGCN model, demonstrating its potential for practical applications. © The Institution of Engineers (India) 2024.
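The abstract does not describe the internals of NMHA, so the following is only a rough illustration of the general idea it names, not the authors' implementation. It is a minimal sketch, in plain Python, of multi-head self-attention applied across all skeleton joints with a non-local-style residual connection. All function names, the joint count (16), the channel width (8), the head count (2), and the random weights are assumptions made for illustration.

```python
import math
import random


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def softmax(row):
    """Numerically stable softmax over one list of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def attention_head(x, w_q, w_k, w_v):
    """Scaled dot-product self-attention: every joint attends to every joint."""
    q, k, v = matmul(x, w_q), matmul(x, w_k), matmul(x, w_v)
    d = len(q[0])
    k_t = [list(col) for col in zip(*k)]  # transpose of k
    scores = matmul(q, k_t)               # joints x joints
    weights = [softmax([s / math.sqrt(d) for s in row]) for row in scores]
    return matmul(weights, v), weights


def multi_head_nonlocal(x, heads, w_out):
    """Concatenate head outputs, project them, and add the input back (residual)."""
    outs = [attention_head(x, *h)[0] for h in heads]
    concat = [sum((o[i] for o in outs), []) for i in range(len(x))]
    proj = matmul(concat, w_out)
    return [[a + b for a, b in zip(xr, pr)] for xr, pr in zip(x, proj)]


def rand_matrix(rows, cols, rng):
    """Hypothetical weight initializer, used only to make the sketch runnable."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


# Illustrative sizes (assumptions, not values from the paper).
rng = random.Random(0)
num_joints, channels, num_heads = 16, 8, 2
head_dim = channels // num_heads

x = rand_matrix(num_joints, channels, rng)  # one feature vector per joint
heads = [
    tuple(rand_matrix(channels, head_dim, rng) for _ in range(3))
    for _ in range(num_heads)
]
w_out = rand_matrix(channels, channels, rng)

y = multi_head_nonlocal(x, heads, w_out)  # same shape as x
```

Because the attention weights span all joint pairs, distant joints (for example a wrist and an ankle) can exchange information in a single step, which is the kind of long-range dependency the abstract says plain semantic graph convolutions have difficulty capturing. How the paper actually combines this with SemGCN layers is not stated in the abstract.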
Pages: 1109-1119
Number of pages: 10
Related papers
50 in total
  • [11] 3D HEAD POSE ESTIMATION BASED ON GRAPH CONVOLUTIONAL NETWORK FROM A SINGLE RGB IMAGE
    Lie, Wen-Nung
    Yim, Monyneath
    Aing, Lee
    Chiang, Jui-Chiu
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 3963 - 3967
  • [12] ResGAT: an improved graph neural network based on multi-head attention mechanism and residual network for paper classification
    Xuejian Huang
    Zhibin Wu
    Gensheng Wang
    Zhipeng Li
    Yuansheng Luo
    Xiaofang Wu
    Scientometrics, 2024, 129 : 1015 - 1036
  • [13] 3D HEAD POSE ESTIMATION WITH CONVOLUTIONAL NEURAL NETWORK TRAINED ON SYNTHETIC IMAGES
    Liu, Xiabing
    Liang, Wei
    Wang, Yumeng
    Li, Shuyang
    Pei, Mingtao
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 1289 - 1293
  • [14] Improved Convolutional Neural Network Based on Multi-head Attention Mechanism for Industrial Process Fault Classification
    Cui, Wenzhi
    Deng, Xiaogang
    Zhang, Zheng
    PROCEEDINGS OF 2020 IEEE 9TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS'20), 2020, : 918 - 922
  • [15] ResGAT: an improved graph neural network based on multi-head attention mechanism and residual network for paper classification
    Huang, Xuejian
    Wu, Zhibin
    Wang, Gensheng
    Li, Zhipeng
    Luo, Yuansheng
    Wu, Xiaofang
    SCIENTOMETRICS, 2024, 129 (02) : 1015 - 1036
  • [16] 3D Human Pose Estimation Using Möbius Graph Convolutional Networks
    Azizi, Niloofar
    Possegger, Horst
    Rodolà, Emanuele
    Bischof, Horst
    COMPUTER VISION - ECCV 2022, PT I, 2022, 13661 : 160 - 178
  • [17] Semantic Graph Convolutional Networks for 3D Human Pose Regression
    Zhao, Long
    Peng, Xi
    Tian, Yu
    Kapadia, Mubbasir
    Metaxas, Dimitris N.
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3420 - 3430
  • [18] Interactive Selection Recommendation Based on the Multi-head Attention Graph Neural Network
    Zhang, Shuxi
    Chen, Jianxia
    Yao, Meihan
    Wu, Xinyun
    Ge, Yvfan
    Li, Shu
    NEURAL INFORMATION PROCESSING, ICONIP 2023, PT III, 2024, 14449 : 447 - 458
  • [19] Extracting biomedical relations via a multi-head attention based graph convolutional network
    Wang, Erniu
    Wang, Fan
    Yang, Zhihao
    Wang, Lei
    Zhang, Yin
    Lin, Hongfei
    Wang, Jian
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 793 - 798
  • [20] Relation-balanced graph convolutional network for 3D human pose estimation
    Chen, Lu
    Liu, Qiong
    IMAGE AND VISION COMPUTING, 2023, 140