3D Human Pose Estimation Using Improved Semantic Graph Convolutional Based on Fusing Non-local Neural Network and Multi-Head Attention

被引:0
|
作者
Gui W. [1 ]
Luo Y. [1 ]
机构
[1] School of Electrical and Information Engineering, Zhengzhou University, 100 Avenue of Science, Zhengzhou
关键词
3D human pose estimation; Multi-head attention mechanism; Non-local neural networks; Semantic graph convolutional networks;
D O I
10.1007/s40031-024-01050-x
中图分类号
学科分类号
摘要
Although semantic graph convolutions networks can effectively learn the dependencies between joints and bones, their accuracy in estimating human body coordinates is not high. Aiming at solving the above problem, this paper studies semantic graph convolutional networks and discovers the limitations of capturing complex long-range dependencies and assigning appropriate importance weights across graph nodes. To overcome these issues, a novel module, NMHA, is built by fusing multi-head attention and non-local neural networks to enhance the relational modeling capabilities of semantic graph convolutional networks. Furthermore, this paper proposes a new 3D human pose estimation model, NMHA-SemGCN, which incorporates NMHA to better address the defects of human pose estimation. Detailed experiments conducted on the Human3.6M and HumanEva-I datasets reveal that NMHA-SemGCN achieves significant improvements in accuracy over the previous approach. These results show the effectiveness and innovation of our method. Moreover, the paper presents a comprehensive approach for estimating human poses from monocular images to 3D skeletal coordinates utilizing the NMHA-SemGCN model, demonstrating its potential for practical applications. © The Institution of Engineers (India) 2024.
引用
收藏
页码:1109 / 1119
页数:10
相关论文
共 50 条
  • [41] 3D Human Pose Estimation Using Convolutional Neural Networks with 2D Pose Information
    Park, Sungheon
    Hwang, Jihye
    Kwak, Nojun
    COMPUTER VISION - ECCV 2016 WORKSHOPS, PT III, 2016, 9915 : 156 - 169
  • [42] Graph Convolutional Network for 3D Object Pose Estimation in a Point Cloud
    Jung, Tae-Won
    Jeong, Chi-Seo
    Kim, In-Seon
    Yu, Min-Su
    Kwon, Soon-Chul
    Jung, Kye-Dong
    SENSORS, 2022, 22 (21)
  • [43] Hierarchical Graph Attention Based Multi-View Convolutional Neural Network for 3D Object Recognition
    Zeng, Hui
    Zhao, Tianmeng
    Cheng, Ruting
    Wang, Fuzhou
    Liu, Jiwei
    IEEE ACCESS, 2021, 9 (09): : 33323 - 33335
  • [44] 3D Human Pose Estimation from Monocular Images with Deep Convolutional Neural Network
    Li, Sijin
    Chan, Antoni B.
    COMPUTER VISION - ACCV 2014, PT II, 2015, 9004 : 332 - 347
  • [45] A residual semantic graph convolutional network with high-resolution representation for 3D human pose estimation in a virtual fashion show
    Zhang P.
    Ding P.
    Li G.
    Zhang J.
    Multimedia Tools and Applications, 2024, 83 (29) : 73649 - 73669
  • [46] Enhanced 3D Human Pose Estimation from Videos by Using Attention-Based Neural Network with Dilated Convolutions
    Liu, Ruixu
    Shen, Ju
    Wang, He
    Chen, Chen
    Cheung, Sen-ching
    Asari, Vijayan K.
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (05) : 1596 - 1615
  • [47] Enhanced 3D Human Pose Estimation from Videos by Using Attention-Based Neural Network with Dilated Convolutions
    Ruixu Liu
    Ju Shen
    He Wang
    Chen Chen
    Sen-ching Cheung
    Vijayan K. Asari
    International Journal of Computer Vision, 2021, 129 : 1596 - 1615
  • [48] Iterative graph filtering network for 3D human pose estimation
    Islam, Zaedul
    Ben Hamza, A.
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 95
  • [49] Iterative Graph Filtering Network for 3D Human Pose Estimation
    Islam, Zaedul
    Ben Hamza, A.
    arXiv, 2023,
  • [50] Regular Splitting Graph Network for 3D Human Pose Estimation
    Hassan, Md. Tanvir
    Ben Hamza, A.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 4212 - 4222