Multi-hop graph transformer network for 3D human pose estimation

被引:3
|
作者
Islam, Zaedul [1 ]
Ben Hamza, A. [1 ]
机构
[1] Concordia Univ, Concordia Inst Informat Syst Engn, Montreal, PQ, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
3D human pose estimation; Graph convolutional network; Transformer; Multi-hop; Dilated convolution;
D O I
10.1016/j.jvcir.2024.104174
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Accurate 3D human pose estimation is a challenging task due to occlusion and depth ambiguity. In this paper, we introduce a multi -hop graph transformer network designed for 2D -to -3D human pose estimation in videos by leveraging the strengths of multi-head self-attention and multi -hop graph convolutional networks with disentangled neighborhoods to capture spatio-temporal dependencies and handle long-range interactions. The proposed network architecture consists of a graph attention block composed of stacked layers of multi-head self-attention and graph convolution with learnable adjacency matrix, and a multi -hop graph convolutional block comprised of multi -hop convolutional and dilated convolutional layers. The combination of multi-head self-attention and multi -hop graph convolutional layers enables the model to capture both local and global dependencies, while the integration of dilated convolutional layers enhances the model's ability to handle spatial details required for accurate localization of the human body joints. Extensive experiments demonstrate the effectiveness and generalization ability of our model, achieving competitive performance on benchmark datasets.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] U-shaped spatial-temporal transformer network for 3D human pose estimation
    Yang, Honghong
    Guo, Longfei
    Zhang, Yumei
    Wu, Xiaojun
    MACHINE VISION AND APPLICATIONS, 2022, 33 (06)
  • [42] Position constrained network for 3D human pose estimation
    Xiena Dong
    Jun Yu
    Jian Zhang
    Multimedia Systems, 2023, 29 : 459 - 468
  • [43] Position constrained network for 3D human pose estimation
    Dong, Xiena
    Yu, Jun
    Zhang, Jian
    MULTIMEDIA SYSTEMS, 2023, 29 (02) : 459 - 468
  • [44] Optimizing Network Structure for 3D Human Pose Estimation
    Ci, Hai
    Wang, Chunyu
    Ma, Xiaoxuan
    Wang, Yizhou
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 2262 - 2271
  • [45] Multi-hypothesis representation learning for transformer-based 3D human pose estimation
    Li, Wenhao
    Liu, Hong
    Tang, Hao
    Wang, Pichao
    PATTERN RECOGNITION, 2023, 141
  • [46] 3D HUMAN POSE REGRESSION USING GRAPH CONVOLUTIONAL NETWORK
    Banik, Soubarna
    García, Alejandro Mendoza
    Knoll, Alois
    arXiv, 2021,
  • [47] 3D HUMAN POSE REGRESSION USING GRAPH CONVOLUTIONAL NETWORK
    Banik, Soubarna
    GarcIa, Alejandro Mendoza
    Knoll, Alois
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 924 - 928
  • [48] Graph Convolutional Network for 3D Object Pose Estimation in a Point Cloud
    Jung, Tae-Won
    Jeong, Chi-Seo
    Kim, In-Seon
    Yu, Min-Su
    Kwon, Soon-Chul
    Jung, Kye-Dong
    SENSORS, 2022, 22 (21)
  • [49] Dual view graph transformer networks for multi-hop knowledge graph reasoning
    Sun, Congcong
    Chen, Jianrui
    Shao, Zhongshi
    Huang, Junjie
    NEURAL NETWORKS, 2025, 186
  • [50] Dynamic Graph Reasoning for Multi-person 3D Pose Estimation
    Qiu, Zhongwei
    Yang, Qiansheng
    Wang, Jian
    Fu, Dongmei
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 3521 - 3529