Multi-hop graph transformer network for 3D human pose estimation

Cited by: 3
Authors
Islam, Zaedul [1]
Ben Hamza, A. [1]
Affiliations
[1] Concordia Univ, Concordia Inst Informat Syst Engn, Montreal, PQ, Canada
Funding
Natural Sciences and Engineering Research Council of Canada
Keywords
3D human pose estimation; Graph convolutional network; Transformer; Multi-hop; Dilated convolution;
DOI
10.1016/j.jvcir.2024.104174
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Accurate 3D human pose estimation is a challenging task due to occlusion and depth ambiguity. In this paper, we introduce a multi-hop graph transformer network designed for 2D-to-3D human pose estimation in videos by leveraging the strengths of multi-head self-attention and multi-hop graph convolutional networks with disentangled neighborhoods to capture spatio-temporal dependencies and handle long-range interactions. The proposed network architecture consists of a graph attention block composed of stacked layers of multi-head self-attention and graph convolution with learnable adjacency matrix, and a multi-hop graph convolutional block comprised of multi-hop convolutional and dilated convolutional layers. The combination of multi-head self-attention and multi-hop graph convolutional layers enables the model to capture both local and global dependencies, while the integration of dilated convolutional layers enhances the model's ability to handle spatial details required for accurate localization of the human body joints. Extensive experiments demonstrate the effectiveness and generalization ability of our model, achieving competitive performance on benchmark datasets.
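The abstract mentions multi-hop graph convolution with disentangled neighborhoods but gives no formulas. As a rough illustration only, the sketch below shows one common reading of that idea: the k-th hop matrix links two joints only when their shortest-path distance is exactly k, so the hop neighborhoods do not overlap, and each hop gets its own weight matrix. The function names, the 5-joint chain skeleton, the row normalization, and the random weights are all assumptions made for this example; none of them come from the paper.

```python
import numpy as np


def k_hop_adjacencies(adj, max_hop):
    """Return [A_1, ..., A_K], where A_k[i, j] = 1 iff the shortest-path
    distance between nodes i and j is exactly k (hypothetical helper)."""
    n = adj.shape[0]
    # Binary adjacency with self-loops: one step reaches distance <= 1.
    a_hat = ((adj + np.eye(n)) > 0).astype(int)
    reach_prev = np.eye(n, dtype=int)  # pairs within distance <= 0
    hops = []
    for _ in range(max_hop):
        # Pairs within distance <= k, built from pairs within distance <= k-1.
        reach = ((reach_prev @ a_hat) > 0).astype(int)
        # Keep only the pairs that became reachable at exactly this hop.
        hops.append(reach - reach_prev)
        reach_prev = reach
    return hops


def multi_hop_aggregate(x, hops, weights):
    """Sum of row-normalized k-hop aggregations, one weight matrix per hop.
    The normalization choice is an assumption of this sketch."""
    out = np.zeros((x.shape[0], weights[0].shape[1]))
    for a_k, w_k in zip(hops, weights):
        deg = a_k.sum(axis=1, keepdims=True)
        a_norm = a_k / np.maximum(deg, 1)  # guard against isolated rows
        out += a_norm @ x @ w_k
    return out


# Toy skeleton (assumed): a chain of 5 joints, 0-1-2-3-4.
chain = np.zeros((5, 5), dtype=int)
for i in range(4):
    chain[i, i + 1] = chain[i + 1, i] = 1

hops = k_hop_adjacencies(chain, max_hop=3)

rng = np.random.default_rng(0)
x = rng.normal(size=(5, 2))                           # 2D joint coordinates
weights = [rng.normal(size=(2, 3)) for _ in hops]     # one matrix per hop
y = multi_hop_aggregate(x, hops, weights)             # per-joint features, shape (5, 3)
```

Because each pair of joints appears in at most one hop matrix, distant joints are not drowned out by the many short walks that dominate plain powers of the adjacency matrix.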
Pages: 12
Related papers
50 in total
  • [1] Multi-hop Graph Transformer Network for 3D Human Pose Estimation
    Islam, Zaedul
    Hamza, A. Ben
    arXiv,
  • [2] Multi-hop Modulated Graph Convolutional Networks for 3D Human Pose Estimation
    Lee, Jae Yung
    Kim, I. Gil
    BMVC 2022 - 33rd British Machine Vision Conference Proceedings, 2022,
  • [3] DGFormer: Dynamic graph transformer for 3D human pose estimation
    Chen, Zhangmeng
    Dai, Ju
    Bai, Junxuan
    Pan, Junjun
    PATTERN RECOGNITION, 2024, 152
  • [4] Deep Semantic Graph Transformer for Multi-View 3D Human Pose Estimation
    Zhang, Lijun
    Zhou, Kangkang
    Lu, Feng
    Zhou, Xiang-Dong
    Shi, Yu
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 7205 - 7214
  • [5] Modulated Graph Convolutional Network for 3D Human Pose Estimation
    Zou, Zhiming
    Tang, Wei
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11457 - 11467
  • [6] Flexible Graph Convolutional Network for 3D Human Pose Estimation
    Shahjahan, Abu Taib Mohammed
    Hamza, A. Ben
    arXiv,
  • [7] Iterative graph filtering network for 3D human pose estimation
    Islam, Zaedul
    Ben Hamza, A.
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 95
  • [8] Iterative Graph Filtering Network for 3D Human Pose Estimation
    Islam, Zaedul
    Ben Hamza, A.
    arXiv, 2023,
  • [9] Regular Splitting Graph Network for 3D Human Pose Estimation
    Hassan, Md. Tanvir
    Ben Hamza, A.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 4212 - 4222
  • [10] Hierarchical parallel multi-scale graph network for 3d human pose estimation
    Yang, Honghong
    Liu, Hongxi
    Zhang, Yumei
    Wu, Xiaojun
    APPLIED SOFT COMPUTING, 2023, 140