STIGANet: Integrating DGCNS and attention mechanisms for real-time 3D pose estimation in sports

被引:0
|
作者
Liu, Qi [1 ]
Wang, Zhezhou [2 ]
Zhang, Han [3 ]
Miao, Changqing [1 ]
机构
[1] Anyang Normal Univ, Sch Phys Educ, Anyang 455000, Peoples R China
[2] Ningxia Univ, Sch Phys Educ, Ningxia 750021, Peoples R China
[3] Anyang Normal Univ, Sch Comp & Informat Engn, Anyang 455000, Peoples R China
关键词
Sports action analysis; 3D human pose estimation; Dynamic graph convolutional networks; Attention mechanisms; Deformable transformer encoder;
D O I
10.1016/j.aej.2025.02.058
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In modern sports training and competitions, precise action analysis and feedback are essential for optimizing athletes' performance. Traditional methods, however, are time-consuming, labor-intensive, and prone to subjective judgment, leading to inconsistencies and inaccuracies. Existing AI-based approaches struggle with high-speed movements, complex backgrounds, and real-time processing. To address these limitations, we propose the Spatio-Temporal Interweaved Graph and Attention Network (STIGANet) for accurate 3D human pose estimation. STIGANet combines Dynamic Graph Convolutional Networks (DGCN), a Spatio-Temporal Cross-Attention Mechanism (STCA), Spatio-Temporal Interweaved Attention (STIA), and a Deformable Transformer Encoder, enabling effective capture and fusion of spatial and temporal features inhuman actions. The model improves pose estimation accuracy and robustness in dynamic, real-time sports environments. On the Human3.6M and MPI-INF-3DHP datasets, STIGANet achieves superior performance with MPJPEs of 38.2 mm and 45.3 mm, respectively, outperforming existing methods. These findings highlight the model's potential for real-time sports action analysis. Overall, this work enhances sports action analysis by combining graph convolutional networks with attention mechanisms, offering a robust framework for real-time insights during sports training and rehabilitation.
引用
收藏
页码:236 / 247
页数:12
相关论文
共 50 条
  • [1] Fast 3D Hand Pose Estimation for Real-time System
    Song, Jae-Hun
    Kang, Suk-Ju
    2020 17TH INTERNATIONAL SOC DESIGN CONFERENCE (ISOCC 2020), 2020, : 121 - 122
  • [2] Real-Time 3D Hand Pose Estimation with 3D Convolutional Neural Networks
    Ge, Liuhao
    Liang, Hui
    Yuan, Junsong
    Thalmann, Daniel
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (04) : 956 - 970
  • [3] Real-time camera pose estimation for sports fields
    Leonardo Citraro
    Pablo Márquez-Neila
    Stefano Savarè
    Vivek Jayaram
    Charles Dubout
    Félix Renaut
    Andrés Hasfura
    Horesh Ben Shitrit
    Pascal Fua
    Machine Vision and Applications, 2020, 31
  • [4] Real-time camera pose estimation for sports fields
    Citraro, Leonardo
    Marquez-Neila, Pablo
    Savare, Stefano
    Jayaram, Vivek
    Dubout, Charles
    Renaut, Felix
    Hasfura, Andres
    Ben Shitrit, Horesh
    Fua, Pascal
    MACHINE VISION AND APPLICATIONS, 2020, 31 (03)
  • [5] REAL-TIME 3D RECONSTRUCTION AND POSE ESTIMATION FOR HUMAN MOTION ANALYSIS
    Graf, Holger
    Yoon, Sang Min
    Malerczyk, Cornelius
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 3981 - 3984
  • [6] Real-time 3D Pose Estimation from Single Depth Images
    Schnuerer, Thomas
    Fuchs, Stefan
    Eisenbach, Markus
    Gross, Horst-Michael
    PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2019, : 716 - 724
  • [7] Downsizing Heatmap Resolution for real-time 3D Human Pose Estimation
    Kong, Dae-hyeon
    Kang, Suk-ju
    2021 36TH INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS AND COMMUNICATIONS (ITC-CSCC), 2021,
  • [8] Real-time modeling of face deformation for 3D head pose estimation
    Oka, K
    Sato, Y
    ANALYSIS AND MODELLING OF FACES AND GESTURES, PROCEEDINGS, 2005, 3723 : 308 - 320
  • [9] Real-time head tracking and 3D pose estimation from range data
    Malassiotis, S
    Strintzis, MG
    2003 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL 2, PROCEEDINGS, 2003, : 859 - 862
  • [10] Real-time upper body detection and 3D pose estimation in monoscopic images
    Micilotta, Antonio S.
    Ong, Eng-Jon
    Bowden, Richard
    COMPUTER VISION - ECCV 2006, PT 3, PROCEEDINGS, 2006, 3953 : 139 - 150