Joint graph convolution networks and transformer for human pose estimation in sports technique analysis

被引:6
|
作者
Cheng, Hongren [1 ,2 ]
Wang, Jing [3 ]
Zhao, Anran [4 ]
Zhong, Yaping [1 ,2 ]
Li, Jingli [5 ]
Dong, Liangshan [6 ]
机构
[1] Wuhan Sports Univ, Sports Big Data Res Ctr, Wuhan 430079, Peoples R China
[2] Hubei Prov Sports & Hlth Innovat Dev Res Ctr, Wuhan 430079, Hubei, Peoples R China
[3] Chongqing Univ Posts & Telecommun, Sch Automat, Chongqing 400065, Peoples R China
[4] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan 430079, Peoples R China
[5] Huazhong Univ Sci & Technol, Sch Phys Educ, Wuhan 430074, Peoples R China
[6] China Univ Geosci, Sch Phys Educ, Wuhan 430074, Peoples R China
关键词
Human pose estimation; Graph convolutional network; Transformer; The topological structure between; IMAGE STEGANOGRAPHY METHOD;
D O I
10.1016/j.jksuci.2023.101819
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Human pose estimation has various applications in domains such as sports technology analysis, virtual reality, and education. However, most previous studies focused on the respective feature representations of keypoints, but disregarded the topological relationship among keypoints. To address this challenge, we propose GTPose, a network structure that integrates graph convolutional networks and Transform. First of all, a set of multi-scale convolution operations are applied to extract local feature maps of images. Secondly, the positions of keypoints are roughly estimated by using Transform to process the sequential relations between feature maps. Finally, GCN is adopted to model the topological structure between keypoints to accurately locate the location of keypoints and learn feature representations. The performance of GTPose is evaluated on two real datasets: MS COCO and MPII. Experimental results demonstrate that GTPose outperforms other methods in human pose estimation tasks. In addition, experimental results also show that the spatial relationship between keypoints is effective for accurately characterizing keypoints.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] 3D human pose estimation with multi-scale graph convolution and hierarchical body pooling
    Huang, Ke
    Sui, TianQi
    Wu, Hong
    MULTIMEDIA SYSTEMS, 2022, 28 (02) : 403 - 412
  • [32] LDNet: Lightweight dynamic convolution network for human pose estimation
    Xu, Dingning
    Zhang, Rong
    Guo, Lijun
    Feng, Cun
    Gao, Shangce
    ADVANCED ENGINEERING INFORMATICS, 2022, 54
  • [33] Learning Joint Structure for Human Pose Estimation
    Feng, Shenming
    Hu, Haifeng
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2020, 16 (03)
  • [34] Joint relation based human pose estimation
    Shuang Liang
    Gang Chu
    Chi Xie
    Jiewen Wang
    The Visual Computer, 2022, 38 : 1369 - 1381
  • [35] Graph embedded analysis for head pose estimation
    Fu, Yun
    Huang, Thomas S.
    PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION - PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE, 2006, : 3 - +
  • [36] Joint relation based human pose estimation
    Liang, Shuang
    Chu, Gang
    Xie, Chi
    Wang, Jiewen
    VISUAL COMPUTER, 2022, 38 (04): : 1369 - 1381
  • [37] Enhancing human pose estimation in sports training: Integrating spatiotemporal transformer for improved accuracy and real-time
    Xi, Xinyao
    Zhang, Chen
    Jia, Wen
    Jiang, Ruxue
    ALEXANDRIA ENGINEERING JOURNAL, 2024, 109 : 144 - 156
  • [38] Refining Joint Locations for Human Pose Tracking in Sports Videos
    Zecha, Dan
    Einfalt, Moritz
    Lienhart, Rainer
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 2524 - 2532
  • [39] Hierarchical Graph Neural Network for Human Pose Estimation
    Zheng, Guanghua
    Zhao, Zhongqiu
    Zhang, Zhao
    Yang, Yi
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2663 - 2668
  • [40] GraFormer: Graph-oriented Transformer for 3D Pose Estimation
    Zhao, Weixi
    Wang, Weiqiang
    Tian, Yunjie
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 20406 - 20415