HKE-GCN: Heatmaps-guided Keypoints Encoder and Graph Convolutional Network for Human Pose Estimation

被引:4
|
作者
Xia, Han [1 ]
Wang, Yiran [2 ]
Wang, Xiaoru [1 ]
Xiong, Songkai [1 ]
Yu, Zhihong [3 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
[2] Beijing Forestry Univ, Beijing, Peoples R China
[3] Intel China Res Ctr, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Human Pose Estimation; Heatmaps-guided Keypoints Encoder; Graph Convolutional Network;
D O I
10.1109/IJCNN55064.2022.9892251
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-person pose estimation is a challenging task which aims to locate keypoints for multiple persons. Graph convolutional network can effectively capture the semantic relationship among keypoints according to the kinematic structure of the human body, which is beneficial to locate keypoints but is the lack of ability of most CNN-based models. However, existing GCN-based methods mostly flatten the 2D features directly to obtain 1D embeddings, leading to the redundant information in keypoints embeddings, large size of keypoints embeddings, and high computation cost. To address these problems, we propose a two-stage framework based on Heatmaps-guided Keypoints Encoder and graph convolutional network, called HKE-GCN. The first stage uses a heatmaps-based network to predict the heatmaps of keypoints, then the second stage refines the prediction of the first stage. The second stage consists of two modules: Heatmaps-guided Keypoints Encoder (HKE) and Graph-based Refinement Module (GRM), which are used to generate keypoints embeddings according to the guidance of heatmaps and explicitly learn the relationship among keypoints based on GCN, respectively. Experiments show our framework is model-agnostic and our proposed modules are effective and lightweight. Our best model achieves state-of-the-art 76.4AP on COCO test-dev.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Graph Convolutional Adversarial Network for Human Body Pose and Mesh Estimation
    Huang, Yuancheng
    Xiao, Nanfeng
    IEEE ACCESS, 2020, 8 : 215419 - 215425
  • [2] Human Pose Estimation Based on a Spatial Temporal Graph Convolutional Network
    Wu, Meng
    Shi, Pudong
    APPLIED SCIENCES-BASEL, 2023, 13 (05):
  • [3] EVA-GCN: Head Pose Estimation Based on Graph Convolutional Networks
    Xin, Miao
    Mo, Shentong
    Lin, Yuanze
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 1462 - 1471
  • [4] PR-GCN: A Deep Graph Convolutional Network with Point Refinement for 6D Pose Estimation
    Zhou, Guangyuan
    Wang, Huiqun
    Chen, Jiaxin
    Huang, Di
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 2773 - 2782
  • [5] Modulated Graph Convolutional Network for 3D Human Pose Estimation
    Zou, Zhiming
    Tang, Wei
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11457 - 11467
  • [6] Flexible Graph Convolutional Network for 3D Human Pose Estimation
    Shahjahan, Abu Taib Mohammed
    Hamza, A. Ben
    arXiv,
  • [7] Human pose estimation with spatial context relationships based on graph convolutional network
    Han, Na
    PROCEEDINGS OF 2020 IEEE 5TH INFORMATION TECHNOLOGY AND MECHATRONICS ENGINEERING CONFERENCE (ITOEC 2020), 2020, : 1566 - 1570
  • [8] Complex Human Pose Estimation via Keypoints Association Constraint Network
    Zhu, Xuan
    Guo, Zhenpeng
    Liu, Xin
    Li, Bin
    Peng, Jinye
    Chen, Peirong
    Wang, Rongzhi
    IEEE ACCESS, 2020, 8 : 205938 - 205947
  • [9] HPGCN: Hierarchical poselet-guided graph convolutional network for 3D pose estimation
    Wu, Yongpeng
    Kong, Dehui
    Wang, Shaofan
    Li, Jinghua
    Yin, Baocai
    NEUROCOMPUTING, 2022, 487 : 243 - 256
  • [10] SA-GCN: structure-aware graph convolutional networks for crowd pose estimation
    Wang, Jia
    Luo, Yanmin
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (09): : 10046 - 10062