HKE-GCN: Heatmaps-guided Keypoints Encoder and Graph Convolutional Network for Human Pose Estimation

被引:4
|
作者
Xia, Han [1 ]
Wang, Yiran [2 ]
Wang, Xiaoru [1 ]
Xiong, Songkai [1 ]
Yu, Zhihong [3 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
[2] Beijing Forestry Univ, Beijing, Peoples R China
[3] Intel China Res Ctr, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Human Pose Estimation; Heatmaps-guided Keypoints Encoder; Graph Convolutional Network;
D O I
10.1109/IJCNN55064.2022.9892251
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-person pose estimation is a challenging task which aims to locate keypoints for multiple persons. Graph convolutional network can effectively capture the semantic relationship among keypoints according to the kinematic structure of the human body, which is beneficial to locate keypoints but is the lack of ability of most CNN-based models. However, existing GCN-based methods mostly flatten the 2D features directly to obtain 1D embeddings, leading to the redundant information in keypoints embeddings, large size of keypoints embeddings, and high computation cost. To address these problems, we propose a two-stage framework based on Heatmaps-guided Keypoints Encoder and graph convolutional network, called HKE-GCN. The first stage uses a heatmaps-based network to predict the heatmaps of keypoints, then the second stage refines the prediction of the first stage. The second stage consists of two modules: Heatmaps-guided Keypoints Encoder (HKE) and Graph-based Refinement Module (GRM), which are used to generate keypoints embeddings according to the guidance of heatmaps and explicitly learn the relationship among keypoints based on GCN, respectively. Experiments show our framework is model-agnostic and our proposed modules are effective and lightweight. Our best model achieves state-of-the-art 76.4AP on COCO test-dev.
引用
收藏
页数:8
相关论文
共 50 条
  • [11] SA-GCN: structure-aware graph convolutional networks for crowd pose estimation
    Jia Wang
    Yanmin Luo
    The Journal of Supercomputing, 2023, 79 : 10046 - 10062
  • [12] Structure guided network for human pose estimation
    Chen, Yilei
    Xie, Xuemei
    Yin, Wenjie
    Li, Bo'ao
    Li, Fu
    APPLIED INTELLIGENCE, 2023, 53 (18) : 21012 - 21026
  • [13] Structure guided network for human pose estimation
    Yilei Chen
    Xuemei Xie
    Wenjie Yin
    Bo’ao Li
    Fu Li
    Applied Intelligence, 2023, 53 : 21012 - 21026
  • [14] PVA-GCN: point-voxel absorbing graph convolutional network for 3D human pose estimation from monocular video
    Liu, Minghao
    Wang, Wenshan
    Zhao, Wei
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (04) : 3627 - 3641
  • [15] PVA-GCN: point-voxel absorbing graph convolutional network for 3D human pose estimation from monocular video
    Minghao Liu
    Wenshan Wang
    Wei Zhao
    Signal, Image and Video Processing, 2024, 18 : 3627 - 3641
  • [16] GLA-GCN: Global-local Adaptive Graph Convolutional Network for 3D Human Pose Estimation from Monocular Video
    Yu, Bruce X. B.
    Zhang, Zhi
    Liu, Yongxu
    Zhong, Sheng-hua
    Liu, Yan
    Chen, Chang Wen
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 8784 - 8795
  • [17] Human action recognition using a convolutional neural network based on skeleton heatmaps from two-stage pose estimation
    Sun, Ruiqi
    Zhang, Qin
    Luo, Chuang
    Guo, Jiamin
    Chai, Hui
    BIOMIMETIC INTELLIGENCE AND ROBOTICS, 2022, 2 (03):
  • [18] Relation-balanced graph convolutional network for 3D human pose estimation
    Chen, Lu
    Liu, Qiong
    IMAGE AND VISION COMPUTING, 2023, 140
  • [19] Automated Human Action Recognition with Improved Graph Convolutional Network-based Pose Estimation
    Baghel, Amit
    Kushwaha, Alok Kumar Singh
    Singh, Roshan
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2025, 39 (02)
  • [20] Pedestrian Trajectory Prediction in Heterogeneous Traffic Using Pose Keypoints-Based Convolutional Encoder-Decoder Network
    Chen, Kai
    Song, Xiao
    Ren, Xiaoxiang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (05) : 1764 - 1775