Fixed-resolution representation network for human pose estimation

Cited by: 0
Authors
Yongxiang Liu
Xiaorong Hou
Affiliation
[1] School of Automation Engineering, University of Electronic Science and Technology of China
Source
Multimedia Systems | 2022 / Vol. 28
Keywords
Human pose estimation; Fixed-resolution representation; Multi-receptive fields; Feature extraction; Information selection
DOI
Not available
Abstract
Human pose estimation from a single image is a fundamental yet challenging task in computer vision. Most existing methods, such as Hourglass, HRNet, and their variants, gradually generate multi-resolution representations from high resolution to low resolution, then recover the higher resolution from the low resolution and use it to generate the final pose heatmaps. In this paper, we propose a novel architecture, the fixed-resolution representation network for human pose estimation, which maintains a fixed resolution throughout the whole process to preserve rich spatial-structural information. First, we propose an Improved Pyramid Convolutional Bottleneck (IPCB) that encodes feature maps with multiple receptive fields at the same resolution. Second, we introduce an efficient channel attention mechanism to enhance the feature extraction and information selection capability of IPCB. Third, to address the deviation introduced by flip testing at inference, we adopt an existing technique, Unbiased Data Processing. Fourth, because of the changed model structure and limited computing resources, we introduce an iterative retraining strategy to solve the pre-training problem. We empirically demonstrate the effectiveness of our method: with 1.7M parameters and 3G FLOPs, it achieves competitive performance compared with state-of-the-art methods, scoring 89.5 (PCKh@0.5) on MPII and 92.7 (PCK@0.2) on LSP, the benchmark keypoint detection datasets.
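For readers unfamiliar with the reported metrics, the following is a minimal sketch of how a PCK-style score is commonly computed: a predicted keypoint counts as correct when its distance to the ground truth is at most a fraction alpha of a per-person reference length (head segment length for PCKh, torso size for PCK). The function name `pck`, its arguments, and the toy coordinates are hypothetical and are not taken from the paper; the sketch also omits details such as joint visibility masks that benchmark evaluation scripts apply.

```python
import math


def pck(pred, gt, ref_lengths, alpha):
    """Percentage of keypoints within alpha * reference length of the ground truth.

    pred, gt: per-person lists of (x, y) keypoints (hypothetical layout).
    ref_lengths: one normalizing length per person, e.g. head segment
    length for PCKh or torso size for PCK.
    """
    correct = 0
    total = 0
    for p_kpts, g_kpts, ref in zip(pred, gt, ref_lengths):
        for (px, py), (gx, gy) in zip(p_kpts, g_kpts):
            dist = math.hypot(px - gx, py - gy)
            correct += dist <= alpha * ref  # True counts as 1
            total += 1
    return 100.0 * correct / total if total else 0.0


# Toy data (made up for illustration): one person, three joints,
# reference length 10 px, so the PCKh@0.5 threshold is 5 px.
gt = [[(10.0, 10.0), (20.0, 20.0), (30.0, 30.0)]]
pred = [[(13.0, 13.0), (20.0, 26.0), (30.0, 30.0)]]
score = pck(pred, gt, ref_lengths=[10.0], alpha=0.5)
print(round(score, 1))
```

In this toy case the second joint is 6 px off and exceeds the 5 px threshold, so two of the three joints count as correct.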
Pages: 1597 - 1609
Number of pages: 12
Related papers
50 in total
  • [31] Multistage attention network for human pose estimation
    Zhou, Jingyang
    Wen, Guangzhao
    Zhang, Yu
    Geng, Xin
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (06)
  • [32] Structure guided network for human pose estimation
    Yilei Chen
    Xuemei Xie
    Wenjie Yin
    Bo’ao Li
    Fu Li
    Applied Intelligence, 2023, 53 : 21012 - 21026
  • [33] Learning high resolution reservation for human pose estimation
    Gao, Bingkun
    Ma, Ke
    Bi, Hongbo
    Wang, Ling
    Wu, Chenlei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (19) : 29251 - 29265
  • [34] Efficient High-Resolution Human Pose Estimation
    Qin, Xiaofei
    Qiu, Lingfeng
    He, Changxiang
    Zhang, Xuedian
    PRICAI 2022: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2022, 13631 : 383 - 396
  • [35] A residual semantic graph convolutional network with high-resolution representation for 3D human pose estimation in a virtual fashion show
    Zhang P.
    Ding P.
    Li G.
    Zhang J.
    Multimedia Tools and Applications, 2024, 83 (29) : 73649 - 73669
  • [36] Learning high resolution reservation for human pose estimation
    Bingkun Gao
    Ke Ma
    Hongbo Bi
    Ling Wang
    Chenlei Wu
    Multimedia Tools and Applications, 2021, 80 : 29251 - 29265
  • [37] Deep High-Resolution Network With Double Attention Residual Blocks for Human Pose Estimation
    Huo, Zhanqiang
    Jin, Han
    Qiao, Yingxu
    Luo, Fen
    IEEE ACCESS, 2020, 8 : 224947 - 224957
  • [38] High resolution network for human hand pose estimation based on 3D convolution
    Sang N.
    Li M.
    1600, Huazhong University of Science and Technology (48): : 1 - 6
  • [39] Multi-scale Attention Aided Multi-Resolution Network for Human Pose Estimation
    Selvam, Srinika
    Mishra, Deepak
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2019, PT I, 2019, 11941 : 461 - 472
  • [40] DB-HRNet: Dual Branch High-Resolution Network for Human Pose Estimation
    Wang, Yanxia
    Wang, Renjie
    Shi, Hu
    IEEE ACCESS, 2023, 11 : 120628 - 120641