Fixed-resolution representation network for human pose estimation

Cited by: 0
Authors
Yongxiang Liu
Xiaorong Hou
Affiliation
[1] University of Electronic Science and Technology of China, School of Automation Engineering
Source
Multimedia Systems | 2022 / Vol. 28
Keywords
Human pose estimation; Fixed-resolution representation; Multi-receptive fields; Feature extraction; Information selection;
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
Human pose estimation from a single image is a fundamental yet challenging task in computer vision. Most existing methods, such as Hourglass, HRNet, and their variants, progressively downsample from high resolution to low resolution to build multi-resolution features, then recover a higher resolution from the low-resolution features and use it to generate the final pose heatmaps. In this paper, we propose a novel architecture, the fixed-resolution representation network for human pose estimation, which maintains a fixed resolution throughout the whole process to preserve rich spatial-structural information. First, we propose an Improved Pyramid Convolutional Bottleneck (IPCB) that encodes feature maps with multiple receptive fields at the same resolution. Second, we introduce an efficient channel attention mechanism to enhance the feature extraction and information selection capability of IPCB. Third, to address the deviation introduced by the flip test at inference, we adopt an existing technique, Unbiased Data Processing. Fourth, because the model structure changes and computing resources are limited, we introduce an iterative retraining strategy to address the pre-training problem. We empirically demonstrate the effectiveness of our method: with 1.7M parameters and 3G FLOPs, it achieves 89.5 (PCKh@0.5) on MPII and 92.7 (PCK@0.2) on LSP, which is competitive with state-of-the-art methods on these keypoint detection benchmarks.
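The abstract reports its results as PCKh@0.5 and PCK@0.2. As a rough illustration of what these numbers measure, the sketch below computes a generic PCK score: the percentage of predicted keypoints that fall within alpha times a per-sample reference length of the ground truth (head size for PCKh, torso size for PCK). The function name `pck`, its arguments, and the toy coordinates are assumptions made for illustration; this is not the authors' evaluation code, and the official benchmark scripts handle details such as joint visibility that are omitted here.

```python
import math


def pck(pred, gt, ref_lengths, alpha):
    """Percentage of keypoints whose error is within alpha * reference length.

    pred, gt:     per-sample lists of (x, y) keypoints (hypothetical layout)
    ref_lengths:  per-sample normalization length, supplied by the caller
                  (head size for PCKh, torso size for PCK)
    alpha:        threshold factor, e.g. 0.5 for PCKh@0.5, 0.2 for PCK@0.2
    """
    correct = 0
    total = 0
    for sample_pred, sample_gt, ref in zip(pred, gt, ref_lengths):
        for (px, py), (gx, gy) in zip(sample_pred, sample_gt):
            # Euclidean distance between predicted and ground-truth keypoint
            dist = math.hypot(px - gx, py - gy)
            correct += dist <= alpha * ref
            total += 1
    return 100.0 * correct / total if total else 0.0


# Toy data, invented for illustration only.
gt = [[(10.0, 10.0), (20.0, 20.0)], [(5.0, 5.0), (15.0, 5.0)]]
pred = [[(11.0, 10.0), (26.0, 20.0)], [(5.0, 6.0), (15.0, 5.0)]]
head_sizes = [10.0, 10.0]

score = pck(pred, gt, head_sizes, alpha=0.5)
print(score)
```

In this toy case the keypoint errors are 1, 6, 1, and 0 pixels against a threshold of 5 pixels, so three of four keypoints count as correct.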
Pages: 1597 / 1609
Page count: 12
Related papers
50 in total
  • [1] Fixed-resolution representation network for human pose estimation
    Liu, Yongxiang
    Hou, Xiaorong
    MULTIMEDIA SYSTEMS, 2022, 28 (05) : 1597 - 1609
  • [2] Deep High-Resolution Representation Learning for Human Pose Estimation
    Sun, Ke
    Xiao, Bin
    Liu, Dong
    Wang, Jingdong
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5686 - 5696
  • [3] SHaRPose: Sparse High-Resolution Representation for Human Pose Estimation
    An, Xiaoqi
    Zhao, Lin
    Gong, Chen
    Wang, Nannan
    Wang, Di
    Yang, Jian
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 2, 2024, : 691 - 699
  • [4] Simple Multi-Resolution Representation Learning for Human Pose Estimation
    Tran, Trung Q.
    Nguyen, Giang, V
    Kim, Daeyoung
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 511 - 518
  • [5] Lightweight Multi-Resolution Network for Human Pose Estimation
    Li, Pengxin
    Wang, Rong
    Zhang, Wenjing
    Liu, Yinuo
    Xu, Chenyue
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2024, 138 (03) : 2239 - 2255
  • [6] Ghost shuffle lightweight pose network with effective feature representation and learning for human pose estimation
    Yang, Senquan
    Wen, Jiajun
    Fan, Junjun
    IET COMPUTER VISION, 2022, 16 (06) : 525 - 540
  • [7] Human Pose Estimation Based on Attention Multi-resolution Network
    Zhang, Congcong
    He, Ning
    Sun, Qixiang
    Yin, Xiaojie
    Lu, Ke
    PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 682 - 687
  • [8] High-Resolution with Global Context Network for Human Pose Estimation
    Wang, Kehao
    Li, Chenglin
    Ren, Ruiqi
    2022 27TH ASIA PACIFIC CONFERENCE ON COMMUNICATIONS (APCC 2022): CREATING INNOVATIVE COMMUNICATION TECHNOLOGIES FOR POST-PANDEMIC ERA, 2022, : 621 - 626
  • [9] Lightweight and Efficient High-Resolution Network for Human Pose Estimation
    Liu, Jiarui
    Gong, Xiugang
    Guo, Qun
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (08) : 232 - 240
  • [10] FastNet: Fast high-resolution network for human pose estimation
    Luo, Yanmin
    Ou, Zhilong
    Wan, Tianjun
    Guo, Jing-Ming
    IMAGE AND VISION COMPUTING, 2022, 119