Fixed-resolution representation network for human pose estimation

被引：0

作者：

Yongxiang Liu

Xiaorong Hou

机构：

[1] University of Electronic Science and Technology of China,School of Automation Engineering

来源：

Multimedia Systems | 2022年 / 28卷

关键词：

Human pose estimation; Fixed-resolution representation; Multi-receptive fields; Feature extraction; Information selection;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Human pose estimation from a single image is a fundamental yet challenging task in computer vision. Most existing methods gradually generated multi-resolution from high-resolution to low-resolution, then recovered the higher resolution from the low resolution and used it to generate final pose heatmaps, such as Hourglass and HRNet and their variants. In this paper, we propose a novel architecture named fixed-resolution representation network for human pose estimation, which maintains fixed-resolution through the whole process to keep rich spatial-structural information. An Improved Pyramid Convolutional Bottleneck (IPCB) is firstly proposed to encode feature maps with multi receptive fields with the same resolution. Secondly, we introduce an efficient channel attention mechanism to enhance the feature extraction and information selection capability of IPCB, making the performance of IPCB better. Thirdly, considering the deviation from using the flip test of reasoning, we use an existing technology: Unbiased Data Processing. Fourthly, due to the change of the model structure and the limited computing resources, we introduce an iterative retraining strategy to solve the problem of pre-training. We empirically demonstrate the effectiveness of our method and achieve a competitive performance with 1.7M parameters and 3G FLOPs, 89.5 (PCKh@0.5) and 92.7 (PCK@0.2) respectively, compared with the state-of-the-art methods on the benchmark dataset: the MPII and LSP key points detection dataset.

引用

页码：1597 / 1609

页数：12

共 50 条

[1] Fixed-resolution representation network for human pose estimation
Liu, Yongxiang
Hou, Xiaorong
MULTIMEDIA SYSTEMS, 2022, 28 (05) : 1597 - 1609
[2] Deep High-Resolution Representation Learning for Human Pose Estimation
Sun, Ke
Xiao, Bin
Liu, Dong
Wang, Jingdong
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5686 - 5696
[3] SHaRPose: Sparse High-Resolution Representation for Human Pose Estimation
An, Xiaoqi
Zhao, Lin
Gong, Chen
Wang, Nannan
Wang, Di
Yang, Jian
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 2, 2024, : 691 - 699
[4] Simple Multi-Resolution Representation Learning for Human Pose Estimation
Tran, Trung Q.
Nguyen, Giang, V
Kim, Daeyoung
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 511 - 518
[5] Lightweight Multi-Resolution Network for Human Pose Estimation
Li, Pengxin
Wang, Rong
Zhang, Wenjing
Liu, Yinuo
Xu, Chenyue
CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2024, 138 (03): : 2239 - 2255
[6] Ghost shuffle lightweight pose network with effective feature representation and learning for human pose estimation
Yang, Senquan
Wen, Jiajun
Fan, Junjun
IET COMPUTER VISION, 2022, 16 (06) : 525 - 540
[7] Human Pose Estimation Based on Attention Multi-resolution Network
Zhang, Congcong
He, Ning
Sun, Qixiang
Yin, Xiaojie
Lu, Ke
PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 682 - 687
[8] High-Resolution with Global Context Network for Human Pose Estimation
Wang, Kehao
Li, Chenglin
Ren, Ruiqi
2022 27TH ASIA PACIFIC CONFERENCE ON COMMUNICATIONS (APCC 2022): CREATING INNOVATIVE COMMUNICATION TECHNOLOGIES FOR POST-PANDEMIC ERA, 2022, : 621 - 626
[9] Lightweight and Efficient High-Resolution Network for Human Pose Estimation
Liu, Jiarui
Gong, Xiugang
Guo, Qun
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (08) : 232 - 240
[10] FastNet: Fast high-resolution network for human pose estimation
Luo, Yanmin
Ou, Zhilong
Wan, Tianjun
Guo, Jing-Ming
IMAGE AND VISION COMPUTING, 2022, 119

← 1 2 3 4 5 →