Fixed-resolution representation network for human pose estimation

Cited by: 0

Authors:

Yongxiang Liu

Xiaorong Hou

Affiliation:

[1] University of Electronic Science and Technology of China, School of Automation Engineering

Source:

Multimedia Systems | 2022 / Vol. 28

Keywords:

Human pose estimation; Fixed-resolution representation; Multi-receptive fields; Feature extraction; Information selection;

DOI:

Not available

Chinese Library Classification number:

Subject classification number:

Abstract:

Human pose estimation from a single image is a fundamental yet challenging task in computer vision. Most existing methods, such as Hourglass, HRNet, and their variants, progressively reduce high-resolution feature maps to low resolution, then recover a higher resolution from the low-resolution maps and use it to generate the final pose heatmaps. In this paper, we propose a novel architecture, the fixed-resolution representation network for human pose estimation, which maintains a fixed resolution throughout the whole process to preserve rich spatial-structural information. First, we propose an Improved Pyramid Convolutional Bottleneck (IPCB) that encodes feature maps with multiple receptive fields at the same resolution. Second, we introduce an efficient channel attention mechanism to enhance the feature extraction and information selection capability of IPCB. Third, to address the bias introduced by flip testing at inference, we adopt an existing technique, Unbiased Data Processing. Fourth, because the model structure has changed and computing resources are limited, we introduce an iterative retraining strategy to address the problem of pre-training. We empirically demonstrate the effectiveness of our method: with 1.7M parameters and 3G FLOPs, it achieves performance competitive with state-of-the-art methods on the MPII and LSP keypoint detection benchmarks, reaching 89.5 (PCKh@0.5) and 92.7 (PCK@0.2), respectively.
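
The PCKh@0.5 score quoted in the abstract counts a predicted keypoint as correct when its distance to the ground-truth keypoint is at most 0.5 times the head segment length of that person. The following is a minimal sketch of that metric, not the authors' evaluation code; the function name pckh, the argument layout, and the toy coordinates are assumptions made for illustration.

```python
import math


def pckh(pred, gt, head_sizes, alpha=0.5):
    """Fraction of keypoints whose error is within alpha * head size.

    pred, gt: list of samples, each a list of (x, y) keypoints (hypothetical layout).
    head_sizes: one head-segment length per sample.
    """
    correct = 0
    total = 0
    for sample_pred, sample_gt, head in zip(pred, gt, head_sizes):
        threshold = alpha * head  # per-person distance threshold
        for (px, py), (gx, gy) in zip(sample_pred, sample_gt):
            dist = math.hypot(px - gx, py - gy)  # Euclidean error in pixels
            correct += dist <= threshold
            total += 1
    return correct / total if total else 0.0


# Toy data (made up): one person, two keypoints, head size 6 px.
pred = [[(10.0, 10.0), (20.0, 24.0)]]
gt = [[(10.0, 12.0), (20.0, 20.0)]]
score = pckh(pred, gt, head_sizes=[6.0])
```

PCK@0.2, as typically reported on LSP, follows the same pattern with alpha set to 0.2 and a body-scale reference length (commonly the torso size) in place of the head size.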


Pages: 1597-1609

Page count: 12

Related records: 50 in total

[31] Multistage attention network for human pose estimation
Zhou, Jingyang
Wen, Guangzhao
Zhang, Yu
Geng, Xin
JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (06)
[32] Structure guided network for human pose estimation
Yilei Chen
Xuemei Xie
Wenjie Yin
Bo’ao Li
Fu Li
Applied Intelligence, 2023, 53 : 21012 - 21026
[33] Learning high resolution reservation for human pose estimation
Gao, Bingkun
Ma, Ke
Bi, Hongbo
Wang, Ling
Wu, Chenlei
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (19) : 29251 - 29265
[34] Efficient High-Resolution Human Pose Estimation
Qin, Xiaofei
Qiu, Lingfeng
He, Changxiang
Zhang, Xuedian
PRICAI 2022: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2022, 13631 : 383 - 396
[35] A residual semantic graph convolutional network with high-resolution representation for 3D human pose estimation in a virtual fashion show
Zhang P.
Ding P.
Li G.
Zhang J.
Multimedia Tools and Applications, 2024, 83 (29) : 73649 - 73669
[36] Learning high resolution reservation for human pose estimation
Bingkun Gao
Ke Ma
Hongbo Bi
Ling Wang
Chenlei Wu
Multimedia Tools and Applications, 2021, 80 : 29251 - 29265
[37] Deep High-Resolution Network With Double Attention Residual Blocks for Human Pose Estimation
Huo, Zhanqiang
Jin, Han
Qiao, Yingxu
Luo, Fen
IEEE ACCESS, 2020, 8 : 224947 - 224957
[38] High resolution network for human hand pose estimation based on 3D convolution
Sang N.
Li M.
1600, Huazhong University of Science and Technology (48): : 1 - 6
[39] Multi-scale Attention Aided Multi-Resolution Network for Human Pose Estimation
Selvam, Srinika
Mishra, Deepak
PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2019, PT I, 2019, 11941 : 461 - 472
[40] DB-HRNet: Dual Branch High-Resolution Network for Human Pose Estimation
Wang, Yanxia
Wang, Renjie
Shi, Hu
IEEE ACCESS, 2023, 11 : 120628 - 120641
