2D Human pose estimation: a survey

被引:0
|
作者
Haoming Chen
Runyang Feng
Sifan Wu
Hao Xu
Fengcheng Zhou
Zhenguang Liu
机构
[1] Zhejiang Gongshang University,
[2] Zhejiang Lab,undefined
来源
Multimedia Systems | 2023年 / 29卷
关键词
Human pose estimation; Pose estimation; Survey; Deep learning; Convolutional neural network;
D O I
暂无
中图分类号
学科分类号
摘要
Human pose estimation aims at localizing human anatomical keypoints or body parts in the input data (e.g., images, videos, or signals). It forms a crucial component in enabling machines to have an insightful understanding of the behaviors of humans, and has become a salient problem in computer vision and related fields. Deep learning techniques allow learning feature representations directly from the data, significantly pushing the performance boundary of human pose estimation. In this paper, we reap the recent achievements of 2D human pose estimation methods and present a comprehensive survey. Briefly, existing approaches put their efforts in three directions, namely network architecture design, network training refinement, and post processing. Network architecture design looks at the architecture of human pose estimation models, extracting more robust features for keypoint recognition and localization. Network training refinement tap into the training of neural networks and aims to improve the representational ability of models. Post processing further incorporates model-agnostic polishing strategies to improve the performance of keypoint detection. More than 200 research contributions are involved in this survey, covering methodological frameworks, common benchmark datasets, evaluation metrics, and performance comparisons. We seek to provide researchers with a more comprehensive and systematic review on human pose estimation, allowing them to acquire a grand panorama and better identify future directions.
引用
收藏
页码:3115 / 3138
页数:23
相关论文
共 50 条
  • [31] 2D and 3D Human Pose Estimation and Analysis Using Deep Learning
    Yadav, Anju
    Saxena, Rahul
    Bhattacharya, Anubhav
    Pal, Vipin
    Pathak, Nitish
    ADVANCES IN INFORMATION COMMUNICATION TECHNOLOGY AND COMPUTING, AICTC 2021, 2022, 392 : 133 - 143
  • [32] 3D Human Pose Estimation using 2D Body Part Detectors
    Barbulescu, Adela
    Gong, Wenjuan
    Gonzalez, Jordi
    Moeslund, Thomas B.
    Xavier Roca, F.
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 2484 - 2487
  • [33] Cofopose: Conditional 2D Pose Estimation with Transformers
    Aidoo, Evans
    Wang, Xun
    Liu, Zhenguang
    Tenagyei, Edwin Kwadwo
    Owusu-Agyemang, Kwabena
    Kodjiku, Seth Larweh
    Ejianya, Victor Nonso
    Aggrey, Esther Stacy E. B.
    SENSORS, 2022, 22 (18)
  • [34] A survey on deep 3D human pose estimation
    Neupane, Rama Bastola
    Li, Kan
    Boka, Tesfaye Fenta
    ARTIFICIAL INTELLIGENCE REVIEW, 2024, 58 (01)
  • [35] A survey on monocular 3D human pose estimation
    Ji X.
    Fang Q.
    Dong J.
    Shuai Q.
    Jiang W.
    Zhou X.
    Virtual Reality and Intelligent Hardware, 2020, 2 (06): : 471 - 500
  • [36] Interact-Pose Datasets for 2D Human Pose Estimation in Multi-person Interaction Scene
    Jiang, Yifei
    Gao, Hao
    ARTIFICIAL INTELLIGENCE AND ROBOTICS, ISAIR 2022, PT II, 2022, 1701 : 211 - 223
  • [37] 2D Articulated Human Pose Estimation and Retrieval in (Almost) Unconstrained Still Images
    Eichner, M.
    Marin-Jimenez, M.
    Zisserman, A.
    Ferrari, V.
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2012, 99 (02) : 190 - 214
  • [38] LiteDEKR: End-to-end lite 2D human pose estimation network
    Lv, Xueqiang
    Hao, Wei
    Tian, Lianghai
    Han, Jing
    Chen, Yuzhong
    Cai, Zangtai
    IET IMAGE PROCESSING, 2023, 17 (12) : 3392 - 3400
  • [39] An efficient and accurate 2D human pose estimation method using VTTransPose network
    Li, Rui
    Li, Qi
    Yang, Shiqiang
    Zeng, Xin
    Yan, An
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [40] 2D Articulated Human Pose Estimation and Retrieval in (Almost) Unconstrained Still Images
    M. Eichner
    M. Marin-Jimenez
    A. Zisserman
    V. Ferrari
    International Journal of Computer Vision, 2012, 99 : 190 - 214