Bi-Pose: Bidirectional 2D-3D Transformation for Human Pose Estimation From a Monocular Camera

被引:2
|
作者
Du, Songlin [1 ,2 ]
Wang, Hao [3 ]
Yuan, Zhiwei [1 ,2 ]
Ikenaga, Takeshi [3 ]
机构
[1] Southeast Univ, Sch Automat, Nanjing 210096, Peoples R China
[2] Southeast Univ, Shenzhen Res Inst, Shenzhen 518063, Peoples R China
[3] Waseda Univ, Grad Sch Informat Prod & Syst, Kitakyushu 8080135, Japan
基金
中国国家自然科学基金;
关键词
3D human pose estimation; human-centered automation systems; bidirectional 2D-3D transformation; image-assisted 3D offset prediction; bone-length stability; ALGORITHM; TRACKING; NETWORK;
D O I
10.1109/TASE.2023.3279928
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatically estimating 3D human poses in video and inferring their meanings play an essential role in many human-centered automation systems. Existing researches made remarkable progresses by first estimating 2D human joints in video and then reconstructing 3D human pose from the 2D joints. However, mono-directionally reconstructing 3D pose from 2D joints ignores the interaction between information in 3D space and 2D space, losses rich information of original video, therefore limits the ceiling of estimation accuracy. To this end, this paper proposes a bidirectional 2D-3D transformation framework that bidirectionally exchanges 2D and 3D information and utilizes video information to estimate an offset for refining 3D human pose. In addition, a bone-length stability loss is utilized for the purpose of exploring human body structure to make the estimated 3D pose more natural and to further increase the overall accuracy. By evaluation, estimation error of the proposed method, measured by the mean per joint position error (MPJPE), is only 46.5 mm, which is much lower than state-of-the-art methods under the same experimental condition. The improvement on accuracy will make machines to better understand human poses for building superior human-centered automation systems.
引用
收藏
页码:3483 / 3496
页数:14
相关论文
共 50 条
  • [1] Estimation of camera pose using 2D-3D occluded corner correspondence
    Shi, FH
    Zhang, XY
    Liu, HJ
    Liu, YC
    2004 7TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS 1-3, 2004, : 1256 - 1259
  • [2] 3D Face pose estimation and tracking from a monocular camera
    Ji, Q
    IMAGE AND VISION COMPUTING, 2002, 20 (07) : 499 - 511
  • [3] A new method of camera pose estimation using 2D-3D corner correspondence
    Shi, FH
    Zhang, XY
    Liu, YC
    PATTERN RECOGNITION LETTERS, 2004, 25 (10) : 1155 - 1163
  • [4] Adapted human pose: monocular 3D human pose estimation with zero real 3D pose data
    Liu, Shuangjun
    Sehgal, Naveen
    Ostadabbas, Sarah
    APPLIED INTELLIGENCE, 2022, 52 (12) : 14491 - 14506
  • [5] Using Specular Highlights as Pose Invariant Features for 2D-3D Pose Estimation
    Netz, Aaron
    Osadchy, Margarita
    2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011, : 721 - 728
  • [6] Evaluation of Human Pose Estimation in 3D with Monocular Camera for Clinical Application
    Carrasco-Plaza, Jose
    Cerda, Mauricio
    INTELLIGENT COMPUTING SYSTEMS (ISICS 2022), 2022, 1569 : 121 - 134
  • [7] Adapted human pose: monocular 3D human pose estimation with zero real 3D pose data
    Shuangjun Liu
    Naveen Sehgal
    Sarah Ostadabbas
    Applied Intelligence, 2022, 52 : 14491 - 14506
  • [8] 2D-3D pose consistency-based conditional random fields for 3D human pose estimation
    Chang, Ju Yong
    Lee, Kyoung Mu
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2018, 169 : 52 - 61
  • [9] Evaluating Recent 2D Human Pose Estimators for 2D-3D Pose Lifting
    Mehraban, Soroush
    Qin, Yiqian
    Taati, Babak
    2024 IEEE 18TH INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, FG 2024, 2024,
  • [10] A survey on monocular 3D human pose estimation
    Ji X.
    Fang Q.
    Dong J.
    Shuai Q.
    Jiang W.
    Zhou X.
    Virtual Reality and Intelligent Hardware, 2020, 2 (06): : 471 - 500