Bi-Pose: Bidirectional 2D-3D Transformation for Human Pose Estimation From a Monocular Camera

被引:2
|
作者
Du, Songlin [1 ,2 ]
Wang, Hao [3 ]
Yuan, Zhiwei [1 ,2 ]
Ikenaga, Takeshi [3 ]
机构
[1] Southeast Univ, Sch Automat, Nanjing 210096, Peoples R China
[2] Southeast Univ, Shenzhen Res Inst, Shenzhen 518063, Peoples R China
[3] Waseda Univ, Grad Sch Informat Prod & Syst, Kitakyushu 8080135, Japan
基金
中国国家自然科学基金;
关键词
3D human pose estimation; human-centered automation systems; bidirectional 2D-3D transformation; image-assisted 3D offset prediction; bone-length stability; ALGORITHM; TRACKING; NETWORK;
D O I
10.1109/TASE.2023.3279928
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatically estimating 3D human poses in video and inferring their meanings play an essential role in many human-centered automation systems. Existing researches made remarkable progresses by first estimating 2D human joints in video and then reconstructing 3D human pose from the 2D joints. However, mono-directionally reconstructing 3D pose from 2D joints ignores the interaction between information in 3D space and 2D space, losses rich information of original video, therefore limits the ceiling of estimation accuracy. To this end, this paper proposes a bidirectional 2D-3D transformation framework that bidirectionally exchanges 2D and 3D information and utilizes video information to estimate an offset for refining 3D human pose. In addition, a bone-length stability loss is utilized for the purpose of exploring human body structure to make the estimated 3D pose more natural and to further increase the overall accuracy. By evaluation, estimation error of the proposed method, measured by the mean per joint position error (MPJPE), is only 46.5 mm, which is much lower than state-of-the-art methods under the same experimental condition. The improvement on accuracy will make machines to better understand human poses for building superior human-centered automation systems.
引用
收藏
页码:3483 / 3496
页数:14
相关论文
共 50 条
  • [31] Joint Optimization of the 3D Model and 6D Pose for Monocular Pose Estimation
    Guo, Liangchao
    Chen, Lin
    Wang, Qiufu
    Zhang, Zhuo
    Sun, Xiaoliang
    DRONES, 2024, 8 (11)
  • [32] 3D Head pose estimation and camera mouse implementation using a monocular video camera
    Masoomeh Nabati
    Alireza Behrad
    Signal, Image and Video Processing, 2015, 9 : 39 - 44
  • [33] Monocular 3D Pose Estimation via Pose Grammar and Data Augmentation
    Xu, Yuanlu
    Wang, Wenguan
    Liu, Tengyu
    Liu, Xiaobai
    Xie, Jianwen
    Zhu, Song-Chun
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (10) : 6327 - 6344
  • [34] Monocular 3D Human Pose Estimation by Generation and Ordinal Ranking
    Sharma, Saurabh
    Varigonda, Pavan Teja
    Bindal, Prashast
    Sharma, Abhishek
    Jain, Arjun
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 2325 - 2334
  • [35] Double chain networks for monocular 3D human pose estimation
    Bai, Guihu
    Luo, Yanmin
    Pan, Xueliang
    Wang, Youjie
    Wang, Jia
    Guo, Jingming
    IMAGE AND VISION COMPUTING, 2022, 123
  • [36] Deep Kinematics Analysis for Monocular 3D Human Pose Estimation
    Xu, Jingwei
    Yu, Zhenbo
    Ni, Bingbing
    Yang, Jiancheng
    Yang, Xiaokang
    Zhang, Wenjun
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 896 - 905
  • [37] TSwinPose: Enhanced monocular 3D human pose estimation with JointFlow
    Li, Muyu
    Hu, Henan
    Xiong, Jingjing
    Zhao, Xudong
    Yan, Hong
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249
  • [38] LEARNING MONOCULAR 3D HUMAN POSE ESTIMATION WITH SKELETAL INTERPOLATION
    Chen, Ziyi
    Sugimoto, Akihiro
    Lai, Shang-Hong
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4218 - 4222
  • [39] Probabilistic Monocular 3D Human Pose Estimation with Normalizing Flows
    Wehrbein, Tom
    Rudolph, Marco
    Rosenhahn, Bodo
    Wandt, Bastian
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11179 - 11188
  • [40] On the Effect of Temporal Information on Monocular 3D Human Pose Estimation
    Brauer, Juergen
    Gong, Wenjuan
    Gonzalez, Jordi
    Arens, Michael
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCV WORKSHOPS), 2011,