Monocular Expressive 3D Human Reconstruction of Multiple People

被引:0
|
作者
Zhao, Zhenghao [1 ]
Tang, Hao [2 ]
Wan, Joy [3 ]
Yan, Yan [1 ]
机构
[1] Illinois Inst Technol, Chicago, IL 60616 USA
[2] Carnegie Mellon Univ, Pittsburgh, PA USA
[3] Univ Illinois, Urbana, IL USA
关键词
3D Pose Estimation; Whole-body Pose Estimation; Multi-person Pose Estimation;
D O I
10.1145/3652583.3658092
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Whole-body pose estimation aims to regress human pose models that include the body, hand, and facial details from RGB images. While the task of whole-body mesh recovery has been extensively studied in recent literature, the focus has predominantly been on human mesh recovery for a single person, despite the frequent occurrence of multiple people in practical scenarios. Similar to body-only cases, such single-person whole-body pose estimation methods often fail in the multiple-people problem for two reasons: (i) Given the ambiguous bounding box, which could contain more than one instance, it is difficult for single-person-oriented methods to regress the body mesh model of the target person. (ii) Single-person pose estimation approaches neglect the person-person occlusions and the depth order among instances, thus generating interpenetrated models. In this paper, we propose the Multi-person Expressive POse (MEPO) model, which exploits expressive 3D human model reconstruction for multiple people. To our best knowledge, our model is the first multi-person whole-body mesh reconstruction model, which is intensified by heatmap, depthmap, and depth order loss. We propose the Heatmap Enhancement Net (HENet) to leverage the heatmap information to assist the model in concentrating on the target person in crowded multi-person cases, while the depthmap delivers depth information of the image. Furthermore, we impose a depth order loss to recover human mesh precisely for overlapped people. In our experiments, we evaluate our model on multiple challenging datasets, including AGORA, which consists of complex occlusions similar to real-world scenarios. Our method has a significant performance improvement compared with the state-of-the-art pose estimation methods.
引用
收藏
页码:423 / 432
页数:10
相关论文
共 50 条
  • [1] Monocular 3D Reconstruction of Human Body
    Zhang, Yuqi
    Li, Dewei
    Jin, Bihui
    Ku, Yunwen
    Xue, Shibei
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 7889 - 7894
  • [2] 3D Human Motion Reconstruction in Unity with Monocular Camera
    Chen, Tai-Wei
    Lin, Wei-Liang
    2020 17TH INTERNATIONAL SOC DESIGN CONFERENCE (ISOCC 2020), 2020, : 191 - 192
  • [3] Monocular, One-stage, Regression of Multiple 3D People
    Sun, Yu
    Bao, Qian
    Liu, Wu
    Fu, Yili
    Black, Michael J.
    Mei, Tao
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11159 - 11168
  • [4] Monocular 3D reconstruction of human motion in long action sequences
    Loy, G
    Eriksson, M
    Sullivan, J
    Carlsson, S
    COMPUTER VISION - ECCV 2004, PT 4, 2004, 2034 : 442 - 455
  • [5] Chasing the Tail in Monocular 3D Human Reconstruction With Prototype Memory
    Rong, Yu
    Liu, Ziwei
    Loy, Chen Change
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 2907 - 2919
  • [6] Chasing the Tail in Monocular 3D Human Reconstruction With Prototype Memory
    Rong, Yu
    Liu, Ziwei
    Loy, Chen Change
    IEEE Transactions on Image Processing, 2022, 31 : 2907 - 2919
  • [7] 3D Reconstruction of Human Motion from Monocular Image Sequences
    Wandt, Bastian
    Ackermann, Hanno
    Rosenhahn, Bodo
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (08) : 1505 - 1516
  • [8] Monocular 3D Fingerprint Reconstruction and Unwarping
    Cui, Zhe
    Feng, Jianjiang
    Zhou, Jie
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (07) : 8679 - 8695
  • [9] Monocular Visual Odometry and 3D Reconstruction
    Prozorov, Alexandr
    Volokhov, Vladimir
    Priorov, Andrew
    PROCEEDINGS OF THE 15TH CONFERENCE OF OPEN INNOVATIONS ASSOCIATION FRUCT, 2014, : 112 - 118
  • [10] 3D Reconstruction of Human Motion and Skeleton from Uncalibrated Monocular Video
    Chen, Yen-Lin
    Chai, Jinxiang
    COMPUTER VISION - ACCV 2009, PT I, 2010, 5994 : 71 - 82