Monocular Expressive 3D Human Reconstruction of Multiple People

被引:0
|
作者
Zhao, Zhenghao [1 ]
Tang, Hao [2 ]
Wan, Joy [3 ]
Yan, Yan [1 ]
机构
[1] Illinois Inst Technol, Chicago, IL 60616 USA
[2] Carnegie Mellon Univ, Pittsburgh, PA USA
[3] Univ Illinois, Urbana, IL USA
关键词
3D Pose Estimation; Whole-body Pose Estimation; Multi-person Pose Estimation;
D O I
10.1145/3652583.3658092
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Whole-body pose estimation aims to regress human pose models that include the body, hand, and facial details from RGB images. While the task of whole-body mesh recovery has been extensively studied in recent literature, the focus has predominantly been on human mesh recovery for a single person, despite the frequent occurrence of multiple people in practical scenarios. Similar to body-only cases, such single-person whole-body pose estimation methods often fail in the multiple-people problem for two reasons: (i) Given the ambiguous bounding box, which could contain more than one instance, it is difficult for single-person-oriented methods to regress the body mesh model of the target person. (ii) Single-person pose estimation approaches neglect the person-person occlusions and the depth order among instances, thus generating interpenetrated models. In this paper, we propose the Multi-person Expressive POse (MEPO) model, which exploits expressive 3D human model reconstruction for multiple people. To our best knowledge, our model is the first multi-person whole-body mesh reconstruction model, which is intensified by heatmap, depthmap, and depth order loss. We propose the Heatmap Enhancement Net (HENet) to leverage the heatmap information to assist the model in concentrating on the target person in crowded multi-person cases, while the depthmap delivers depth information of the image. Furthermore, we impose a depth order loss to recover human mesh precisely for overlapped people. In our experiments, we evaluate our model on multiple challenging datasets, including AGORA, which consists of complex occlusions similar to real-world scenarios. Our method has a significant performance improvement compared with the state-of-the-art pose estimation methods.
引用
收藏
页码:423 / 432
页数:10
相关论文
共 50 条
  • [31] Monocular 3D Tracking of Multiple Interacting Targets
    Osawa, Tatsuya
    Sudo, Kyoko
    Arai, Hiroyuki
    Koike, Hideki
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 3934 - 3937
  • [32] Monocular 3D Pose and Shape Estimation of Multiple People in Natural Scenes The Importance of Multiple Scene Constraints
    Zanfir, Andrei
    Marinoiu, Elisabeta
    Sminchisescu, Cristian
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 2148 - 2157
  • [33] Monocular 3D Body Shape Reconstruction under Clothing
    Ferrari, Claudio
    Casini, Leonardo
    Berretti, Stefano
    Del Bimbo, Alberto
    JOURNAL OF IMAGING, 2021, 7 (12)
  • [34] UAS Exploitation by 3D Reconstruction using Monocular Vision
    Diskin, Yakov
    Tompkins, R. Cortland
    Youssef, Menatoallah
    Asari, Vijayan K.
    PROCEEDINGS OF THE 24TH INTERNATIONAL TECHNICAL MEETING OF THE SATELLITE DIVISION OF THE INSTITUTE OF NAVIGATION (ION GNSS 2011), 2011, : 3596 - 3604
  • [35] Corrective 3D Reconstruction of Lips from Monocular Video
    Garrido, Pablo
    Zollhoefer, Michael
    Wu, Chenglei
    Bradley, Derek
    Perez, Patrick
    Beeler, Thabo
    Theobalt, Christian
    ACM TRANSACTIONS ON GRAPHICS, 2016, 35 (06):
  • [36] Deep monocular 3D reconstruction for assisted navigation in bronchoscopy
    Visentini-Scarzanella, Marco
    Sugiura, Takamasa
    Kaneko, Toshimitsu
    Koto, Shinichiro
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2017, 12 (07) : 1089 - 1099
  • [37] Simultaneous segmentation and 3D reconstruction of monocular image sequences
    Ozden, Kemal Egemen
    Schindler, Konrad
    Van Gool, L.
    2007 IEEE 11TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1-6, 2007, : 1072 - 1079
  • [38] Deep monocular 3D reconstruction for assisted navigation in bronchoscopy
    Marco Visentini-Scarzanella
    Takamasa Sugiura
    Toshimitsu Kaneko
    Shinichiro Koto
    International Journal of Computer Assisted Radiology and Surgery, 2017, 12 : 1089 - 1099
  • [39] A 3D reconstruction method for buildings based on monocular vision
    Xu, Boqiang
    Liu, Chao
    COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2022, 37 (03) : 354 - 369
  • [40] Monocular 3D Reconstruction of Objects Based on Cylindrical Panoramas
    Haeusler, Ralf
    Klette, Reinhard
    Huang, Fay
    ADVANCES IN IMAGE AND VIDEO TECHNOLOGY, PROCEEDINGS, 2009, 5414 : 60 - +