Temporal Consistency Loss for High Resolution Textured and Clothed 3D Human Reconstruction from Monocular Video

被引:2
|
作者
Caliskan, Akin [1 ]
Mustafa, Armin [1 ]
Hilton, Adrian [1 ]
机构
[1] Univ Surrey, CVSSP, Guildford, Surrey, England
基金
英国工程与自然科学研究理事会;
关键词
D O I
10.1109/CVPRW53098.2021.00197
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a novel method to learn temporally consistent 3D reconstruction of clothed people from a monocular video. Recent methods for 3D human reconstruction from monocular video using volumetric, implicit or parametric human shape models, produce per frame reconstructions giving temporally inconsistent output and limited performance when applied to video. In this paper we introduce an approach to learn temporally consistent features for textured reconstruction of clothed 3D human sequences from monocular video by proposing two advances: a novel temporal consistency loss function; and hybrid representation learning for implicit 3D reconstruction from 2D images and coarse 3D geometry. The proposed advances improve the temporal consistency and accuracy of both the 3D reconstruction and texture prediction from a monocular video. Comprehensive comparative performance evaluation on images of people demonstrates that the proposed method significantly outperforms the state-of-the-art learning-based single image 3D human shape estimation approaches achieving significant improvement of reconstruction accuracy, completeness, quality and temporal consistency.
引用
收藏
页码:1780 / 1790
页数:11
相关论文
共 50 条
  • [31] PanoRecon: Real-Time Panoptic 3D Reconstruction from Monocular Video
    Wu, Dong
    Yan, Zike
    Zha, Hongbin
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 21507 - 21518
  • [32] High resolution 3d reconstruction of the human gastroesophageal junction
    Yassi, Rita
    Cheng, Leo K.
    Gerneke, Dane
    Legrice, Ian
    Sands, Greg B.
    Windsor, John A.
    Pullan, Andrew J.
    GASTROENTEROLOGY, 2007, 132 (04) : A171 - A171
  • [33] Cyclic Test-Time Adaptation on Monocular Video for 3D Human Mesh Reconstruction
    Nam, Hyeongjin
    Jung, Daniel Sungho
    Oh, Yeonguk
    Lee, Kyoung Mu
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 14783 - 14793
  • [34] Bayesian 3D tracking from monocular video
    Brau, Ernesto
    Guan, Jinyan
    Simek, Kyle
    Del Pero, Luca
    Dawson, Colin Reimer
    Barnard, Kobus
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 3368 - 3375
  • [35] Efficient 3D recovery of human motion in monocular video
    Chen, Cheng
    Xiao, Jun
    Zhuang, Yueting
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2009, 21 (08): : 1118 - 1126
  • [36] Single-Image 3D Human Pose and Shape Estimation Enhanced by Clothed 3D Human Reconstruction
    Liu, Leyuan
    Gao, Yunqi
    Sun, Jianchi
    Chen, Jingying
    ARTIFICIAL INTELLIGENCE AND ROBOTICS, ISAIR 2023, 2024, 1998 : 33 - 44
  • [37] 3D model localisation using high-resolution reconstruction of monocular image sequences
    Serra, B
    AUTOMATIC TARGET RECOGNITION VII, 1997, 3069 : 282 - 293
  • [38] 3D hand reconstruction from a monocular view
    Lee, SU
    Cohen, I
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, 2004, : 310 - 313
  • [39] Temporally consistent reconstruction of 3D clothed human surface with warp field
    Deng, Yong
    Li, Baoxing
    Yang, Yehui
    Zhao, Xu
    IMAGE AND VISION COMPUTING, 2023, 137
  • [40] Capturing Humans in Motion: Temporal-Attentive 3D Human Pose and Shape Estimation from Monocular Video
    Wei, Wen-Li
    Lin, Jen-Chun
    Liu, Tyng-Luh
    Liao, Hong-Yuan Mark
    arXiv, 2022,