Temporal Consistency Loss for High Resolution Textured and Clothed 3D Human Reconstruction from Monocular Video

被引：2

作者：

Caliskan, Akin ^{[1
]}

Mustafa, Armin ^{[1
]}

Hilton, Adrian ^{[1
]}

机构：

[1] Univ Surrey, CVSSP, Guildford, Surrey, England

来源：

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021 | 2021年

基金：

英国工程与自然科学研究理事会;

关键词：

D O I：

10.1109/CVPRW53098.2021.00197

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present a novel method to learn temporally consistent 3D reconstruction of clothed people from a monocular video. Recent methods for 3D human reconstruction from monocular video using volumetric, implicit or parametric human shape models, produce per frame reconstructions giving temporally inconsistent output and limited performance when applied to video. In this paper we introduce an approach to learn temporally consistent features for textured reconstruction of clothed 3D human sequences from monocular video by proposing two advances: a novel temporal consistency loss function; and hybrid representation learning for implicit 3D reconstruction from 2D images and coarse 3D geometry. The proposed advances improve the temporal consistency and accuracy of both the 3D reconstruction and texture prediction from a monocular video. Comprehensive comparative performance evaluation on images of people demonstrates that the proposed method significantly outperforms the state-of-the-art learning-based single image 3D human shape estimation approaches achieving significant improvement of reconstruction accuracy, completeness, quality and temporal consistency.

引用

页码：1780 / 1790

页数：11

共 50 条

[31] PanoRecon: Real-Time Panoptic 3D Reconstruction from Monocular Video
Wu, Dong
Yan, Zike
Zha, Hongbin
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 21507 - 21518
[32] High resolution 3d reconstruction of the human gastroesophageal junction
Yassi, Rita
Cheng, Leo K.
Gerneke, Dane
Legrice, Ian
Sands, Greg B.
Windsor, John A.
Pullan, Andrew J.
GASTROENTEROLOGY, 2007, 132 (04) : A171 - A171
[33] Cyclic Test-Time Adaptation on Monocular Video for 3D Human Mesh Reconstruction
Nam, Hyeongjin
Jung, Daniel Sungho
Oh, Yeonguk
Lee, Kyoung Mu
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 14783 - 14793
[34] Bayesian 3D tracking from monocular video
Brau, Ernesto
Guan, Jinyan
Simek, Kyle
Del Pero, Luca
Dawson, Colin Reimer
Barnard, Kobus
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 3368 - 3375
[35] Efficient 3D recovery of human motion in monocular video
Chen, Cheng
Xiao, Jun
Zhuang, Yueting
Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2009, 21 (08): : 1118 - 1126
[36] Single-Image 3D Human Pose and Shape Estimation Enhanced by Clothed 3D Human Reconstruction
Liu, Leyuan
Gao, Yunqi
Sun, Jianchi
Chen, Jingying
ARTIFICIAL INTELLIGENCE AND ROBOTICS, ISAIR 2023, 2024, 1998 : 33 - 44
[37] 3D model localisation using high-resolution reconstruction of monocular image sequences
Serra, B
AUTOMATIC TARGET RECOGNITION VII, 1997, 3069 : 282 - 293
[38] 3D hand reconstruction from a monocular view
Lee, SU
Cohen, I
PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, 2004, : 310 - 313
[39] Temporally consistent reconstruction of 3D clothed human surface with warp field
Deng, Yong
Li, Baoxing
Yang, Yehui
Zhao, Xu
IMAGE AND VISION COMPUTING, 2023, 137
[40] Capturing Humans in Motion: Temporal-Attentive 3D Human Pose and Shape Estimation from Monocular Video
Wei, Wen-Li
Lin, Jen-Chun
Liu, Tyng-Luh
Liao, Hong-Yuan Mark
arXiv, 2022,

← 1 2 3 4 5 →