共 49 条
Preface: A Data-driven Volumetric Prior for Few-shot Ultra High-resolution Face Synthesis
被引:7
|作者:
Buhler, Marcel C.
[1
,2
]
Sarkar, Kripasindhu
[2
]
Shah, Tanmay
[2
]
Li, Gengyan
[1
,2
]
Wang, Daoye
[2
]
Helminger, Leonhard
[2
]
Orts-Escolano, Sergio
[2
]
Lagun, Dmitry
[2
]
Hilliges, Otmar
[1
]
Beeler, Thabo
[2
]
Meka, Abhimitra
[2
]
机构:
[1] Swiss Fed Inst Technol, Zurich, Switzerland
[2] Google, Mountain View, CA 94043 USA
关键词:
D O I:
10.1109/ICCV51070.2023.00315
中图分类号:
TP18 [人工智能理论];
学科分类号:
081104 ;
0812 ;
0835 ;
1405 ;
摘要:
NeRFs have enabled highly realistic synthesis of human faces including complex appearance and reflectance effects of hair and skin. These methods typically require a large number of multi-view input images, making the process hardware intensive and cumbersome, limiting applicability to unconstrained settings. We propose a novel volumetric human face prior that enables the synthesis of ultra highresolution novel views of subjects that are not part of the prior's training distribution. This prior model consists of an identity-conditioned NeRF, trained on a dataset of lowresolution multi-view images of diverse humans with known camera calibration. A simple sparse landmark-based 3D alignment of the training dataset allows our model to learn a smooth latent space of geometry and appearance despite a limited number of training identities. A high-quality volumetric representation of a novel subject can be obtained by model fitting to 2 or 3 camera views of arbitrary resolution. Importantly, our method requires as few as two views of casually captured images as input at inference time.
引用
收藏
页码:3379 / 3390
页数:12
相关论文