OmniAvatar: Geometry-Guided Controllable 3D Head Synthesis

Cited by: 6
Authors
Xu, Hongyi [1]
Song, Guoxian [1]
Jiang, Zihang [1, 2]
Zhang, Jianfeng [1, 2]
Shi, Yichun [1]
Liu, Jing [1]
Ma, Wanchun [1]
Feng, Jiashi [1]
Luo, Linjie [1]
Affiliations
[1] ByteDance Inc, Culver City, CA 90230 USA
[2] Natl Univ Singapore, Singapore, Singapore
Keywords
FIELDS;
DOI
10.1109/CVPR52729.2023.01232
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present OmniAvatar, a novel geometry-guided 3D head synthesis model trained on unstructured in-the-wild images. It synthesizes diverse, identity-preserved 3D heads with compelling dynamic details under fully disentangled control over camera pose, facial expression, head shape, and articulated neck and jaw poses. To achieve this high level of disentangled control, we first explicitly define a novel semantic signed distance function (SDF) around a head geometry (FLAME), conditioned on the control parameters. This semantic SDF allows us to build a differentiable volumetric correspondence map from the observation space to a canonical space that is disentangled from all the control parameters. We then leverage a 3D-aware GAN framework (EG3D) to synthesize the detailed shape and appearance of full 3D heads in the canonical space, followed by a volume rendering step, guided by the volumetric correspondence map, that outputs into the observation space. To ensure control accuracy over the synthesized head shapes and expressions, we introduce a geometry prior loss that conforms the output to the head SDF and a control loss that conforms it to the expression code. Further, we enhance temporal realism with dynamic details conditioned on varying expressions and joint poses. Compared with state-of-the-art methods, our model synthesizes preferable identity-preserved 3D heads with compelling dynamic details, both qualitatively and quantitatively. We also provide an ablation study to justify many of our system design choices.
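The correspondence idea in the abstract (a signed distance field around a parametric surface, used to map observation-space points into a canonical space) can be illustrated with a toy sketch. This is not the paper's formulation: a sphere stands in for the FLAME head, the radius stands in for the control parameters, and the function names `sphere_sdf` and `to_canonical` are hypothetical. The mapping rule, which keeps the signed distance and the direction to the closest surface point, is an assumption chosen only for illustration.

```python
import math


def sphere_sdf(point, radius):
    """Signed distance from `point` to an origin-centred sphere (negative inside)."""
    return math.sqrt(sum(c * c for c in point)) - radius


def to_canonical(point, obs_radius, canonical_radius=1.0):
    """Map an observation-space point to canonical space.

    Toy rule (an assumption, not the paper's method): preserve the signed
    distance and the direction to the closest surface point, and swap the
    observed sphere for the canonical one. Only meaningful while
    canonical_radius + signed_distance stays positive.
    """
    norm = math.sqrt(sum(c * c for c in point))
    if norm == 0.0:
        raise ValueError("direction is undefined at the sphere centre")
    signed_distance = norm - obs_radius
    scale = (canonical_radius + signed_distance) / norm
    return tuple(c * scale for c in point)


# A point 0.5 units outside a radius-2 stand-in "head" ...
p_obs = (0.0, 0.0, 2.5)
# ... lands 0.5 units outside the unit-radius canonical sphere.
p_can = to_canonical(p_obs, obs_radius=2.0)
```

In this sketch the signed distance is the quantity preserved across spaces, so changing `obs_radius` (the stand-in control parameter) moves the observed surface without changing where a surface-relative point lands in canonical space.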
Pages: 12814-12824
Page count: 11