Semantic-aware hyper-space deformable neural radiance fields for facial avatar reconstruction

被引:1
|
作者
Jin, Kaixin [1 ]
Gu, Xiaoling [1 ]
Wang, Zimeng [1 ]
Kuang, Zhenzhong [1 ]
Wu, Zizhao [1 ]
Tan, Min [1 ]
Yu, Jun [1 ]
机构
[1] Hangzhou Dianzi Univ, Sch Comp Sci & Technol, Key Lab Complex Syst Modeling & Simulat, 1158,2 Baiyang St, Hangzhou 310018, Zhejiang, Peoples R China
基金
美国国家科学基金会;
关键词
Facial avatar reconstruction; Hyper-space deformation; Semantic guidance; Neural radiance fields;
D O I
10.1016/j.patrec.2024.08.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High-fidelity facial avatar reconstruction from monocular videos is a prominent research problem in computer graphics and computer vision. Recent advancements in the Neural Radiance Field (NeRF) have demonstrated remarkable proficiency in rendering novel views and garnered attention for its potential in facial avatar reconstruction. However, previous methodologies have overlooked the complex motion dynamics present across the head, torso, and intricate facial features. Additionally, a deficiency exists in a generalized NeRF-based framework for facial avatar reconstruction adaptable to either 3DMM coefficients or audio input. To tackle these challenges, we propose an innovative framework that leverages semantic-aware hyper-space deformable NeRF, facilitating the reconstruction of high-fidelity facial avatars from either 3DMM coefficients or audio features. Our framework effectively addresses both localized facial movements and broader head and torso motions through semantic guidance and a unified hyper-space deformation module. Specifically, we adopt a dynamic weighted ray sampling strategy to allocate varying degrees of attention to distinct semantic regions, enhancing the deformable NeRF framework with semantic guidance to capture fine-grained details across diverse facial regions. Moreover, we introduce a hyper-space deformation module that enables the transformation of observation space coordinates into canonical hyper-space coordinates, allowing for the learning of natural facial deformation and head-torso movements. Extensive experiments validate the superiority of our framework over existing state-of-the-art methods, demonstrating its effectiveness in producing realistic and expressive facial avatars. Our code is available at https://github.com/jematy/SAHS-Deformable-Nerf.
引用
收藏
页码:160 / 166
页数:7
相关论文
共 7 条
  • [1] Dynamic Neural Radiance Fields for Monocular 4D Facial Avatar Reconstruction
    Gafni, Guy
    Thies, Justus
    Zollhoefer, Michael
    Niessner, Matthias
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8645 - 8654
  • [2] 4D Facial Avatar Reconstruction From Monocular Video via Efficient and Controllable Neural Radiance Fields
    Kwak, Jeong-Gi
    Ko, Hanseok
    IEEE ACCESS, 2024, 12 : 15675 - 15683
  • [3] Geometry-aware Reconstruction and Fusion-refined Rendering for Generalizable Neural Radiance Fields
    Liu, Tianqi
    Ye, Xinyi
    Shi, Min
    Huang, Zihao
    Pan, Zhiyu
    Peng, Zhan
    Cao, Zhiguo
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 7654 - 7663
  • [4] Scale-aware monocular reconstruction via robot kinematics and visual data in neural radiance fields
    Wei, Ruofeng
    Guo, Jiaxin
    Lu, Yiang
    Zhong, Fangxun
    Sun, Dong
    Dou, Qi
    ARTIFICIAL INTELLIGENCE SURGERY, 2024, 4 (03): : 187 - 198
  • [5] CoupNeRF: Property-aware Neural Radiance Fields for Multi-Material Coupled Scenario Reconstruction
    Li, Jin
    Gao, Yang
    Song, Wenfeng
    Li, Yacong
    Li, Shuai
    Hao, Aimin
    Qin, Hong
    COMPUTER GRAPHICS FORUM, 2024, 43 (07)
  • [6] Enhancing endoscopic scene reconstruction with color-aware inverse rendering through neural SDF and radiance fields
    Qin, Zhibao
    Chen, Qi
    Qian, Kai
    Zheng, Qinhong
    Shi, Junsheng
    Tai, Yonghang
    BIOMEDICAL OPTICS EXPRESS, 2024, 15 (06): : 3914 - 3931
  • [7] EndoSelf: Self-supervised Monocular 3D Scene Reconstruction of Deformable Tissues with Neural Radiance Fields on Endoscopic Videos
    Li, Wenda
    Hayashi, Yuichiro
    Oda, Masahiro
    Kitasaka, Takayuki
    Misawa, Kazunari
    Mori, Kensaku
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT VI, 2024, 15006 : 241 - 251