4D Facial Avatar Reconstruction From Monocular Video via Efficient and Controllable Neural Radiance Fields

被引:1
|
作者
Kwak, Jeong-Gi [1 ]
Ko, Hanseok [1 ]
机构
[1] Korea Univ, Sch Elect Engn, Seoul 02841, South Korea
关键词
Neural radiance field (NeRF); monocular facial avatar reconstruction; face reenactment;
D O I
10.1109/ACCESS.2024.3355052
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present an efficient approach for monocular 4D facial avatar reconstruction using a dynamic neural radiance field (NeRF). Over the years, NeRFs have been popular methods for 3D scene representation, but lack computational efficiency and controllabilty, thus it is impractical for real world application such as AR/VR, teleconferencing, and immersive experiences. Recent the introduction of grid-based encoding by InstantNGP has enabled the rendering process of NeRF much faster, but it is limited to static 3D scenes. To address the issues, we focus on developing a novel dynamic NeRF that allows explicit control over pose and facial expression, while keeping the computational efficiency. By leveraging a low-dimensional basis from the morphable model (3DMM) with elaborately designed spatial encoding branch and ambient encoding branch, we condition a dynamic radiance field in an ambient space, improving controllability and visual quality. Our model achieves rendering speeds approximately 30x faster at training and 100x faster at inference than the baseline (NeRFace), enabling practical approaches for real world applications. Through qualitative and quantitative experiments, we demonstrate the effectiveness of our approach. The dynamic NeRF exhibits superior controllability, enhanced 3D consistency, and improved visual quality. Our efficient model opens new possibilities for real-time applications, revolutionizing AR/VR and teleconferencing experiences.
引用
收藏
页码:15675 / 15683
页数:9
相关论文
共 39 条
  • [31] NeRF-OR: neural radiance fields for operating room scene reconstruction from sparse-view RGB-D videos
    Gerats, Beerend G. A.
    Wolterink, Jelmer M.
    Broeders, Ivo A. M. J.
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2025, 20 (01) : 147 - 156
  • [32] Learning Parallel Dense Correspondence from Spatio-Temporal Descriptors for Efficient and Robust 4D Reconstruction
    Tang, Jiapeng
    Xu, Dan
    Jia, Kui
    Zhang, Lei
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 6018 - 6027
  • [33] 4DRecons: 4D Neural Implicit Deformable Objects Reconstruction from a single RGB-D Camera with Geometrical and Topological Regularizations
    Cong, Xiaoyan
    Yang, Haitao
    Chen, Liyan
    Zhang, Kaifeng
    Bajaj, Chandrajit
    Yi, Li
    Huang, Qixing
    arXiv,
  • [34] Speech Motion Anomaly Detection via Cross-Modal Translation of 4D Motion Fields from Tagged MRI
    Liu, Xiaofeng
    Xing, Fangxu
    Zhuo, Jiachen
    Stone, Maureen
    Prince, Jerry L.
    El Fakhri, Georges
    Woo, Jonghye
    MEDICAL IMAGING 2024: IMAGE PROCESSING, 2024, 12926
  • [35] 4D Computed Tomography Reconstruction from Few-Projection Data via Temporal Non-local Regularization
    Jia, Xun
    Lou, Yifei
    Dong, Bin
    Tian, Zhen
    Jiang, Steve
    MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION - MICCAI 2010, PT I, 2010, 6361 : 143 - +
  • [36] FVMD-ISRe: 3-D Reconstruction From Few-View Multidate Satellite Images Based on the Implicit Surface Representation of Neural Radiance Fields
    Zhang, Chi
    Yan, Yiming
    Zhao, Chunhui
    Su, Nan
    Zhou, Weikun
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 14
  • [37] EFFICIENT SYNCHRONIZATION AND RECONSTRUCTION OF 4D NON-GATED CARDIAC IMAGES OF CHICK EMBRYOS OBTAINED FROM OPTICAL COHERENCE TOMOGRAPHY
    Liu, Aiping
    Wang, Ruikang
    Thornburg, Kent L.
    Rugonyi, Sandra
    PROCEEDINGS OF THE ASME SUMMER BIOENGINEERING CONFERENCE - 2009, PT A AND B, 2009, : 1063 - 1064
  • [38] Efficient 4D shape completion from sparse samples via cubic spline fitting in linear rotation-invariant space
    Xia, Qing
    Chen, Chengju
    Liu, Jiarui
    Li, Shuai
    Hao, Aimin
    Qin, Hong
    COMPUTERS & GRAPHICS-UK, 2019, 82 : 129 - 139
  • [39] Machine learning framework for the real-time reconstruction of regional 4D ocean temperature fields from historical reanalysis data and real-time satellite and buoy surface measurements
    Champenois, Bianca
    Sapsis, Themistoklis
    PHYSICA D-NONLINEAR PHENOMENA, 2024, 459