4D Facial Avatar Reconstruction From Monocular Video via Efficient and Controllable Neural Radiance Fields

Cited by: 1
Authors
Kwak, Jeong-Gi [1]
Ko, Hanseok [1]
Affiliations
[1] Korea Univ, Sch Elect Engn, Seoul 02841, South Korea
Keywords
Neural radiance field (NeRF); monocular facial avatar reconstruction; face reenactment
DOI
10.1109/ACCESS.2024.3355052
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
We present an efficient approach for monocular 4D facial avatar reconstruction using a dynamic neural radiance field (NeRF). Over the years, NeRFs have become popular for 3D scene representation, but they lack computational efficiency and controllability, which makes them impractical for real-world applications such as AR/VR, teleconferencing, and immersive experiences. The recent introduction of grid-based encoding by InstantNGP has made NeRF rendering much faster, but it is limited to static 3D scenes. To address these issues, we develop a novel dynamic NeRF that allows explicit control over pose and facial expression while retaining computational efficiency. By leveraging a low-dimensional basis from the 3D morphable model (3DMM) together with a carefully designed spatial encoding branch and ambient encoding branch, we condition a dynamic radiance field in an ambient space, improving controllability and visual quality. Our model is approximately 30x faster at training and 100x faster at inference than the baseline (NeRFace), making it practical for real-world applications. Qualitative and quantitative experiments demonstrate the effectiveness of our approach: the dynamic NeRF exhibits superior controllability, enhanced 3D consistency, and improved visual quality. Our efficient model opens new possibilities for real-time applications such as AR/VR and teleconferencing.
Pages: 15675-15683
Number of pages: 9