Accurate 3D Face Reconstruction with Facial Component Tokens

被引:6
|
作者
Zhang, Tianke [1 ,2 ]
Chu, Xuangeng [2 ]
Liu, Yunfei [2 ]
Lin, Lijian [2 ]
Yang, Zhendong [2 ]
Xu, Zhengzhuo [1 ,2 ]
Cao, Chengkun [1 ,2 ]
Yu, Fei [3 ]
Zhou, Changyin [3 ]
Yuan, Chun [1 ]
Li, Yu [2 ]
机构
[1] Tsinghua Shenzhen Int Grad Sch, Shenzhen, Peoples R China
[2] IDEA, Shenzhen, Peoples R China
[3] Vistring Inc, Hong Kong, Peoples R China
基金
国家重点研发计划;
关键词
MORPHABLE MODEL;
D O I
10.1109/ICCV51070.2023.00829
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accurately reconstructing 3D faces from monocular images and videos is crucial for various applications, such as digital avatar creation. However, the current deep learning-based methods face significant challenges in achieving accurate reconstruction with disentangled facial parameters and ensuring temporal stability in single-frame methods for 3D face tracking on video data. In this paper, we propose TokenFace, a transformer-based monocular 3D face reconstruction model. TokenFace uses separate tokens for different facial components to capture information about different facial parameters and employs temporal transformers to capture temporal information from video data. This design can naturally disentangle different facial components and is flexible to both 2D and 3D training data. Trained on hybrid 2D and 3D data, our model shows its power in accurately reconstructing faces from images and producing stable results for video data. Experimental results on popular benchmarks NoW and Stirling demonstrate that TokenFace achieves state-of-the-art performance, outperforming existing methods on all metrics by a large margin.
引用
收藏
页码:8999 / 9008
页数:10
相关论文
共 50 条
  • [21] Face It: 3D Facial Reconstruction from a Single 2D Image for Games and Simulations
    Kirtzic, J. Steven
    Daescu, Ovidiu
    2011 INTERNATIONAL CONFERENCE ON CYBERWORLDS, 2011, : 244 - 248
  • [22] Facial Landmarks for Forensic Skull-Based 3D Face Reconstruction: A Literature Review
    Vezzetti, Enrico
    Marcolin, Federica
    Tornincasa, Stefano
    Moos, Sandro
    Violante, Maria Grazia
    Dagnes, Nicole
    Monno, Giuseppe
    Uva, Antonio Emmanuele
    Fiorentino, Michele
    AUGMENTED REALITY, VIRTUAL REALITY, AND COMPUTER GRAPHICS, PT I, 2016, 9768 : 172 - 180
  • [23] Analysis of facial configuration from realistic 3D face reconstruction of young Korean men
    Chin, S
    Kim, S
    SYSTEMS MODELING AND SIMULATION: THEORY AND APPLICATIONS, 2005, 3398 : 685 - 693
  • [24] Effect of Facial Feature Points Selection on 3D Face Shape Reconstruction Using Regularization
    Maghari, Ashraf Y. A.
    Liao, Iman Yi
    Belaton, Bahari
    NEURAL INFORMATION PROCESSING, ICONIP 2012, PT V, 2012, 7667 : 516 - 524
  • [25] Learning Semantic Representations via Joint 3D Face Reconstruction and Facial Attribute Estimation
    Weng, Zichun
    Xiang, Youjun
    Li, Xianfeng
    Jiang, Juntao
    Huo, Wanliang
    Fu, Yuli
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 9696 - 9702
  • [26] 3D Face Reconstruction: The Road to Forensics
    La Cava, Simone Maurizio
    Orru, Giulia
    Drahansky, Martin
    Marcialis, Gian Luca
    Roli, Fabio
    ACM COMPUTING SURVEYS, 2024, 56 (03)
  • [27] Automatic 3D reconstruction for face recognition
    Hu, YX
    Jiang, DL
    Yan, SC
    Zhang, L
    Zhang, HJ
    SIXTH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, PROCEEDINGS, 2004, : 843 - 848
  • [28] Disparity-based 3D face modeling using 3D deformable facial mask for 3D face recognition
    Ansari, A-Nasser
    Abdel-Mottaleb, Mohamed
    Mahoor, Mohammad H.
    2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 981 - +
  • [29] A Brief Survey: 3D Face Reconstruction
    Gao, Tianhan
    An, Hui
    ADVANCES ON BROAD-BAND WIRELESS COMPUTING, COMMUNICATION AND APPLICATIONS, 2020, 97 : 846 - 854
  • [30] 3D Face Reconstruction with Dense Landmarks
    Wood, Erroll
    Baltrusaitis, Tadas
    Hewitt, Charlie
    Johnson, Matthew
    Shen, Jingjing
    Milosavljevic, Nikola
    Wilde, Daniel
    Garbin, Stephan
    Sharp, Toby
    Stojiljkovic, Ivan
    Cashman, Tom
    Valentin, Julien
    COMPUTER VISION, ECCV 2022, PT XIII, 2022, 13673 : 160 - 177