Accurate 3D Face Reconstruction with Facial Component Tokens

被引：6

作者：

Zhang, Tianke ^{[1
,2
]}

Chu, Xuangeng ^{[2
]}

Liu, Yunfei ^{[2
]}

Lin, Lijian ^{[2
]}

Yang, Zhendong ^{[2
]}

Xu, Zhengzhuo ^{[1
,2
]}

Cao, Chengkun ^{[1
,2
]}

Yu, Fei ^{[3
]}

Zhou, Changyin ^{[3
]}

Yuan, Chun ^{[1
]}

Li, Yu ^{[2
]}

机构：

[1] Tsinghua Shenzhen Int Grad Sch, Shenzhen, Peoples R China

[2] IDEA, Shenzhen, Peoples R China

[3] Vistring Inc, Hong Kong, Peoples R China

来源：

2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023) | 2023年

基金：

国家重点研发计划;

关键词：

MORPHABLE MODEL;

D O I：

10.1109/ICCV51070.2023.00829

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Accurately reconstructing 3D faces from monocular images and videos is crucial for various applications, such as digital avatar creation. However, the current deep learning-based methods face significant challenges in achieving accurate reconstruction with disentangled facial parameters and ensuring temporal stability in single-frame methods for 3D face tracking on video data. In this paper, we propose TokenFace, a transformer-based monocular 3D face reconstruction model. TokenFace uses separate tokens for different facial components to capture information about different facial parameters and employs temporal transformers to capture temporal information from video data. This design can naturally disentangle different facial components and is flexible to both 2D and 3D training data. Trained on hybrid 2D and 3D data, our model shows its power in accurately reconstructing faces from images and producing stable results for video data. Experimental results on popular benchmarks NoW and Stirling demonstrate that TokenFace achieves state-of-the-art performance, outperforming existing methods on all metrics by a large margin.

引用

页码：8999 / 9008

页数：10

共 50 条

[21] Face It: 3D Facial Reconstruction from a Single 2D Image for Games and Simulations
Kirtzic, J. Steven
Daescu, Ovidiu
2011 INTERNATIONAL CONFERENCE ON CYBERWORLDS, 2011, : 244 - 248
[22] Facial Landmarks for Forensic Skull-Based 3D Face Reconstruction: A Literature Review
Vezzetti, Enrico
Marcolin, Federica
Tornincasa, Stefano
Moos, Sandro
Violante, Maria Grazia
Dagnes, Nicole
Monno, Giuseppe
Uva, Antonio Emmanuele
Fiorentino, Michele
AUGMENTED REALITY, VIRTUAL REALITY, AND COMPUTER GRAPHICS, PT I, 2016, 9768 : 172 - 180
[23] Analysis of facial configuration from realistic 3D face reconstruction of young Korean men
Chin, S
Kim, S
SYSTEMS MODELING AND SIMULATION: THEORY AND APPLICATIONS, 2005, 3398 : 685 - 693
[24] Effect of Facial Feature Points Selection on 3D Face Shape Reconstruction Using Regularization
Maghari, Ashraf Y. A.
Liao, Iman Yi
Belaton, Bahari
NEURAL INFORMATION PROCESSING, ICONIP 2012, PT V, 2012, 7667 : 516 - 524
[25] Learning Semantic Representations via Joint 3D Face Reconstruction and Facial Attribute Estimation
Weng, Zichun
Xiang, Youjun
Li, Xianfeng
Jiang, Juntao
Huo, Wanliang
Fu, Yuli
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 9696 - 9702
[26] 3D Face Reconstruction: The Road to Forensics
La Cava, Simone Maurizio
Orru, Giulia
Drahansky, Martin
Marcialis, Gian Luca
Roli, Fabio
ACM COMPUTING SURVEYS, 2024, 56 (03)
[27] Automatic 3D reconstruction for face recognition
Hu, YX
Jiang, DL
Yan, SC
Zhang, L
Zhang, HJ
SIXTH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, PROCEEDINGS, 2004, : 843 - 848
[28] Disparity-based 3D face modeling using 3D deformable facial mask for 3D face recognition
Ansari, A-Nasser
Abdel-Mottaleb, Mohamed
Mahoor, Mohammad H.
2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 981 - +
[29] A Brief Survey: 3D Face Reconstruction
Gao, Tianhan
An, Hui
ADVANCES ON BROAD-BAND WIRELESS COMPUTING, COMMUNICATION AND APPLICATIONS, 2020, 97 : 846 - 854
[30] 3D Face Reconstruction with Dense Landmarks
Wood, Erroll
Baltrusaitis, Tadas
Hewitt, Charlie
Johnson, Matthew
Shen, Jingjing
Milosavljevic, Nikola
Wilde, Daniel
Garbin, Stephan
Sharp, Toby
Stojiljkovic, Ivan
Cashman, Tom
Valentin, Julien
COMPUTER VISION, ECCV 2022, PT XIII, 2022, 13673 : 160 - 177

← 1 2 3 4 5 →