Sample-based synthesis of photo-realistic talking heads

Cited by: 47

Authors:

Cosatto, E [1]

Graf, HP [1]

Affiliations:

[1] AT&T Bell Labs, Res, Red Bank, NJ 07701 USA

Source:

COMPUTER ANIMATION 98 - PROCEEDINGS | 1998

Keywords:

talking-head synthesis; sample-based synthesis; photo-realistic rendering; face recognition and location; sample-based coarticulation;

DOI:

10.1109/CA.1998.681914

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper describes a system that generates photo-realistic video animations of talking heads. First the system derives head models from existing video footage using image recognition techniques. It locates, extracts and labels facial parts such as mouth, eyes, and eyebrows into a compact library. Then, using these face models and a text-to-speech synthesizer, it synthesizes new video sequences of the head where the lips are in synchrony with the accompanying soundtrack. Emotional cues and conversational signals are produced by combining head movements, raising eyebrows, wide open eyes, etc. with the mouth animation. For these animations to be believable, care has to be taken aligning the facial parts so that they blend smoothly into each other and produce seamless animations. Our system uses precise multi-channel facial recognition techniques to track facial parts, and it derives the exact 3D position of the head, enabling the automatic extraction of normalized face parts. Such talking-head animations are useful because they generally increase intelligibility of the human-machine interface in applications where content needs to be narrated to the user, such as educative software.


Pages: 103-110

Page count: 8

50 entries in total

[11] A New Language Independent, Photo-realistic Talking Head Driven by Voice Only
Zhang, Xinjian
Wang, Lijuan
Li, Gang
Seide, Frank
Soong, Frank K.
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013 : 2742 - 2746
[12] Photo-realistic Text-driven Malay talking head with multiple expression
Tan, Tian-Swee
Salleh, Sh-Hussain
Chew, Kim-Mey
Lim, Sheau-Chyi
2008 INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION ENGINEERING, VOLS 1-3, 2008 : 711 - 715
[13] 3D Photo-realistic talking head for human-robot interaction
Simplicio, Carlos
Faria, Diego R.
Dias, Jorge
VIRTUAL AND RAPID MANUFACTURING: ADVANCED RESEARCH IN VIRTUAL AND RAPID PROTOTYPING, 2008 : 677 - +
[14] Image-based rendering for photo-realistic visualization
Verbiest, F.
Willems, G.
van Gool, L.
VIRTUAL AND PHYSICAL PROTOTYPING, 2006, 1 (01) : 19 - 30
[15] Photo-Realistic Exemplar-Based Face Ageing
Schneider, Andreas
Bouabene, Ghazi
Shaiek, Ayet
Schonborn, Sandro
Flament, Ferderic
Francois, Ghislain
Rubert, Virginie
Vetter, Thomas
2019 14TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2019), 2019 : 199 - 206
[16] Photo-Realistic Facial Details Synthesis From Single Image
Chen, Anpei
Chen, Zhang
Zhang, Guli
Mitchell, Kenny
Yu, Jingyi
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019 : 9428 - 9438
[17] Open issues in photo-realistic rendering
Purgathofer, W
COMPUTER GRAPHICS FORUM, 2003, 22 (03) : XIX - XIX
[18] Photo-realistic depth-of-field effects synthesis based on real camera parameters
Lin, Huei-Yung
Gu, Kai-Da
ADVANCES IN VISUAL COMPUTING, PT I, 2007, 4841 : 298 - 309
[19] Photo-realistic Neural Domain Randomization
Zakharov, Sergey
Ambrus, Rares
Guizilini, Vitor
Kehl, Wadim
Gaidon, Adrien
COMPUTER VISION, ECCV 2022, PT XXV, 2022, 13685 : 310 - 327
[20] Photo-realistic Facial Texture Transfer
Kaur, Parneet
Zhang, Hang
Dana, Kristin J.
2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019 : 2097 - 2105
