Sample-based synthesis of photo-realistic talking heads

被引:47
|
作者
Cosatto, E [1 ]
Graf, HP [1 ]
机构
[1] AT&T Bell Labs, Res, Red Bank, NJ 07701 USA
关键词
talking-head synthesis; sample-based synthesis; photo-realistic rendering; face recognition and location; sample-based coarticulation;
D O I
10.1109/CA.1998.681914
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes a system that generates photo-realistic video animations of talking heads. First the system derives head models from existing video footage using image recognition techniques. It locates, extracts and labels facial parts such as mouth, eyes, and eyebrows into a compact library. Then, using these face models and a text-to-speech synthesizer, it synthesizes new video sequences of the head where the lips are in synchrony with the accompanying soundtrack. Emotional cues and conversational signals are produced by combining head movements, raising eyebrows, wide open eyes, etc. with the mouth animation. For these animations to be believable, care has to be taken aligning the facial parts so that they blend smoothly into each other and produce seamless animations. Our system uses precise multi-channel facial recognition techniques to track facial parts, and it derives the exact 3D position of the head, enabling the automatic extraction of normalized face parts. Such talking-head animations are useful because they generally increase intelligibility of the human-machine interface in applications where content needs to be narrated to the user, such as educative software.
引用
收藏
页码:103 / 110
页数:8
相关论文
共 50 条
  • [11] A New Language Independent, Photo-realistic Talking Head Driven by Voice Only
    Zhang, Xinjian
    Wang, Lijuan
    Li, Gang
    Seide, Frank
    Soong, Frank K.
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2742 - 2746
  • [12] Photo-realistic Text-driven Malay talking head with multiple expression
    Tan, Tian-Swee
    Salleh, Sh-Hussain
    Chew, Kim-Mey
    Lim, Sheau-Chyi
    2008 INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION ENGINEERING, VOLS 1-3, 2008, : 711 - 715
  • [13] 3D Photo-realistic talking head for human-robot interaction
    Simplicio, Carlos
    Faria, Diego R.
    Dias, Jorge
    VIRTUAL AND RAPID MANUFACTURING: ADVANCED RESEARCH IN VIRTUAL AND RAPID PROTOTYPING, 2008, : 677 - +
  • [14] Image-based rendering for photo-realistic visualization
    Verbiest, F.
    Willems, G.
    van Gool, L.
    VIRTUAL AND PHYSICAL PROTOTYPING, 2006, 1 (01) : 19 - 30
  • [15] Photo-Realistic Exemplar-Based Face Ageing
    Schneider, Andreas
    Bouabene, Ghazi
    Shaiek, Ayet
    Schonborn, Sandro
    Flament, Ferderic
    Francois, Ghislain
    Rubert, Virginie
    Vetter, Thomas
    2019 14TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2019), 2019, : 199 - 206
  • [16] Photo-Realistic Facial Details Synthesis From Single Image
    Chen, Anpei
    Chen, Zhang
    Zhang, Guli
    Mitchell, Kenny
    Yu, Jingyi
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9428 - 9438
  • [17] Open issues in photo-realistic rendering
    Purgathofer, W
    COMPUTER GRAPHICS FORUM, 2003, 22 (03) : XIX - XIX
  • [18] Photo-realistic depth-of-field effects synthesis based on real camera parameters
    Lin, Huei-Yung
    Gu, Kai-Da
    ADVANCES IN VISUAL COMPUTING, PT I, 2007, 4841 : 298 - 309
  • [19] Photo-realistic Neural Domain Randomization
    Zakharov, Sergey
    Ambrus, Rares
    Guizilini, Vitor
    Kehl, Wadim
    Gaidon, Adrien
    COMPUTER VISION, ECCV 2022, PT XXV, 2022, 13685 : 310 - 327
  • [20] Photo-realistic Facial Texture Transfer
    Kaur, Parneet
    Zhang, Hang
    Dana, Kristin J.
    2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 2097 - 2105