Real-time speech-driven face animation with expressions using neural networks

被引:70
|
作者
Hong, PY [1 ]
Wen, Z [1 ]
Huang, TS [1 ]
机构
[1] Univ Illinois, Beckman Inst Adv Sci & Technol, Urbana, IL 61801 USA
来源
IEEE TRANSACTIONS ON NEURAL NETWORKS | 2002年 / 13卷 / 04期
基金
美国国家科学基金会;
关键词
facial deformation modeling; facial motion analysis and synthesis; neural networks; real-time speech-driven; talking face with expressions;
D O I
10.1109/TNN.2002.1021892
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A real-time speech-driven synthetic talking face provides an effective multimodal communication interface in distributed collaboration environments. Nonverbal gestures such as facial expressions are important to human communication and should be considered by speech-driven face animation systems. In this paper, we present a framework that systematically addresses facial deformation modeling, automatic facial motion analysis, and real-time speech-driven face animation with expression using neural networks. Based on this framework, we learn a quantitative visual representation of the facial deformations, called the motion units (MUs). An facial deformation can be approximated by a linear combination of the MUs weighted by MU parameters (MVPs). We develop an MU-based facial motion tracking algorithm which is used to collect an audio-visual training database. Then, we construct a real-time audio-to-MUP mapping by training a set of neural networks using the collected audio-visual training database. The quantitative evaluation of the mapping shows the effectiveness of the proposed approach. Using the proposed method, we develop the functionality of real-time speech-driven face animation with expressions for the iFACE system. Experimental results show that the synthetic expressive talking face of the iFACE system is comparable with a real face in terms of the effectiveness of their influences on bimodal human emotion perception.
引用
收藏
页码:916 / 927
页数:12
相关论文
共 50 条
  • [41] Speech-Driven Facial Animation Using a Shared Gaussian Process Latent Variable Model
    Deena, Salil
    Galata, Aphrodite
    ADVANCES IN VISUAL COMPUTING, PT 1, PROCEEDINGS, 2009, 5875 : 89 - 100
  • [42] Real-time compact optoelectronics neural networks for face recognition
    Javidi, B
    Li, J
    PHOTONIC COMPONENT ENGINEERING AND APPLICATIONS, 1996, 2749 : 195 - 206
  • [43] REAL-TIME AVATAR ANIMATION WITH DYNAMIC FACE TEXTURING
    Fechteler, Philipp
    Paier, Wolfgang
    Hilsmann, Anna
    Eisert, Peter
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 355 - 359
  • [44] Real-Time Facial Animation Generation on Face Mask
    Han, Bin
    Kim, Gerard Jounghyun
    Hwang, Jae-In
    SIGGRAPH ASIA 2022 POSTERS, SA 2022, 2022,
  • [45] The Effect of Real-Time Constraints on Automatic Speech Animation
    Websdale, Danny
    Taylor, Sarah
    Milner, Ben
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2479 - 2483
  • [46] Deepmask: Face masking system using deep neural networks on real-time streaming
    Son S.-B.
    Jung J.-U.
    Oh H.-S.
    Jung Y.-C.
    Journal of Institute of Control, Robotics and Systems, 2020, 26 (06): : 423 - 428
  • [47] Real-Time Surveillance Through Face Recognition Using HOG and Feedforward Neural Networks
    Awais, Muhammad
    Iqbal, Muhammad Javed
    Ahmad, Iftikhar
    Alassafi, Madini O.
    Alghamdi, Rayed
    Basheri, Mohammad
    Waqas, Muhammad
    IEEE ACCESS, 2019, 7 : 121236 - 121244
  • [48] Real-time face detection using circular sliding of the Gabor energy and neural networks
    Reza Mohammadian Fini
    Mahmoud Mahlouji
    Ali Shahidinejad
    Signal, Image and Video Processing, 2022, 16 : 1081 - 1089
  • [49] Real-time face detection using circular sliding of the Gabor energy and neural networks
    Fini, Reza Mohammadian
    Mahlouji, Mahmoud
    Shahidinejad, Ali
    SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (04) : 1081 - 1089
  • [50] Speech-Driven 3D Facial Animation with Mesh Convolution
    Ji, Xuejie
    Su, Zewei
    Dong, Lanfang
    Li, Guoming
    2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, : 14 - 18