Rigid head motion in expressive speech animation: Analysis and synthesis

Cited by: 104
Authors
Busso, Carlos [1]
Deng, Zhigang [1]
Grimm, Michael [1]
Neumann, Ulrich [1]
Narayanan, Shrikanth [1]
Affiliation
[1] Univ So Calif, Viterbi Sch Engn, Integrated Media Syst Ctr, Los Angeles, CA 90089 USA
Funding
National Science Foundation (USA)
Keywords
DOI
10.1109/TASL.2006.885910
Chinese Library Classification
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Rigid head motion is a gesture that conveys important nonverbal information in human communication, and hence it needs to be appropriately modeled and included in realistic facial animations to effectively mimic human behaviors. In this paper, head motion sequences in expressive facial animations are analyzed in terms of their naturalness and emotional salience in perception. Statistical measures are derived from an audiovisual database, comprising synchronized facial gestures and speech, which revealed characteristic patterns in emotional head motion sequences. Head motion patterns with neutral speech significantly differ from head motion patterns with emotional speech in motion activation, range, and velocity. The results show that head motion provides discriminating information about emotional categories. An approach to synthesize emotional head motion sequences driven by prosodic features is presented, expanding upon our previous framework on head motion synthesis. This method naturally models the specific temporal dynamics of emotional head motion sequences by building hidden Markov models for each emotional category (sadness, happiness, anger, and neutral state). Human raters were asked to assess the naturalness and the emotional content of the facial animations. On average, the synthesized head motion sequences were perceived as even more natural than the original head motion sequences. The results also show that head motion modifies the emotional perception of the facial animation, especially in the valence and activation domain. These results suggest that appropriate head motion not only significantly improves the naturalness of the animation but can also be used to enhance the emotional content of the animation to effectively engage the users.
Pages: 1075-1086
Number of pages: 12
Related papers
  • [1] Expressive Speech Animation Synthesis with Phoneme-Level Controls
    Deng, Z.
    Neumann, U.
    COMPUTER GRAPHICS FORUM, 2008, 27 (08) : 2096 - 2113
  • [2] Linking facial animation, head motion and speech acoustics
    Yehia, HC
    Kuratate, T
    Vatikiotis-Bateson, E
    JOURNAL OF PHONETICS, 2002, 30 (03) : 555 - 568
  • [3] Towards Expressive Speech Synthesis: Analysis and Modeling of Expressive Speech
    Raptis, Spyros
    Karabetsos, Sotiris
    Chalamandaris, Aimilios
    Tsiakoulis, Pirros
    2014 5th IEEE Conference on Cognitive Infocommunications (CogInfoCom), 2014, : 461 - 465
  • [4] Mood swings: Expressive speech animation
    Chuang, E
    Bregler, C
    ACM TRANSACTIONS ON GRAPHICS, 2005, 24 (02) : 331 - 347
  • [5] Principal components of expressive speech animation
    Kshirsagar, S
    Molet, T
    Magnenat-Thalmann, N
    COMPUTER GRAPHICS INTERNATIONAL 2001, PROCEEDINGS, 2001, : 38 - 44
  • [6] Expressive facial animation synthesis by learning speech coarticulation and expression spaces
    Deng, Zhigang
    Neumann, Ulrich
    Lewis, J. P.
    Kim, Tae-Yong
    Bulut, Murtaza
    Narayanan, Shrikanth
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2006, 12 (06) : 1523 - 1534
  • [7] Expressive facial animation synthesis by learning speech coarticulation and expression spaces
    IEEE Computer Society
    (other authors unknown)
    IEEE Trans Visual Comput Graphics, 2006, 6 (1523-1534):
  • [8] Expressive speech-driven facial animation
    Cao, Y
    Tien, WC
    Faloutsos, P
    Pighin, F
    ACM TRANSACTIONS ON GRAPHICS, 2005, 24 (04) : 1283 - 1302
  • [9] Dynamic, Expressive Speech Animation From a Single Mesh
    Wampler, Kevin
    Sasaki, Daichi
    Zhang, Li
    Popovic, Zoran
    SYMPOSIUM ON COMPUTER ANIMATION 2007: ACM SIGGRAPH/ EUROGRAPHICS SYMPOSIUM PROCEEDINGS, 2007, : 53 - 62
  • [10] Expressive talking avatar synthesis and animation
    Xie, Lei
    Jia, Jia
    Meng, Helen
    Deng, Zhigang
    Wang, Lijuan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (22) : 9845 - 9848