The approach of Chinese speech triseme recognition for human mouth animation

被引:0
|
作者
Ouyang, Jianjun [1 ]
Xu, Ming [2 ]
Huang, Yunsen [2 ]
机构
[1] Shenzhen Univ, Coll Informat Engn, Shenzhen, Peoples R China
[2] Shenzhen Univ, Informat Ctr, Shenzhen, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Different from text driven and phoneme based human mouth synthesis approaches, this paper presents the novel natural speech driven mouth animation approach. To capture the context information of continuously speaking mouth shapes, the triseme based modeling technique is employed for acquiring the trisemic HMMs. To obtain the robust model parameters with the limited training data, the states tying procedure is introduced. Considering the compatibility and ambiguity issues, the visemic questions which assigned in the leaf nodes of decision tree are generated that based on the training data. With the modeled HMM parameters, the viterbi beam searching algorithm is applied to time align the trisemic sequences. Mapping the recognized trisemes to the corresponding MPEG-4 FAPs-represented mouth shapes, the speaking mouth can be finally animated through a smoothing process. In terms of the proposed evaluation criterion, the experimental results illustrate that the recognition accuracy is applicable and also the aligning speed is acceptable in human vision.
引用
收藏
页码:666 / +
页数:2
相关论文
共 50 条
  • [21] Speech driven realistic mouth animation based on multi-modal unit selection
    Jiang, Dongmei
    Ravyse, Ilse
    Sahli, Hichem
    Verhelst, Werner
    JOURNAL ON MULTIMODAL USER INTERFACES, 2008, 2 (3-4) : 157 - 169
  • [22] Speech driven realistic mouth animation based on multi-modal unit selection
    Dongmei Jiang
    Ilse Ravyse
    Hichem Sahli
    Werner Verhelst
    Journal on Multimodal User Interfaces, 2008, 2
  • [23] A Dialectal Chinese Speech Recognition Framework
    Jing Li
    Thomas Fang Zheng
    William Byrne
    Dan Jurafsky
    Journal of Computer Science and Technology, 2006, 21 : 106 - 115
  • [24] Continuous Chinese speech recognition and understanding
    Gu, JH
    Liu, JM
    Shen, XQ
    ICICS - PROCEEDINGS OF 1997 INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS AND SIGNAL PROCESSING, VOLS 1-3: THEME: TRENDS IN INFORMATION SYSTEMS ENGINEERING AND WIRELESS MULTIMEDIA COMMUNICATIONS, 1997, : 989 - 992
  • [25] A dialectal chinese speech recognition framework
    Li, J
    Zheng, TF
    Byrne, W
    Jurafsky, D
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2006, 21 (01) : 106 - 115
  • [26] Tone Recognition of Chinese Whispered Speech
    Gong Chenghui
    Zhao Heming
    PACIIA: 2008 PACIFIC-ASIA WORKSHOP ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION, VOLS 1-3, PROCEEDINGS, 2008, : 401 - +
  • [27] Research issues for Chinese speech recognition
    Du, Limin
    Hou, Ziqiang
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 1995, 23 (10): : 110 - 116
  • [28] Multi-expression facial animation based on speech emotion recognition
    Research Center far Pervasive Computing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao, 2008, 4 (520-525):
  • [29] Korean Speech recognition using phonemics for Lip-sync Animation
    Hwang, Sun-Min
    Song, Bok-Hee
    Yun, Han-Kyung
    2014 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE, ELECTRONICS AND ELECTRICAL ENGINEERING (ISEEE), VOLS 1-3, 2014, : 1010 - +
  • [30] Speech driven facial animation using Chinese mandarin pronunciation rules
    You, MY
    Bu, JJ
    Chen, C
    Song, ML
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2004, PT 3, 2004, 3045 : 886 - 895