The approach of Chinese speech triseme recognition for human mouth animation

被引:0
|
作者
Ouyang, Jianjun [1 ]
Xu, Ming [2 ]
Huang, Yunsen [2 ]
机构
[1] Shenzhen Univ, Coll Informat Engn, Shenzhen, Peoples R China
[2] Shenzhen Univ, Informat Ctr, Shenzhen, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Different from text driven and phoneme based human mouth synthesis approaches, this paper presents the novel natural speech driven mouth animation approach. To capture the context information of continuously speaking mouth shapes, the triseme based modeling technique is employed for acquiring the trisemic HMMs. To obtain the robust model parameters with the limited training data, the states tying procedure is introduced. Considering the compatibility and ambiguity issues, the visemic questions which assigned in the leaf nodes of decision tree are generated that based on the training data. With the modeled HMM parameters, the viterbi beam searching algorithm is applied to time align the trisemic sequences. Mapping the recognized trisemes to the corresponding MPEG-4 FAPs-represented mouth shapes, the speaking mouth can be finally animated through a smoothing process. In terms of the proposed evaluation criterion, the experimental results illustrate that the recognition accuracy is applicable and also the aligning speed is acceptable in human vision.
引用
收藏
页码:666 / +
页数:2
相关论文
共 50 条
  • [1] Triseme decision trees in the continuous speech recognition system for talking head animation
    Xie, L
    Zhao, RC
    Jiang, DM
    Cravyse, I
    Sahli, H
    Conlenis, J
    ACTIVE MEDIA TECHNOLOGY, 2003, : 389 - 395
  • [2] A natural Chinese speech driven mouth animation system
    Xu, Ming
    Ouyang, Jianjun
    Huang, Yunsen
    2007 SECOND INTERNATIONAL CONFERENCE IN COMMUNICATIONS AND NETWORKING IN CHINA, VOLS 1 AND 2, 2007, : 745 - +
  • [3] Triseme decision trees in the continuous speech recognition system for a talking head
    Jiang, DM
    Xie, L
    Ravyse, I
    Zhao, RC
    Sahli, H
    Cornelis, J
    2002 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-4, PROCEEDINGS, 2002, : 2097 - 2101
  • [4] An approach to speech driven animation
    Sun, Ningping
    Suigetsu, Kaori
    Ayabe, Toru
    2008 FOURTH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING, PROCEEDINGS, 2008, : 113 - 116
  • [5] An efficient approach to Chinese phoneme mouth-shape recognition
    Zhong, X
    Ma, SP
    Zhang, B
    IMAGE EXTRACTION, SEGMENTATION, AND RECOGNITION, 2001, 4550 : 176 - 181
  • [6] AN EXTENDED MULTIRESOLUTION APPROACH TO MOUTH SPECIFIC AAM FITTING FOR SPEECH RECOGNITION
    Berry, Craig
    Kokaram, Anil
    Harte, Naomi
    19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 1959 - 1963
  • [7] 3D Chinese Mouth Animation with Adaptive Encoding
    Yu Haitao
    Ge Shuiying
    26TH CHINESE CONTROL AND DECISION CONFERENCE (2014 CCDC), 2014, : 3965 - 3970
  • [8] Human Speech Sentiments Recognition: A Data Mining Approach for Categorization of Speech
    Gupta, Ritika
    Aggarwal, Gaurav
    PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 3987 - 3991
  • [9] An articulatory approach to video-realistic mouth animation
    Xie, Lei
    Liu, Zhi-Qiang
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 593 - 596
  • [10] Influence of human factors on performance of Chinese speech recognition systems
    Chen, Shanguang
    Jiang, Qiyuan
    Yu, Tiecheng
    Hangtian Yixue Yu Yixue Gongcheng/Space Medicine and Medical Engineering, 1996, 9 (04):