The approach of Chinese speech triseme recognition for human mouth animation

被引：0

作者：

Ouyang, Jianjun ^{[1
]}

Xu, Ming ^{[2
]}

Huang, Yunsen ^{[2
]}

机构：

[1] Shenzhen Univ, Coll Informat Engn, Shenzhen, Peoples R China

[2] Shenzhen Univ, Informat Ctr, Shenzhen, Peoples R China

来源：

ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 5, PROCEEDINGS | 2007年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Different from text driven and phoneme based human mouth synthesis approaches, this paper presents the novel natural speech driven mouth animation approach. To capture the context information of continuously speaking mouth shapes, the triseme based modeling technique is employed for acquiring the trisemic HMMs. To obtain the robust model parameters with the limited training data, the states tying procedure is introduced. Considering the compatibility and ambiguity issues, the visemic questions which assigned in the leaf nodes of decision tree are generated that based on the training data. With the modeled HMM parameters, the viterbi beam searching algorithm is applied to time align the trisemic sequences. Mapping the recognized trisemes to the corresponding MPEG-4 FAPs-represented mouth shapes, the speaking mouth can be finally animated through a smoothing process. In terms of the proposed evaluation criterion, the experimental results illustrate that the recognition accuracy is applicable and also the aligning speed is acceptable in human vision.

引用

页码：666 / +

页数：2

共 50 条

[1] Triseme decision trees in the continuous speech recognition system for talking head animation
Xie, L
Zhao, RC
Jiang, DM
Cravyse, I
Sahli, H
Conlenis, J
ACTIVE MEDIA TECHNOLOGY, 2003, : 389 - 395
[2] A natural Chinese speech driven mouth animation system
Xu, Ming
Ouyang, Jianjun
Huang, Yunsen
2007 SECOND INTERNATIONAL CONFERENCE IN COMMUNICATIONS AND NETWORKING IN CHINA, VOLS 1 AND 2, 2007, : 745 - +
[3] Triseme decision trees in the continuous speech recognition system for a talking head
Jiang, DM
Xie, L
Ravyse, I
Zhao, RC
Sahli, H
Cornelis, J
2002 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-4, PROCEEDINGS, 2002, : 2097 - 2101
[4] An approach to speech driven animation
Sun, Ningping
Suigetsu, Kaori
Ayabe, Toru
2008 FOURTH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING, PROCEEDINGS, 2008, : 113 - 116
[5] An efficient approach to Chinese phoneme mouth-shape recognition
Zhong, X
Ma, SP
Zhang, B
IMAGE EXTRACTION, SEGMENTATION, AND RECOGNITION, 2001, 4550 : 176 - 181
[6] AN EXTENDED MULTIRESOLUTION APPROACH TO MOUTH SPECIFIC AAM FITTING FOR SPEECH RECOGNITION
Berry, Craig
Kokaram, Anil
Harte, Naomi
19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 1959 - 1963
[7] 3D Chinese Mouth Animation with Adaptive Encoding
Yu Haitao
Ge Shuiying
26TH CHINESE CONTROL AND DECISION CONFERENCE (2014 CCDC), 2014, : 3965 - 3970
[8] Human Speech Sentiments Recognition: A Data Mining Approach for Categorization of Speech
Gupta, Ritika
Aggarwal, Gaurav
PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 3987 - 3991
[9] An articulatory approach to video-realistic mouth animation
Xie, Lei
Liu, Zhi-Qiang
2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 593 - 596
[10] Influence of human factors on performance of Chinese speech recognition systems
Chen, Shanguang
Jiang, Qiyuan
Yu, Tiecheng
Hangtian Yixue Yu Yixue Gongcheng/Space Medicine and Medical Engineering, 1996, 9 (04):

← 1 2 3 4 5 →