The approach of Chinese speech triseme recognition for human mouth animation

Cited by: 0

Authors:

Ouyang, Jianjun [1]

Xu, Ming [2]

Huang, Yunsen [2]

Affiliations:

[1] Shenzhen Univ, Coll Informat Engn, Shenzhen, Peoples R China

[2] Shenzhen Univ, Informat Ctr, Shenzhen, Peoples R China

Source:

ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 5, PROCEEDINGS | 2007

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In contrast to text-driven and phoneme-based mouth synthesis approaches, this paper presents a novel mouth animation approach driven by natural speech. To capture the context information of mouth shapes in continuous speech, a triseme-based modeling technique is employed to train the trisemic HMMs. To obtain robust model parameters from limited training data, a state-tying procedure is introduced. To address compatibility and ambiguity issues, the visemic questions assigned to the leaf nodes of the decision tree are generated from the training data. With the trained HMM parameters, the Viterbi beam search algorithm is applied to time-align the trisemic sequences. The recognized trisemes are then mapped to the corresponding mouth shapes represented by MPEG-4 FAPs, and the speaking mouth is animated after a smoothing process. Under the proposed evaluation criterion, the experimental results show that the recognition accuracy is adequate for the application and that the alignment speed is acceptable to human vision.

Pages: 666 / +

Page count: 2

Total: 50 items

[21] Speech driven realistic mouth animation based on multi-modal unit selection
Jiang, Dongmei
Ravyse, Ilse
Sahli, Hichem
Verhelst, Werner
JOURNAL ON MULTIMODAL USER INTERFACES, 2008, 2 (3-4) : 157 - 169
[22] Speech driven realistic mouth animation based on multi-modal unit selection
Dongmei Jiang
Ilse Ravyse
Hichem Sahli
Werner Verhelst
Journal on Multimodal User Interfaces, 2008, 2
[23] A Dialectal Chinese Speech Recognition Framework
Jing Li
Thomas Fang Zheng
William Byrne
Dan Jurafsky
Journal of Computer Science and Technology, 2006, 21 : 106 - 115
[24] Continuous Chinese speech recognition and understanding
Gu, JH
Liu, JM
Shen, XQ
ICICS - PROCEEDINGS OF 1997 INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS AND SIGNAL PROCESSING, VOLS 1-3: THEME: TRENDS IN INFORMATION SYSTEMS ENGINEERING AND WIRELESS MULTIMEDIA COMMUNICATIONS, 1997, : 989 - 992
[25] A dialectal chinese speech recognition framework
Li, J
Zheng, TF
Byrne, W
Jurafsky, D
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2006, 21 (01) : 106 - 115
[26] Tone Recognition of Chinese Whispered Speech
Gong Chenghui
Zhao Heming
PACIIA: 2008 PACIFIC-ASIA WORKSHOP ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION, VOLS 1-3, PROCEEDINGS, 2008, : 401 - +
[27] Research issues for Chinese speech recognition
Du, Limin
Hou, Ziqiang
Tien Tzu Hsueh Pao/Acta Electronica Sinica, 1995, 23 (10) : 110 - 116
[28] Multi-expression facial animation based on speech emotion recognition
Research Center for Pervasive Computing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao, 2008, 4 (520-525):
[29] Korean Speech recognition using phonemics for Lip-sync Animation
Hwang, Sun-Min
Song, Bok-Hee
Yun, Han-Kyung
2014 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE, ELECTRONICS AND ELECTRICAL ENGINEERING (ISEEE), VOLS 1-3, 2014, : 1010 - +
[30] Speech driven facial animation using Chinese mandarin pronunciation rules
You, MY
Bu, JJ
Chen, C
Song, ML
COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2004, PT 3, 2004, 3045 : 886 - 895
