The approach of Chinese speech triseme recognition for human mouth animation

被引：0

作者：

Ouyang, Jianjun ^{[1
]}

Xu, Ming ^{[2
]}

Huang, Yunsen ^{[2
]}

机构：

[1] Shenzhen Univ, Coll Informat Engn, Shenzhen, Peoples R China

[2] Shenzhen Univ, Informat Ctr, Shenzhen, Peoples R China

来源：

ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 5, PROCEEDINGS | 2007年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Different from text driven and phoneme based human mouth synthesis approaches, this paper presents the novel natural speech driven mouth animation approach. To capture the context information of continuously speaking mouth shapes, the triseme based modeling technique is employed for acquiring the trisemic HMMs. To obtain the robust model parameters with the limited training data, the states tying procedure is introduced. Considering the compatibility and ambiguity issues, the visemic questions which assigned in the leaf nodes of decision tree are generated that based on the training data. With the modeled HMM parameters, the viterbi beam searching algorithm is applied to time align the trisemic sequences. Mapping the recognized trisemes to the corresponding MPEG-4 FAPs-represented mouth shapes, the speaking mouth can be finally animated through a smoothing process. In terms of the proposed evaluation criterion, the experimental results illustrate that the recognition accuracy is applicable and also the aligning speed is acceptable in human vision.

引用

页码：666 / +

页数：2

共 50 条

[41] Study on Chinese speech recognition and understanding system
Qiu, Wei
Xu, Bingzheng
Zhong, Wenqing
Huanan Ligong Daxue Xuebao/Journal of South China University of Technology, 1996, 24 (04):
[42] Speech emotion recognition of Chinese elderly people
Wang, Kunxia
Zhu, Zongbao
Zhang, Jian
Chen, Lifei
WEB INTELLIGENCE, 2018, 16 (03) : 149 - 157
[43] Chinese-English bilingual speech recognition
Yu, SM
Hu, S
Zhang, SW
Xu, B
2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 603 - 609
[44] A study on Yunnan dialectal Chinese speech recognition
Pu, Yuan-Yuan
Yang, Jian
Wei, Hong
Xu, Dan
PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 2760 - 2764
[45] Chinese Putonghua speech recognition for special education
Huang, Zhong-Wei
Liu, Ming-Hui
Xu, Ming
Feng, Shan-Shan
Gao, Jian-Wei
Shenzhen Daxue Xuebao (Ligong Ban)/Journal of Shenzhen University Science and Engineering, 2007, 24 (04): : 404 - 405
[46] Multi-Accent Chinese Speech Recognition
Liu Yi
Fung, Pascale
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 133 - +
[47] On the syllable structures of Chinese relating to speech recognition
Zhang, JL
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2450 - 2453
[48] Exploration on speech recognition strategy of spoken Chinese
Kuang, Jishun
He, Liuzao
Hunan Daxue Xuebao/Journal of Hunan University Natural Sciences, 1993, 20 (02): : 33 - 39
[49] Research on Mandarin Chinese in Speech Emotion Recognition
Wang, Ziyun
Guo, Xiao
2022 5TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND NATURAL LANGUAGE PROCESSING, MLNLP 2022, 2022, : 99 - 103
[50] Chinese dialect speech recognition: a comprehensive survey
Li, Qiang
Mai, Qianyu
Wang, Mandou
Ma, Mingjuan
ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (02)

← 1 2 3 4 5 →