Lip movement synthesis from speech based on Hidden Markov Models

Cited by: 4

Authors:

Yamamoto, E [1]

Nakamura, S [1]

Shikano, K [1]

Affiliation:

[1] Nara Inst Sci & Technol, Grad Sch Informat Sci, Nara 63001, Japan

Source:

AUTOMATIC FACE AND GESTURE RECOGNITION - THIRD IEEE INTERNATIONAL CONFERENCE PROCEEDINGS | 1998

DOI:

10.1109/AFGR.1998.670941

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Speech intelligibility can be improved by adding lip and facial images to the speech signal. Lip image synthesis therefore plays an important role in realizing a natural, human-like face for computer agents. Moreover, synthesized lip movement images can compensate for the lack of auditory information for hearing-impaired people. We propose a novel lip movement synthesis method that maps input speech to lip movement using Hidden Markov Models (HMMs). This paper compares the HMM-based method with a conventional method using vector quantization (VQ). In the experiment, the error and the time differential error between synthesized lip movement images and the original ones are used for evaluation. The results show that the error of the HMM-based method is 8.7% smaller than that of the VQ-based method. Moreover, the HMM-based method reduces the time differential error by 32% compared with the VQ-based method. The results also show that the errors are mostly caused by the phonemes /h/ and /Q/. Since the lip shapes of those phonemes depend strongly on the succeeding phoneme, context-dependent synthesis is applied to the HMM-based method to reduce the error. The improved HMM-based method reduces the error (differential error) by 10.5% (11%) compared with the original HMM-based method.
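The abstract names two evaluation measures, the error and the time differential error, without defining them. The following is a minimal sketch of how such measures might be computed, not the authors' definition. It assumes that each frame of lip movement is a vector of lip parameters, that the error is the mean Euclidean distance between corresponding frames, and that the time differential is the first-order difference between consecutive frames. All function names are hypothetical.

```python
import math


def frame_error(synth, orig):
    """Mean Euclidean distance between corresponding frames (assumed metric)."""
    if len(synth) != len(orig) or not synth:
        raise ValueError("sequences must be non-empty and of equal length")
    return sum(math.dist(s, o) for s, o in zip(synth, orig)) / len(synth)


def time_differential(seq):
    """First-order difference between consecutive frames (assumed definition)."""
    return [[b - a for a, b in zip(prev, cur)] for prev, cur in zip(seq, seq[1:])]


def time_differential_error(synth, orig):
    """Frame error computed on the frame-to-frame differences."""
    return frame_error(time_differential(synth), time_differential(orig))


def relative_reduction(baseline, improved):
    """Percentage by which `improved` is smaller than `baseline`."""
    return 100.0 * (baseline - improved) / baseline


# Toy data: three frames of two lip parameters each (illustrative values only).
original = [[0.0, 0.0], [1.0, 0.0], [2.0, 0.0]]
synthesized = [[0.0, 0.0], [1.0, 1.0], [2.0, 0.0]]

err = frame_error(synthesized, original)
diff_err = time_differential_error(synthesized, original)
```

Under these assumptions, a statement such as "8.7% smaller" corresponds to `relative_reduction(vq_error, hmm_error)`, where the two arguments are the errors measured for each method on the same test data.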

Pages: 154-159

Number of pages: 2
