Lip movement synthesis from speech based on Hidden Markov Models

Cited by: 4
Authors
Yamamoto, E [1]
Nakamura, S [1]
Shikano, K [1]
Affiliations
[1] Nara Inst Sci & Technol, Grad Sch Informat Sci, Nara 63001, Japan
DOI
10.1109/AFGR.1998.670941
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Speech intelligibility can be improved by adding lip and facial images to the speech signal. Lip image synthesis therefore plays an important role in realizing a natural, human-like face for computer agents. Moreover, synthesized lip movement images can compensate for the lack of auditory information for hearing-impaired people. We propose a novel lip movement synthesis method that maps input speech to lip movement using Hidden Markov Models (HMMs). This paper compares the HMM-based method with a conventional method using vector quantization (VQ). In the experiment, the error and the time-differential error between the synthesized lip movement images and the original ones are used for evaluation. The results show that the error of the HMM-based method is 8.7% smaller than that of the VQ-based method, and that its time-differential error is 32% smaller. The results also show that the errors are mostly caused by the phonemes /h/ and /Q/. Since the lip shapes of these phonemes depend strongly on the succeeding phoneme, context-dependent synthesis is applied to the HMM-based method to reduce the error. The improved HMM-based method reduces the error by 10.5% and the time-differential error by 11% compared with the original HMM-based method.
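The abstract names two evaluation measures, an error and a time-differential error, but does not define them. The sketch below is one plausible reading, not the authors' formulation: it assumes each lip frame is a vector of lip parameters, takes the error as the mean Euclidean distance between corresponding frames, and takes the time-differential error as the same distance computed on frame-to-frame differences. The function names (frame_error, differential_error, relative_reduction) and the toy data are hypothetical.

```python
import math


def frame_error(synth, orig):
    """Mean Euclidean distance between corresponding frames (assumed metric)."""
    if len(synth) != len(orig) or not synth:
        raise ValueError("sequences must be non-empty and equally long")
    return sum(math.dist(s, o) for s, o in zip(synth, orig)) / len(synth)


def deltas(seq):
    """Frame-to-frame differences of a sequence of parameter vectors."""
    return [[b - a for a, b in zip(prev, cur)] for prev, cur in zip(seq, seq[1:])]


def differential_error(synth, orig):
    """Assumed time-differential error: frame_error applied to the deltas."""
    return frame_error(deltas(synth), deltas(orig))


def relative_reduction(baseline, improved):
    """Percentage by which `improved` is smaller than `baseline`."""
    return 100.0 * (baseline - improved) / baseline


# Toy data (hypothetical): three frames of two lip parameters each.
orig = [[0.0, 0.0], [1.0, 0.0], [2.0, 0.0]]
synth = [[0.0, 0.0], [1.0, 1.0], [2.0, 0.0]]

print(frame_error(synth, orig))
print(differential_error(synth, orig))
```

Under this reading, a synthesized trajectory can stay close to the original frame by frame and still score poorly on the differential measure if it jitters, which is why the two measures are reported separately. Percentages such as those quoted in the abstract would correspond to relative_reduction applied to the two methods' scores.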
Pages: 154-159
Page count: 2