Lip movement synthesis from speech based on Hidden Markov Models

被引:4
|
作者
Yamamoto, E [1 ]
Nakamura, S [1 ]
Shikano, K [1 ]
机构
[1] Nara Inst Sci & Technol, Grad Sch Informat Sci, Nara 63001, Japan
关键词
D O I
10.1109/AFGR.1998.670941
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speech intelligibility can be improved by adding lip image and facial image to speech signal. Thus the lip image synthesis plays a important role to realize a natural human-libe face of computer agents. Moreover the synthesized lip movement images can compensate lack of auditory information for hearing impaired people. We propose a novel lip movement synthesis method based on mapping from input speech based on Hidden Markov Model (HMM). This paper compares the HMM-based method and a conventional method using vector quantization (VQ). In the experiment, error and time differential error between synthesized lip movement images and original ones are used for evaluation. The result shows that the error of the HMM based method is 8.7% smaller than that of the VQ-based method. Moreover, the HMM-based method reduces time differential error by 32% than the VQ's. The result also shows that the errors are mostly caused by phoneme /h/ and /Q/. Since lip shapes of those phonemes are strongly dependent on succeeding phoneme, the contest dependent synthesis on the HMM-based method is applied to reduce the error. The improved HMM-based method realizes reduction of the error(differential error) by 10.5%;(11%) compared with the original HMM-based method.
引用
收藏
页码:154 / 159
页数:2
相关论文
共 50 条
  • [1] Lip movement synthesis from speech based on Hidden Markov Models
    Yamamoto, E
    Nakamura, S
    Shikano, K
    SPEECH COMMUNICATION, 1998, 26 (1-2) : 105 - 115
  • [2] Speech Synthesis Based on Hidden Markov Models
    Tokuda, Keiichi
    Nankaku, Yoshihiko
    Toda, Tomoki
    Zen, Heiga
    Yamagishi, Junichi
    Oura, Keiichiro
    PROCEEDINGS OF THE IEEE, 2013, 101 (05) : 1234 - 1252
  • [3] Emphasized Speech Synthesis Based on Hidden Markov Models
    Morizane, Kumiko
    Nakamura, Keigo
    Toda, Tomoki
    Saruwatari, Hiroshi
    Shikano, Kiyohiro
    ORIENTAL COCOSDA 2009 - INTERNATIONAL CONFERENCE ON SPEECH DATABASE AND ASSESSMENTS, 2009, : 76 - 81
  • [4] Trainable speech synthesis with trended Hidden Markov Models
    Dines, J
    Sridharan, S
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 833 - 836
  • [5] Speaker Adaptation for Slovak Statistical Parametric Speech Synthesis Based on Hidden Markov Models
    Sulir, Martin
    Juhar, Jozef
    2015 25TH INTERNATIONAL CONFERENCE RADIOELEKTRONIKA (RADIOELEKTRONIKA), 2015, : 137 - 140
  • [6] HIDDEN MARKOV MODELS IN SPEECH RECOGNITION
    Krajcovic, J.
    Hrncar, M.
    Muzikarova, E.
    ADVANCES IN ELECTRICAL AND ELECTRONIC ENGINEERING, 2008, 7 (1-2) : 250 - 252
  • [7] Lip-movement synthesis from speech based on CDHMM-SVR
    Chen, Xin
    Zhang, Qiang
    Wei, Xiaopeng
    Journal of Information and Computational Science, 2007, 4 (02): : 475 - 482
  • [8] The Application of Hidden Markov Models in Speech Recognition
    Gales, Mark
    Young, Steve
    FOUNDATIONS AND TRENDS IN SIGNAL PROCESSING, 2007, 1 (03): : 195 - 304
  • [9] Noisy Hidden Markov Models for Speech Recognition
    Audhkhasi, Kartik
    Osoba, Osonde
    Kosko, Bart
    2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [10] Hidden Markov models in speech and language processing
    Knill, K
    Young, S
    CORPUS-BASED METHODS IN LANGUAGE AND SPEECH PROCESSING, 1997, 2 : 27 - 68