Lip movement synthesis from speech based on Hidden Markov Models

被引:4
|
作者
Yamamoto, E [1 ]
Nakamura, S [1 ]
Shikano, K [1 ]
机构
[1] Nara Inst Sci & Technol, Grad Sch Informat Sci, Nara 63001, Japan
关键词
D O I
10.1109/AFGR.1998.670941
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speech intelligibility can be improved by adding lip image and facial image to speech signal. Thus the lip image synthesis plays a important role to realize a natural human-libe face of computer agents. Moreover the synthesized lip movement images can compensate lack of auditory information for hearing impaired people. We propose a novel lip movement synthesis method based on mapping from input speech based on Hidden Markov Model (HMM). This paper compares the HMM-based method and a conventional method using vector quantization (VQ). In the experiment, error and time differential error between synthesized lip movement images and original ones are used for evaluation. The result shows that the error of the HMM based method is 8.7% smaller than that of the VQ-based method. Moreover, the HMM-based method reduces time differential error by 32% than the VQ's. The result also shows that the errors are mostly caused by phoneme /h/ and /Q/. Since lip shapes of those phonemes are strongly dependent on succeeding phoneme, the contest dependent synthesis on the HMM-based method is applied to reduce the error. The improved HMM-based method realizes reduction of the error(differential error) by 10.5%;(11%) compared with the original HMM-based method.
引用
收藏
页码:154 / 159
页数:2
相关论文
共 50 条
  • [31] Fuzzy hidden Markov models for speech and speaker recognition
    Tran, D
    Wagner, M
    18TH INTERNATIONAL CONFERENCE OF THE NORTH AMERICAN FUZZY INFORMATION PROCESSING SOCIETY - NAFIPS, 1999, : 426 - 430
  • [32] Factor analysed hidden Markov models for speech recognition
    Rosti, AVI
    Gales, MJF
    COMPUTER SPEECH AND LANGUAGE, 2004, 18 (02): : 181 - 200
  • [33] BAYESIAN SENSING HIDDEN MARKOV MODELS FOR SPEECH RECOGNITION
    Saon, George
    Chien, Jen-Tzung
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5056 - 5059
  • [34] Speech emotion recognition using hidden Markov models
    Nwe, TL
    Foo, SW
    De Silva, LC
    SPEECH COMMUNICATION, 2003, 41 (04) : 603 - 623
  • [35] Speech animation using coupled hidden Markov models
    Xie, Lei
    Liu, Zhi-Qiang
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2006, : 1128 - +
  • [36] Fuzzy hidden Markov models for speech and speaker recognition
    Tran, Dat
    Wagner, Michael
    Annual Conference of the North American Fuzzy Information Processing Society - NAFIPS, 1999, : 426 - 430
  • [37] Speech defect analysis using Hidden Markov Models
    Chaloupka, Zdenek
    Uhlir, Jan
    RADIOENGINEERING, 2007, 16 (01) : 67 - 72
  • [38] REVISITING HIDDEN MARKOV MODELS FOR SPEECH EMOTION RECOGNITION
    Mao, Shuiyang
    Tao, Dehua
    Zhang, Guangyan
    Ching, P. C.
    Lee, Tan
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6715 - 6719
  • [39] IMPROVED HIDDEN MARKOV-MODELS FOR SPEECH RECOGNITION
    AUBERT, X
    BOURLARD, H
    KAMP, Y
    WELLEKENS, CJ
    PHILIPS JOURNAL OF RESEARCH, 1988, 43 (3-4) : 224 - 245
  • [40] Fuzzy Hidden Markov Models for Indonesian Speech Classification
    Yulita, Intan Nurma
    The, Houw Liong
    Adiwijaya
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2012, 16 (03) : 381 - 387