Lip movement synthesis from speech based on Hidden Markov Models

被引:4
|
作者
Yamamoto, E [1 ]
Nakamura, S [1 ]
Shikano, K [1 ]
机构
[1] Nara Inst Sci & Technol, Grad Sch Informat Sci, Nara 63001, Japan
关键词
D O I
10.1109/AFGR.1998.670941
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speech intelligibility can be improved by adding lip image and facial image to speech signal. Thus the lip image synthesis plays a important role to realize a natural human-libe face of computer agents. Moreover the synthesized lip movement images can compensate lack of auditory information for hearing impaired people. We propose a novel lip movement synthesis method based on mapping from input speech based on Hidden Markov Model (HMM). This paper compares the HMM-based method and a conventional method using vector quantization (VQ). In the experiment, error and time differential error between synthesized lip movement images and original ones are used for evaluation. The result shows that the error of the HMM based method is 8.7% smaller than that of the VQ-based method. Moreover, the HMM-based method reduces time differential error by 32% than the VQ's. The result also shows that the errors are mostly caused by phoneme /h/ and /Q/. Since lip shapes of those phonemes are strongly dependent on succeeding phoneme, the contest dependent synthesis on the HMM-based method is applied to reduce the error. The improved HMM-based method realizes reduction of the error(differential error) by 10.5%;(11%) compared with the original HMM-based method.
引用
收藏
页码:154 / 159
页数:2
相关论文
共 50 条
  • [41] A Novel Speech Recognition Approach Based on Multiple Modeling by Hidden Markov Models
    Samira, Hazmoune
    Fateh, Bougamouza
    Smaine, Mazouzi
    Mohamed, Benmohammed
    2013 INTERNATIONAL CONFERENCE ON COMPUTER APPLICATIONS TECHNOLOGY (ICCAT), 2013,
  • [42] AUTOMATIC SEGMENTATION AND LABELING OF SPEECH-BASED ON HIDDEN MARKOV-MODELS
    BRUGNARA, F
    FALAVIGNA, D
    OMOLOGO, M
    SPEECH COMMUNICATION, 1993, 12 (04) : 357 - 370
  • [43] Squat Movement Recognition Using Hidden Markov Models
    Rungsawasdisap, Nantana
    Yimit, Adiljan
    Lu, Xin
    Hagihara, Yoshihiro
    2018 INTERNATIONAL WORKSHOP ON ADVANCED IMAGE TECHNOLOGY (IWAIT), 2018,
  • [44] Eye movement analysis with switching hidden Markov models
    Tim Chuk
    Antoni B. Chan
    Shinsuke Shimojo
    Janet H. Hsiao
    Behavior Research Methods, 2020, 52 : 1026 - 1043
  • [45] Eye movement analysis with switching hidden Markov models
    Chuk, Tim
    Chan, Antoni B.
    Shimojo, Shinsuke
    Hsiao, Janet H.
    BEHAVIOR RESEARCH METHODS, 2020, 52 (03) : 1026 - 1043
  • [46] Predicting Reading Performance based on Eye Movement Analysis with Hidden Markov Models
    Zheng, Yueyuan
    Que, Ying
    Hu, Xiao
    Hsiao, Janet H.
    2022 INTERNATIONAL CONFERENCE ON ADVANCED LEARNING TECHNOLOGIES (ICALT 2022), 2022, : 172 - 176
  • [47] Estimating personality impression from speech record using hidden Markov models
    Jin, Yicheng
    Sakuma, Takuto
    Kato, Shohei
    Kunitachi, Tsutomu
    IEEJ Transactions on Electronics, Information and Systems, 2015, 135 (12) : 1517 - 1523
  • [48] Generating emphatic speech with hidden Markov model for expressive speech synthesis
    Wu, Zhiyong
    Ning, Yishuang
    Zang, Xiao
    Jia, Jia
    Meng, Fanbo
    Meng, Helen
    Cai, Lianhong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (22) : 9909 - 9925
  • [49] A Bayesian Approach to Hidden Semi-Markov Model Based Speech Synthesis
    Hashimoto, Kei
    Nankaku, Yoshihiko
    Tokuda, Keiichi
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1719 - 1722
  • [50] Generating emphatic speech with hidden Markov model for expressive speech synthesis
    Zhiyong Wu
    Yishuang Ning
    Xiao Zang
    Jia Jia
    Fanbo Meng
    Helen Meng
    Lianhong Cai
    Multimedia Tools and Applications, 2015, 74 : 9909 - 9925