Analysis of HMM-Based Lombard Speech Synthesis

被引:0
|
作者
Raitio, Tuomo [1 ]
Suni, Antti [2 ]
Vainio, Martti [2 ]
Alku, Paavo [1 ]
机构
[1] Aalto Univ, Dept Signal Proc & Acoust, Helsinki, Finland
[2] Univ Helsinki, Dept Speech Sci, Helsinki, Finland
基金
芬兰科学院;
关键词
speech synthesis; HMM; Lombard effect; speech-in-noise;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Humans modify their voice in interfering noise in order to maintain the intelligibility of their speech - this is called the Lombard effect This ability, however, has not been extensively modeled in speech synthesis. Here we compare several methods of synthesizing speech in noise using a physiologically based statistical speech synthesis system (GlottHMM). The results show that in a realistic street noise situation the synthetic Lombard speech is judged by listeners both as appropriate for the situation and as intelligible as natural Lombard speech. Of the different types of models, one using adaptation and extrapolation performed the best.
引用
收藏
页码:2792 / +
页数:2
相关论文
共 50 条
  • [41] x Formant-controlled HMM-based Speech Synthesis
    Lei, Ming
    Yamagishi, Junichi
    Richmond, Korin
    Ling, Zhen-Hua
    King, Simon
    Dai, Li-Rong
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2788 - +
  • [42] FACTORED MLLR ADAPTATION FOR HMM-BASED EXPRESSIVE SPEECH SYNTHESIS
    Sung, June Sig
    Hong, Doo Hwa
    Lee, Chul Min
    Kim, Nam Soo
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 974 - 977
  • [43] Two-band excitation for HMM-based speech synthesis
    Kim, Sang-Jin
    Hahn, Minsoo
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2007, E90D (01) : 378 - 381
  • [44] An acoustic model adaptation using hmm-based speech synthesis
    Tanaka, K
    Kuroiwa, S
    Tsuge, S
    Ren, F
    2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 368 - 373
  • [45] A Covariance-Tying Technique for HMM-Based Speech Synthesis
    Oura, Keiichiro
    Zen, Heiga
    Nankaku, Yoshihiko
    Lee, Akinobu
    Tokuda, Keiichi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (03): : 595 - 601
  • [46] FACTOR ANALYZED VOICE MODELS FOR HMM-BASED SPEECH SYNTHESIS
    Kazumi, Kyosuke
    Nankaku, Yoshihiko
    Tokuda, Keiichi
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4234 - 4237
  • [47] Data Selection and Adaptation for Naturalness in HMM-based Speech Synthesis
    Cooper, Erica
    Chang, Alison
    Levitan, Yocheved
    Hirschberg, Julia
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 357 - +
  • [48] Emotion transplantation through adaptation in HMM-based speech synthesis
    Lorenzo-Trueba, Jaime
    Barra-Chicote, Roberto
    San-Segundo, Ruben
    Ferreiros, Javier
    Yamagishi, Junichi
    Montero, Juan M.
    COMPUTER SPEECH AND LANGUAGE, 2015, 34 (01): : 292 - 307
  • [49] Speaker adaptation of pitch and spectrum for HMM-based speech synthesis
    Tamura, M., 1600, John Wiley and Sons Inc. (35):
  • [50] CONTEXTUAL PARTIAL ADDITIVE STRUCTURE FOR HMM-BASED SPEECH SYNTHESIS
    Takaki, Shinji
    Nankaku, Yoshihiko
    Tokuda, Keiichi
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7878 - 7882