Speaker Adaptation using Nonlinear Regression Techniques for HMM-based Speech Synthesis

被引:0
|
作者
Hong, Doo Hwa [1 ]
Kang, Shin Jae
Lee, Joun Yeop
Kim, Nam Soo
机构
[1] Seoul Natl Univ, Dept Elect & Comp Engn, Seoul 151742, South Korea
关键词
maximum likelihood linear regression (MLLR); HMM-based speech synthesis; kernel; maximum penalized likelihood kernel regression (MPLKR); LIKELIHOOD LINEAR-REGRESSION; KERNEL REGRESSION;
D O I
10.1109/IIH-MSP.2014.152
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The maximum likelihood linear regression (MLLR) technique is a well-known approach to parameter adaptation in hidden Markov model (HMM)-based systems. In this paper, we propose the maximum penalized likelihood kernel regression (MPLKR) approach as a novel adaptation technique for HMM-based speech synthesis. The proposed algorithm performs a nonlinear regression between the mean vector of the base model and the corresponding mean vector of adaptive data by means of a kernel method. In the experiments, we used various types of parametric kernels for the proposed algorithm and compared their performances with the conventional method. From experimental results, it has been found that the proposed algorithm outperforms the conventional method in terms of the objective measure as well as the subjective listening quality.
引用
收藏
页码:586 / 589
页数:4
相关论文
共 50 条
  • [41] Czech HMM-Based Speech Synthesis
    Hanzlicek, Zdenek
    TEXT, SPEECH AND DIALOGUE, 2010, 6231 : 291 - 298
  • [42] Robustness of HMM-based Speech Synthesis
    Yamagishi, Junichi
    Ling, Zhenhua
    King, Simon
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 581 - 584
  • [43] Rapid Adaptation of Foreign-accented HMM-based Speech Synthesis
    Karhila, Reima
    Wester, Mirjam
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2812 - +
  • [44] Arabic HMM-based Speech Synthesis
    Khalil, Krichi Mohamed
    Adnan, Cherif
    2013 INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND SOFTWARE APPLICATIONS (ICEESA), 2013, : 450 - 454
  • [45] HMM-based Speech Synthesis with a Flexible Mandarin Stress Adaptation Model
    Li, Ya
    Pan, Shifeng
    Tao, Jianhua
    2010 IEEE 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS (ICSP2010), VOLS I-III, 2010, : 625 - 628
  • [46] HMM-Based Vietnamese Speech Synthesis
    Trinh, Son
    Hoang, Kiem
    INTERNATIONAL JOURNAL OF SOFTWARE INNOVATION, 2015, 3 (04) : 33 - 47
  • [47] HMM-Based Speaker Emotional Recognition Technology for Speech Signal
    Qin, Yuqiang
    Zhang, Xueying
    FRONTIERS OF MANUFACTURING SCIENCE AND MEASURING TECHNOLOGY, PTS 1-3, 2011, 230-232 : 261 - 265
  • [48] UNSUPERVISED CROSS-LINGUAL SPEAKER ADAPTATION FOR HMM-BASED SPEECH SYNTHESIS USING TWO-PASS DECISION TREE CONSTRUCTION
    Gibson, Matthew
    Hirsimaki, Teemu
    Karhila, Reima
    Kurimo, Mikko
    Byrne, William
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4642 - 4645
  • [49] A very low bit rate speech coder using HMM-based speech recognition synthesis techniques
    Tokuda, K
    Masuko, T
    Hiroi, J
    Kobayashi, T
    Kitamura, T
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 609 - 612
  • [50] Speaking style adaptation using context clustering decision tree for HMM-based speech synthesis
    Yamagishi, J
    Tachibana, M
    Masuko, T
    Kobayashi, T
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 5 - 8