Speaker Adaptation using Nonlinear Regression Techniques for HMM-based Speech Synthesis

被引：0

作者：

Hong, Doo Hwa ^{[1
]}

Kang, Shin Jae

Lee, Joun Yeop

Kim, Nam Soo

机构：

[1] Seoul Natl Univ, Dept Elect & Comp Engn, Seoul 151742, South Korea

来源：

2014 TENTH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING (IIH-MSP 2014) | 2014年

关键词：

maximum likelihood linear regression (MLLR); HMM-based speech synthesis; kernel; maximum penalized likelihood kernel regression (MPLKR); LIKELIHOOD LINEAR-REGRESSION; KERNEL REGRESSION;

D O I：

10.1109/IIH-MSP.2014.152

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The maximum likelihood linear regression (MLLR) technique is a well-known approach to parameter adaptation in hidden Markov model (HMM)-based systems. In this paper, we propose the maximum penalized likelihood kernel regression (MPLKR) approach as a novel adaptation technique for HMM-based speech synthesis. The proposed algorithm performs a nonlinear regression between the mean vector of the base model and the corresponding mean vector of adaptive data by means of a kernel method. In the experiments, we used various types of parametric kernels for the proposed algorithm and compared their performances with the conventional method. From experimental results, it has been found that the proposed algorithm outperforms the conventional method in terms of the objective measure as well as the subjective listening quality.

引用

页码：586 / 589

页数：4

共 50 条

[41] Czech HMM-Based Speech Synthesis
Hanzlicek, Zdenek
TEXT, SPEECH AND DIALOGUE, 2010, 6231 : 291 - 298
[42] Robustness of HMM-based Speech Synthesis
Yamagishi, Junichi
Ling, Zhenhua
King, Simon
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 581 - 584
[43] Rapid Adaptation of Foreign-accented HMM-based Speech Synthesis
Karhila, Reima
Wester, Mirjam
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2812 - +
[44] Arabic HMM-based Speech Synthesis
Khalil, Krichi Mohamed
Adnan, Cherif
2013 INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND SOFTWARE APPLICATIONS (ICEESA), 2013, : 450 - 454
[45] HMM-based Speech Synthesis with a Flexible Mandarin Stress Adaptation Model
Li, Ya
Pan, Shifeng
Tao, Jianhua
2010 IEEE 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS (ICSP2010), VOLS I-III, 2010, : 625 - 628
[46] HMM-Based Vietnamese Speech Synthesis
Trinh, Son
Hoang, Kiem
INTERNATIONAL JOURNAL OF SOFTWARE INNOVATION, 2015, 3 (04) : 33 - 47
[47] HMM-Based Speaker Emotional Recognition Technology for Speech Signal
Qin, Yuqiang
Zhang, Xueying
FRONTIERS OF MANUFACTURING SCIENCE AND MEASURING TECHNOLOGY, PTS 1-3, 2011, 230-232 : 261 - 265
[48] UNSUPERVISED CROSS-LINGUAL SPEAKER ADAPTATION FOR HMM-BASED SPEECH SYNTHESIS USING TWO-PASS DECISION TREE CONSTRUCTION
Gibson, Matthew
Hirsimaki, Teemu
Karhila, Reima
Kurimo, Mikko
Byrne, William
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4642 - 4645
[49] A very low bit rate speech coder using HMM-based speech recognition synthesis techniques
Tokuda, K
Masuko, T
Hiroi, J
Kobayashi, T
Kitamura, T
PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 609 - 612
[50] Speaking style adaptation using context clustering decision tree for HMM-based speech synthesis
Yamagishi, J
Tachibana, M
Masuko, T
Kobayashi, T
2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 5 - 8

← 1 2 3 4 5 →