Speaker Adaptation using Relevance Vector Regression for HMM-based Expressive TTS

Cited by: 0

Authors:

Hong, Doo Hwa [1]

Lee, Joun Yeop

Jang, Se Young

Kim, Nam Soo

Affiliations:

[1] Seoul Natl Univ, Dept Elect & Comp Engn, Seoul, South Korea

Source:

16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5 | 2015

Keywords:

speech synthesis; speaker adaptation; MLLR; relevance vector regression; LIKELIHOOD LINEAR-REGRESSION; KERNEL REGRESSION;

DOI:

Not available

Chinese Library Classification number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

The conventional maximum likelihood linear regression (MLLR)-based adaptation algorithm applied to acoustic hidden Markov models (HMMs) is restricted to linear regression, which is too limited to represent the details of the mapping characteristics. To overcome this problem, we propose a relevance vector regression (RVR)-based model parameter adaptation technique. In this framework, the conventional technique is extended to use many more basis functions. The weights for constructing the transform matrix are obtained by sparse Bayesian learning, in which most of the weights become zero because of the prior defined with precision hyper-parameters. Furthermore, by using appropriate kernel functions, RVR can combine the advantages of linear and nonlinear regression. In the experiments, an emotional speech database is used for adaptation to evaluate the proposed method against the conventional constrained MLLR. From the experimental results, we conclude that the RVR adaptation method performs better than the conventional method.
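
For orientation, the sparse Bayesian learning step mentioned in the abstract can be written in its standard textbook form. This is a sketch of the general relevance vector regression model, not the paper's own notation: the symbols \phi_i, w_i, \alpha_i, \beta, \Phi and \mathbf{t} are assumptions introduced here, and the abstract does not state how HMM parameters are mapped onto regression inputs and targets.

```latex
% Regression model with M basis (kernel) functions and Gaussian noise
y(\mathbf{x}) = \sum_{i=1}^{M} w_i \, \phi_i(\mathbf{x}), \qquad
t = y(\mathbf{x}) + \epsilon, \quad \epsilon \sim \mathcal{N}(0, \beta^{-1})

% Zero-mean Gaussian prior with one precision hyper-parameter per weight
p(\mathbf{w} \mid \boldsymbol{\alpha}) = \prod_{i=1}^{M} \mathcal{N}\!\left(w_i \mid 0, \alpha_i^{-1}\right)

% Posterior over the weights given the design matrix \Phi and targets \mathbf{t}
\boldsymbol{\Sigma} = \left(\beta \, \boldsymbol{\Phi}^{\top} \boldsymbol{\Phi} + \operatorname{diag}(\boldsymbol{\alpha})\right)^{-1}, \qquad
\mathbf{m} = \beta \, \boldsymbol{\Sigma} \, \boldsymbol{\Phi}^{\top} \mathbf{t}
```

When the hyper-parameters are re-estimated by maximizing the marginal likelihood, many \alpha_i grow without bound, so the posterior of the corresponding w_i concentrates at zero and those basis functions are effectively pruned. This is the sense in which most of the weights become zero.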


Pages: 1216-1220

Number of pages: 5

