UNSUPERVISED CROSS-LINGUAL SPEAKER ADAPTATION FOR HMM-BASED SPEECH SYNTHESIS USING TWO-PASS DECISION TREE CONSTRUCTION

被引:9
|
作者
Gibson, Matthew [1 ]
Hirsimaki, Teemu [2 ]
Karhila, Reima [2 ]
Kurimo, Mikko [2 ]
Byrne, William [1 ]
机构
[1] Univ Cambridge, Dept Engn, Trumpington St, Cambridge CB2 1PZ, England
[2] Aalto Univ, FIN-5400 Helsinki, Finland
关键词
HMM-based speech synthesis; unsupervised speaker adaptation; cross-lingual;
D O I
10.1109/ICASSP.2010.5495196
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper demonstrates how unsupervised cross-lingual adaptation of HMM-based speech synthesis models may be performed without explicit knowledge of the adaptation data language. A two-pass decision tree construction technique is deployed for this purpose. Using parallel translated datasets, cross-lingual and intralingual adaptation are compared in a controlled manner. Listener evaluations reveal that the proposed method delivers performance approaching that of unsupervised intralingual adaptation.
引用
收藏
页码:4642 / 4645
页数:4
相关论文
共 50 条
  • [21] Cross-lingual speaker adaptation using domain adaptation and speaker consistency loss for text-to-speech synthesis
    Xin, Detai
    Saito, Yuki
    Takamichi, Shinnosuke
    Koriyama, Tomoki
    Saruwatari, Hiroshi
    Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2021, 5 : 3376 - 3380
  • [22] Nearest Neighbor Approach in Speaker Adaptation for HMM-based Speech Synthesis
    Mohammadi, Amir
    Demiroglu, Cenk
    2013 21ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2013,
  • [23] Extended Decision Tree with OR Relationship for HMM-based Speech Synthesis
    Wang, Yang
    Tao, Jianhua
    Yang, Minghao
    Li, Ya
    2013 SECOND IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR 2013), 2013, : 225 - 229
  • [24] SPEAKER SIMILARITY EVALUATION OF FOREIGN-ACCENTED SPEECH SYNTHESIS USING HMM-BASED SPEAKER ADAPTATION
    Wester, Mirjam
    Karhila, Reima
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5372 - 5375
  • [25] An Analysis of Language Mismatch in HMM State Mapping-Based Cross-Lingual Speaker Adaptation
    Liang, Hui
    Dines, John
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 622 - 625
  • [26] Analysis of Speaker Adaptation Algorithms for HMM-Based Speech Synthesis and a Constrained SMAPLR Adaptation Algorithm
    Yamagishi, Junichi
    Kobayashi, Takao
    Nakano, Yuji
    Ogata, Katsumi
    Isogai, Juri
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (01): : 66 - 83
  • [27] Towards automatic cross-lingual acoustic modelling applied to HMM-based speech synthesis for under-resourced languages
    Justin, Tadej
    Mihelic, France
    Zibert, Janez
    AUTOMATIKA, 2016, 57 (01) : 268 - 281
  • [28] An acoustic model adaptation using hmm-based speech synthesis
    Tanaka, K
    Kuroiwa, S
    Tsuge, S
    Ren, F
    2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 368 - 373
  • [29] HMM-Based Emphatic Speech Synthesis Using Unsupervised Context Labeling
    Maeno, Yu
    Nose, Takashi
    Kobayashi, Takao
    Ijima, Yusuke
    Nakajima, Hideharu
    Mizuno, Hideyuki
    Yoshioka, Osamu
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1860 - +
  • [30] Speaker and style adaptation using average voice model for style control in HMM-based speech synthesis
    Tachibana, Makoto
    Izawa, Shinsuke
    Nose, Takashi
    Kobayashi, Takao
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4633 - 4636