An Analysis of Language Mismatch in HMM State Mapping-Based Cross-Lingual Speaker Adaptation

被引:0
|
作者
Liang, Hui [1 ]
Dines, John [1 ]
机构
[1] Idiap Res Inst, Martigny, Switzerland
关键词
HMM-based TTS; cross-lingual speaker adaptation; HMM state mapping; language mismatch;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper provides an in-depth analysis of the impacts of language mismatch on the performance of cross-lingual speaker adaptation. Our work confirms the influence of language mismatch between average voice distributions for synthesis and for transform estimation and the necessity of eliminating this mismatch in order to effectively utilize multiple transforms for cross-lingual speaker adaptation. Specifically, we show that language mismatch introduces unwanted language-specific information when estimating multiple transforms, thus making these transforms detrimental to adaptation performance. Our analysis demonstrates speaker characteristics should be separated from language characteristics in order to improve cross-lingual adaptation performance.
引用
收藏
页码:622 / 625
页数:4
相关论文
共 50 条
  • [41] Non-Linearity in mapping based Cross-Lingual Word Embeddings
    Zhao, Jiawei
    Gilman, Andrew
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 3583 - 3589
  • [42] Neural Cross-Lingual Relation Extraction Based on BilingualWord Embedding Mapping
    Ni, Jian
    Florian, Radu
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 399 - 409
  • [43] Cross-Lingual Leveled Reading Based on Language-Invariant Features
    Rao, Simin
    Zheng, Hua
    Li, Sujian
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 2677 - 2682
  • [44] Cross-lingual Text-To-Speech Synthesis via Domain Adaptation and Perceptual Similarity Regression in Speaker Space
    Xin, Detai
    Saito, Yuki
    Takamichi, Shinnosuke
    Koriyama, Tomoki
    Saruwatari, Hiroshi
    INTERSPEECH 2020, 2020, : 2947 - 2951
  • [45] Narrowing the language gap: domain adaptation guided cross-lingual passage re-ranking
    Dongmei Chen
    Xin Zhang
    Sheng Zhang
    Neural Computing and Applications, 2023, 35 : 20735 - 20748
  • [46] Narrowing the language gap: domain adaptation guided cross-lingual passage re-ranking
    Chen, Dongmei
    Zhang, Xin
    Zhang, Sheng
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (28): : 20735 - 20748
  • [47] A configurable translation-based cross-lingual ontology mapping system to adjust mapping outcomes
    Fu, Bo
    Brennan, Rob
    O'Sullivan, Declan
    JOURNAL OF WEB SEMANTICS, 2012, 15 : 15 - 36
  • [48] AA SPECTRAL SPACE WARPING APPROACH TO CROSS-LINGUAL VOICE TRANSFORMATION IN HMM-BASED TTS
    Wang, Hao
    Soong, Frank
    Meng, Helen
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4874 - 4878
  • [49] A New HMM-Based Voice Conversion Methodology Evaluated on Monolingual and Cross-Lingual Conversion Tasks
    Percybrooks, Winston S.
    Moore, Elliot
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (12) : 2298 - 2310
  • [50] Cross-lingual Acoustic Model Adaptation based on Transfer Vector Field Smoothing with MAP
    Saiko, Masahiro
    Matsuda, Shigeki
    Hanazawa, Ken
    Isotani, Ryosuke
    Hori, Chiori
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3321 - 3325