An Analysis of Language Mismatch in HMM State Mapping-Based Cross-Lingual Speaker Adaptation

被引:0
|
作者
Liang, Hui [1 ]
Dines, John [1 ]
机构
[1] Idiap Res Inst, Martigny, Switzerland
关键词
HMM-based TTS; cross-lingual speaker adaptation; HMM state mapping; language mismatch;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper provides an in-depth analysis of the impacts of language mismatch on the performance of cross-lingual speaker adaptation. Our work confirms the influence of language mismatch between average voice distributions for synthesis and for transform estimation and the necessity of eliminating this mismatch in order to effectively utilize multiple transforms for cross-lingual speaker adaptation. Specifically, we show that language mismatch introduces unwanted language-specific information when estimating multiple transforms, thus making these transforms detrimental to adaptation performance. Our analysis demonstrates speaker characteristics should be separated from language characteristics in order to improve cross-lingual adaptation performance.
引用
收藏
页码:622 / 625
页数:4
相关论文
共 50 条
  • [31] Language model adaptation in Tamil language using cross-lingual latent semantic analysis with document aligned corpora
    Selvam, M.
    Natarajan, A. M.
    CURRENT SCIENCE, 2010, 98 (07): : 922 - 929
  • [32] Transform Mapping Using Shared Decision Tree Context Clustering for HMM-Based Cross-Lingual Speech Synthesis
    Nagahama, Daiki
    Nose, Takashi
    Koriyama, Tomoki
    Kobayashi, Takao
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 770 - 774
  • [33] Distillation Language Adversarial Network for Cross-lingual Sentiment Analysis
    Wang, Deheng
    Yang, Aimin
    Zhou, Yongmei
    Xie, Fenfang
    Ouyang, Zhouhao
    Peng, Sancheng
    2022 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2022), 2022, : 45 - 50
  • [34] Cross-lingual projection for class-based language models
    Gfeller, Beat
    Schogol, Vlad
    Hall, Keith
    PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2016), VOL 2, 2016, : 83 - 88
  • [35] Estimation of Perceptual Spaces for Speaker Identities Based on the Cross-Lingual Discrimination Task
    Tsuzaki, Minoru
    Tokuda, Keiichi
    Kawai, Hisashi
    Ni, Jinfu
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 164 - +
  • [36] On the Robustness of Cross-lingual Speaker Recognition using Transformer-based Approaches
    Liao, Wen-Hung
    Chen, Wei-Yu
    Wu, Yi-Chieh
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 366 - 371
  • [37] A cross-lingual approach to the development of an HMM-based speech synthesis system for Malay
    Mustafa, Mumtaz B.
    Ainon, Raja N.
    Zainuddin, Roziati
    Don, Zuraidah M.
    Knowles, Gerry
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 3204 - 3207
  • [38] Cross-lingual adaptation of a CTC-based multilingual acoustic model
    Tong, Sibo
    Garner, Philip N.
    Bourlard, Herve
    SPEECH COMMUNICATION, 2018, 104 : 39 - 46
  • [39] CROSS-LINGUAL TEXT-INDEPENDENT SPEAKER VERIFICATION USING UNSUPERVISED ADVERSARIAL DISCRIMINATIVE DOMAIN ADAPTATION
    Xia, Wei
    Huang, Jing
    Hansen, John H. L.
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5816 - 5820
  • [40] CAM: A cross-lingual adaptation framework for low-resource language speech recognition
    Hu, Qing
    Zhang, Yan
    Zhang, Xianlei
    Han, Zongyu
    Yu, Xilong
    INFORMATION FUSION, 2024, 111