Incorporating Global Variance in the Training Phase of GMM-based Voice Conversion

被引:0
|
作者
Hwang, Hsin-Te [1 ,3 ]
Tsao, Yu [2 ]
Wang, Hsin-Min [3 ]
Wang, Yih-Ru [1 ]
Chen, Sin-Horng [1 ]
机构
[1] Natl Chiao Tung Univ, Dept Elect & Comp Engn, Hsinchu, Taiwan
[2] Acad Sinica, Res Ctr Infomrat Technol Innovat, Taipei, Taiwan
[3] Acad Sinica, Inst Informat Sci, Taipei, Taiwan
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Maximum likelihood-based trajectory mapping considering global variance (MLGV-based trajectory mapping) has been proposed for improving the quality of the converted speech of Gaussian mixture model-based voice conversion (GMM-based VC). Although the quality of the converted speech is significantly improved, the computational cost of the online conversion process is also increased because there is no closed form solution for parameter generation in MLGV-based trajectory mapping, and an iterative process is generally required. To reduce the online computational cost, we propose to incorporate GV in the training phase of GMM-based VC. Then, the conversion process can simply adopt ML-based trajectory mapping (without considering GV in the conversion phase), which has a closed form solution. In this way, it is expected that the quality of the converted speech can be improved without increasing the online computational cost. Our experimental results demonstrate that the proposed method yields a significant improvement in the quality of the converted speech comparing to the conventional GMM-based VC method. Meanwhile, comparing to MLGV-based trajectory mapping, the proposed method provides comparable converted speech quality with reduced computational cost in the conversion process.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Voice Conversion using GMAT with Enhanced Global Variance
    Benisty, Hadas
    Malah, David
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 676 - 679
  • [22] A GMM based residual prediction method for voice conversion
    Xia, J
    Yin, JX
    ISPACS 2005: PROCEEDINGS OF THE 2005 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS, 2005, : 389 - 392
  • [23] A GMM-Based Phase Group Identification for Residential Low Voltage Networks
    Karunarathne, Eshan
    Simonovska, Angela
    Ochoa, Luis F.
    Alpcan, Tansu
    2023 IEEE POWER & ENERGY SOCIETY GENERAL MEETING, PESGM, 2023,
  • [24] Voice Conversion Based on STRAIGHT and UBM-GMM
    Gao Yingying
    Zhu Weibin
    PROCEEDINGS OF 2009 CONFERENCE ON COMMUNICATION FACULTY, 2009, : 342 - 345
  • [25] Objective Comparison of Four GMM-Based Methods for PMA-to-Speech Conversion
    Erro, Daniel
    Hernaez, Inma
    Serrano, Luis
    Saratxaga, Ibon
    Navas, Eva
    ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES, IBERSPEECH 2016, 2016, 10077 : 24 - 32
  • [26] Statistical Singing Voice Conversion based on Direct Waveform Modification with Global Variance
    Kobayashi, Kazuhiro
    Toda, Tomoki
    Neubig, Graham
    Sakti, Sakriani
    Nakamura, Satoshi
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2754 - 2758
  • [27] GMM-Based Synthetic Samples for Classification of Hyperspectral Images With Limited Training Data
    Davari, Amirabbas
    Aptoula, Erchan
    Yanikoglu, Berrin
    Maier, Andreas
    Riess, Christian
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2018, 15 (06) : 942 - 946
  • [28] A Multi-level GMM-Based Cross-Lingual Voice Conversion Using Language-Specific Mixture Weights for Polyglot Synthesis
    Ramani, B.
    Jeeva, M. P. Actlin
    Vijayalakshmi, P.
    Nagarajan, T.
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2016, 35 (04) : 1283 - 1311
  • [29] Voice Conversion Based on Improved GMM and Spectrum with Synchronous Prosody
    Zhang Bing
    Yu Yibiao
    ICSP: 2008 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-5, PROCEEDINGS, 2008, : 659 - 662
  • [30] Cepstrum Liftering based Voice Conversion using RBF and GMM
    Nirmal, Jagannath
    Kachare, Pramod
    Patnaik, Suprava
    Zaveri, Mukesh
    2013 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2013, : 570 - 575