An algorithm for voice conversion with limited corpus

被引:0
|
作者
GU Dong [1 ]
JIAN Zhihua [1 ]
机构
[1] School of Communication Engineering, Hangzhou Dianzi University
关键词
DTW; An algorithm for voice conversion with limited corpus;
D O I
10.15949/j.cnki.0217-9776.2018.03.008
中图分类号
TN912.3 [语音信号处理];
学科分类号
0711 ;
摘要
Under the condition of limited target speaker’s corpus, this paper proposed an algorithm for voice conversion using unified tensor dictionary with limited corpus. Firstly,parallel speech of N speakers was selected randomly from the speech corpus to build the base of tensor dictionary. And then, after the operation of multi-series dynamic time warping for those chosen speech, N two-dimension basic dictionaries can be generated which constituted the unified tensor dictionary. During the conversion stage, the two dictionaries of source and target speaker were established by linear combination of the N basic dictionaries using the two speakers’ speech. The experimental results showed that when the number of the basic speaker was 14, our algorithm can obtain the compared performance of the traditional NMFbased method with few target speaker corpus, which greatly facilitate the application of voice conversion system.
引用
收藏
页码:371 / 384
页数:14
相关论文
共 50 条
  • [21] Voice Conversion Based on Unified Dictionary with Clustered Features Between Non-parallel Corpus
    Jin, Hui
    Yu, Yi-Biao
    2018 4TH ANNUAL INTERNATIONAL CONFERENCE ON NETWORK AND INFORMATION SYSTEMS FOR COMPUTERS (ICNISC 2018), 2018, : 229 - 232
  • [22] HMM-Based Maximum Likelihood Frame Alignment for Voice Conversion from a Nonparallel Corpus
    Lee, Ki-Seung
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, E100D (12): : 3064 - 3067
  • [23] Emotional Voice Conversion for Mandarin using Tone Nucleus Model - Small Corpus and High Efficiency
    Wang, Miaomiao
    Wen, Miaomiao
    Hirose, Keikichi
    Minematsu, Nobuaki
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON SPEECH PROSODY, VOLS I AND II, 2012, : 163 - 166
  • [24] Voice conversion algorithm based on piecewise linear conversion rules of formant frequency and spectrum tilt
    NTT Human Interface Lab, Kanagawa, Japan
    Speech Commun, 2 (153-164):
  • [25] VOICE CONVERSION
    CHILDERS, DG
    WU, K
    HICKS, DM
    YEGNANARAYANA, B
    SPEECH COMMUNICATION, 1989, 8 (02) : 147 - 158
  • [26] Taco-VC: A Single Speaker Tacotron based Voice Conversion with Limited Data
    Levy-Leshem, Roee
    Giryes, Raja
    28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 391 - 395
  • [27] Voice Conversion Based on Em pirical Conditional Distribution in Resource-limited Scenarios
    Xu, Ning
    Tang, Yibin
    Bao, Jingyi
    Yao, Xiao
    Jiang, Aimin
    Liu, Xiaofeng
    2015 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TW), 2015, : 172 - 173
  • [28] Voice conversion based on Gaussian processes by coherent and asymmetric training with limited training data
    Xu, Ning
    Tang, Yibing
    Bao, Jingyi
    Jiang, Aiming
    Liu, Xiaofeng
    Yang, Zhen
    SPEECH COMMUNICATION, 2014, 58 : 124 - 138
  • [29] A noise robust voice conversion algorithm based on joint dictionary optimization
    Zhang, Shilei
    Jian, Zhihua
    Sun, Minhong
    Zhong, Hua
    Liu, Erxiao
    Shengxue Xuebao/Acta Acustica, 2019, 44 (06): : 1074 - 1082
  • [30] INCA Algorithm for Training Voice Conversion Systems From Nonparallel Corpora
    Erro, Daniel
    Moreno, Asuncion
    Bonafonte, Antonio
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (05): : 944 - 953