An algorithm for voice conversion with limited corpus

被引:0
|
作者
GU Dong [1 ]
JIAN Zhihua [1 ]
机构
[1] School of Communication Engineering, Hangzhou Dianzi University
关键词
DTW; An algorithm for voice conversion with limited corpus;
D O I
10.15949/j.cnki.0217-9776.2018.03.008
中图分类号
TN912.3 [语音信号处理];
学科分类号
0711 ;
摘要
Under the condition of limited target speaker’s corpus, this paper proposed an algorithm for voice conversion using unified tensor dictionary with limited corpus. Firstly,parallel speech of N speakers was selected randomly from the speech corpus to build the base of tensor dictionary. And then, after the operation of multi-series dynamic time warping for those chosen speech, N two-dimension basic dictionaries can be generated which constituted the unified tensor dictionary. During the conversion stage, the two dictionaries of source and target speaker were established by linear combination of the N basic dictionaries using the two speakers’ speech. The experimental results showed that when the number of the basic speaker was 14, our algorithm can obtain the compared performance of the traditional NMFbased method with few target speaker corpus, which greatly facilitate the application of voice conversion system.
引用
收藏
页码:371 / 384
页数:14
相关论文
共 50 条
  • [1] Factorized WaveNet for voice conversion with limited data
    Du, Hongqiang
    Tian, Xiaohai
    Xie, Lei
    Li, Haizhou
    SPEECH COMMUNICATION, 2021, 130 : 45 - 54
  • [2] DeepConversion: Voice conversion with limited parallel training data
    Zhang, Mingyang
    Sisman, Berrak
    Zhao, Li
    Li, Haizhou
    SPEECH COMMUNICATION, 2020, 122 : 31 - 43
  • [3] WaveNet Vocoder with Limited Training Data for Voice Conversion
    Liu, Li-Juan
    Ling, Zhen-Hua
    Yuan-Jiang
    Ming-Zhou
    Dai, Li-Rong
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1983 - 1987
  • [4] EFFECTIVE WAVENET ADAPTATION FOR VOICE CONVERSION WITH LIMITED DATA
    Du, Hongqiang
    Tian, Xiaohai
    Xie, Lei
    Li, Haizhou
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7779 - 7783
  • [5] A method for voice conversion based on viterbi algorithm
    Jian, Zhi-Hua
    Yang, Zhen
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2009, 37 (07): : 1470 - 1475
  • [6] Voice conversion based on feature combination with limited training data
    Ghorbandoost, Mostafa
    Sayadiyan, Abolghasem
    Ahangar, Mohsen
    Sheikhzadeh, Hamid
    Shahrebabaki, Abdoreza Sabzi
    Amini, Jamal
    SPEECH COMMUNICATION, 2015, 67 : 113 - 128
  • [7] A novel method for voice conversion based on non-parallel corpus
    Sayadian A.
    Mozaffari F.
    International Journal of Speech Technology, 2017, 20 (3) : 587 - 592
  • [8] Restricted Boltzmann Machine-Based Voice Conversion for Nonparallel Corpus
    Lee, Ki-Seung
    IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (08) : 1103 - 1107
  • [9] Voice Conversion and Spoofed Voice Detection from Parallel English and Urdu Corpus using Cyclic GANs
    Saleem, Summra
    Dilawari, Aniqa
    Khan, Muhammad Usman Ghani
    Husnain, Muhammad
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION IN INDUSTRY (ICRAI), 2019,
  • [10] Many-to-many voice conversion experiments using a Korean speech corpus
    Yook, Dongsuk
    Seo, HyungJin
    Ko, Bonggu
    Yoo, In-Chul
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2022, 41 (03): : 351 - 358