An algorithm for voice conversion with limited corpus

Cited by: 0

Authors:

GU Dong ^{[1]}

JIAN Zhihua ^{[1]}

Affiliation:

[1] School of Communication Engineering, Hangzhou Dianzi University

Source:

Chinese Journal of Acoustics | 2018 / Vol. 37 / No. 03

Keywords:

DTW; An algorithm for voice conversion with limited corpus;

DOI:

10.15949/j.cnki.0217-9776.2018.03.008

Chinese Library Classification number:

TN912.3 [Speech signal processing];

Discipline classification code:

0711 ;

Abstract:

To address the case where only a limited corpus is available for the target speaker, this paper proposes a voice conversion algorithm based on a unified tensor dictionary. First, parallel speech from N speakers is selected randomly from the speech corpus to form the basis of the tensor dictionary. Multi-series dynamic time warping is then applied to the selected speech to generate N two-dimensional basic dictionaries, which together constitute the unified tensor dictionary. During the conversion stage, the dictionaries of the source and target speakers are each established as a linear combination of the N basic dictionaries, using the two speakers' speech. The experimental results show that, with 14 basic speakers, the algorithm achieves performance comparable to that of the traditional NMF-based method while using only a small amount of target-speaker corpus, which greatly facilitates the application of voice conversion systems.
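The conversion stage described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the abstract does not say how the combination weights are estimated, so `w_src` and `w_tgt` are hypothetical inputs here, the array shapes and function names are assumptions, and the activations are estimated with standard Euclidean multiplicative NMF updates as a stand-in for whatever solver the paper uses.

```python
import numpy as np

def combine_dictionaries(basis, weights):
    """Weighted sum of N basic dictionaries: (N, F, K) and (N,) -> (F, K)."""
    return np.tensordot(weights, basis, axes=1)

def estimate_activations(dictionary, spectra, n_iter=100, eps=1e-12):
    """Non-negative H with spectra ~= dictionary @ H (assumed Euclidean updates)."""
    rng = np.random.default_rng(0)
    H = rng.random((dictionary.shape[1], spectra.shape[1]))
    for _ in range(n_iter):
        # Multiplicative update keeps H non-negative at every step.
        H *= (dictionary.T @ spectra) / (dictionary.T @ dictionary @ H + eps)
    return H

def convert(basis, w_src, w_tgt, src_spectra):
    """Share the source activations with the target speaker's dictionary."""
    A_src = combine_dictionaries(basis, w_src)
    A_tgt = combine_dictionaries(basis, w_tgt)
    H = estimate_activations(A_src, src_spectra)
    return A_tgt @ H

# Toy data: 14 basic speakers, 20 spectral bins, 30 atoms, 5 frames (all hypothetical).
rng = np.random.default_rng(42)
N, F, K, T = 14, 20, 30, 5
basis = rng.random((N, F, K))          # stands in for the unified tensor dictionary
w_src = rng.random(N); w_src /= w_src.sum()
w_tgt = rng.random(N); w_tgt /= w_tgt.sum()
src_spectra = rng.random((F, T))       # stands in for source spectral frames

converted = convert(basis, w_src, w_tgt, src_spectra)
print(converted.shape)  # (20, 5)
```

In this sketch only the two weight vectors are speaker-specific, which is one way to read why little target-speaker speech is needed once the N basic dictionaries exist.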

Citation

Pages: 371-384

Page count: 14
