High-quality voice conversion system based on GMM statistical parameters and RBF neural network

被引:0
|
作者
CHEN Xian-tong
ZHANG Ling-hua
机构
[1] CollegeofTelecommunicationsandInformationEngineering,NanjingUniversityofPostsandTelecommunications
关键词
D O I
暂无
中图分类号
TN912.3 [语音信号处理];
学科分类号
0711 ;
摘要
A voice conversion(VC) system was designed based on Gaussian mixture model(GMM) and radial basis function(RBF) neural network. As a voice conversion model, RBF network needs quantities of training data to improve its performance. For one speech, the networks trained by different segments of data have different transformation effects. Since trying segment by segment to obtain the best conversion effect is complex, a conversion method was proposed, that uses GMM for statistics before training RBF network to aim at the problem. The speech transformation and representation using adaptive interpolation of weighted spectrum(STRAIGHT) model is used for accurate extraction of vocal tract spectrum. Then GMM is used to classify the numerous spectral parameters. The obtained mean parameters were trained in RBF network. Experiment reveals that, the soft classification ability of GMM can promptly realize the reduction and classification of training data under the premise of ensuring the training effect. The selection complexity is decreased thereafter. Compared to the conventional RBF network training methods, this method can make the transformation of spectral parameters more effective and improve the quality of converted speech.
引用
收藏
页码:68 / 75+93 +93
页数:9
相关论文
共 50 条
  • [21] A ANN BASED HIGH QUALITY METHOD FOR VOICE CONVERSION
    Chen, Z.
    Zhang, L. H.
    2010 6TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS NETWORKING AND MOBILE COMPUTING (WICOM), 2010,
  • [22] Automatic tuning of fuzzy controller parameters based on RBF neural network
    Juan, Wei
    Ping, Wang
    2ND INTERNATIONAL SYMPOSIUM ON COMPUTER NETWORK AND MULTIMEDIA TECHNOLOGY (CNMT 2010), VOLS 1 AND 2, 2010, : 191 - 194
  • [23] Mechanical Property Parameters Prediction of Tube Based on RBF Neural Network
    Jia Meihui
    Tang Chengtong
    Liu Jianhua
    Zhang Tian
    MECHATRONICS AND APPLIED MECHANICS II, PTS 1 AND 2, 2013, 300-301 : 882 - 888
  • [24] A novel voice morphing system using Bi-GMM for high quality transformation
    Xu, Ning
    Shao, Xi
    Yang, Zhen
    PROCEEDINGS OF NINTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING, 2008, : 485 - 489
  • [25] The identification of dynamic system based on memory RBF neural network
    Qiang, L
    Li, JX
    ISTM/2005: 6th International Symposium on Test and Measurement, Vols 1-9, Conference Proceedings, 2005, : 1080 - 1083
  • [26] Research of Machine Vision System Based on RBF Neural Network
    Ge Dongyuan
    Yao Xifan
    Chen Weixiong
    Zhang Qing
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, 2008, : 218 - 222
  • [27] Based on RBF neural network modeling of well test system
    Tian, Jia
    Energy Engineering and Environment Engineering, 2014, 535 : 606 - 609
  • [28] The Scheduling of Flexible Manufacturing System Based on RBF Neural Network
    Yu, Lianqing
    Zhang, Zhiming
    Mei, Shunqi
    PROCEEDINGS OF THE 2009 PACIFIC-ASIA CONFERENCE ON CIRCUITS, COMMUNICATIONS AND SYSTEM, 2009, : 678 - 681
  • [29] An ADRC Parameters Self-Tuning Control Strategy of Tension System Based on RBF Neural Network
    Liu, Shanhui
    Ding, Haodi
    Wang, Ziyu
    Li, Zheng
    Ma, Li'e
    JOURNAL OF RENEWABLE MATERIALS, 2023, 11 (04) : 1991 - 2014
  • [30] Continuous vocoder applied in deep neural network based voice conversion
    Al-Radhi, Mohammed Salah
    Csapo, Tamas Gabor
    Nemeth, Geza
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (23) : 33549 - 33572