High-quality voice conversion system based on GMM statistical parameters and RBF neural network

被引:0
|
作者
CHEN Xian-tong
ZHANG Ling-hua
机构
[1] CollegeofTelecommunicationsandInformationEngineering,NanjingUniversityofPostsandTelecommunications
关键词
D O I
暂无
中图分类号
TN912.3 [语音信号处理];
学科分类号
0711 ;
摘要
A voice conversion(VC) system was designed based on Gaussian mixture model(GMM) and radial basis function(RBF) neural network. As a voice conversion model, RBF network needs quantities of training data to improve its performance. For one speech, the networks trained by different segments of data have different transformation effects. Since trying segment by segment to obtain the best conversion effect is complex, a conversion method was proposed, that uses GMM for statistics before training RBF network to aim at the problem. The speech transformation and representation using adaptive interpolation of weighted spectrum(STRAIGHT) model is used for accurate extraction of vocal tract spectrum. Then GMM is used to classify the numerous spectral parameters. The obtained mean parameters were trained in RBF network. Experiment reveals that, the soft classification ability of GMM can promptly realize the reduction and classification of training data under the premise of ensuring the training effect. The selection complexity is decreased thereafter. Compared to the conventional RBF network training methods, this method can make the transformation of spectral parameters more effective and improve the quality of converted speech.
引用
收藏
页码:68 / 75+93 +93
页数:9
相关论文
共 50 条
  • [11] Pitch Transformation in Neural Network based Voice Conversion
    Xie, Feng-Long
    Qian, Yao
    Soong, Frank K.
    Li, Haifeng
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 197 - +
  • [12] RBF Neural Network Sliding Mode Control of a PMSG based Wind Energy Conversion System
    Boulouma, Sabri
    Belmili, Hocine
    PROCEEDINGS OF 2016 INTERNATIONAL RENEWABLE & SUSTAINABLE ENERGY CONFERENCE (IRSEC' 16), 2016, : 438 - 443
  • [13] Learning the architectures and parameters of RBF neural network based on MDL
    Liu, Meiqin
    Chen, Jida
    Cai, Zixing
    2000, Shenyang Inst Comput Technol, China (21):
  • [14] Learning the architectures and parameters of RBF neural network based on MDL
    Liu, Meiqin
    Chen, Jida
    Cai, Zixing
    Xiaoxing Weixing Jisuanji Xitong/Mini-Micro Systems, 2000, 21 (04): : 379 - 382
  • [15] Implementation of Artificial Neural Network to Predict Diabetes with High-Quality Health System
    Prakash, E. P.
    Srihari, K.
    Karthik, S.
    Kamal, M., V
    Dileep, P.
    Reddy, Bharath S.
    Mukunthan, M. A.
    Somasundaram, K.
    Jaikumar, R.
    Gayathri, N.
    Sahile, Kibebe
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [16] An intrusion detection system based on RBF neural network
    Yang, ZM
    Wei, XM
    Bi, LY
    Shi, DP
    Li, H
    PROCEEDINGS OF THE NINTH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, VOLS 1 AND 2, 2005, : 873 - 875
  • [17] Networked control system based on RBF neural network
    Zhang, Haitao
    Hu, Jinbo
    Bu, Wenshao
    Zhang, H. (zhang_haitao@163.com), 1600, Science and Engineering Research Support Society, 20 Virginia Court, Sandy Bay, Tasmania, Australia (06): : 167 - 178
  • [18] AN IMPROVED ALGORITHM OF GMM VOICE CONVERSION SYSTEM BASED ON CHANGING THE TIME-SCALE
    Zhou Ying Zhang LinghuaCollege of Telecommunications Information EngineeringNanjing University of Posts and Telecommunications Nanjing China
    Journal of Electronics(China), 2011, 28(Z1) (China) : 518 - 523
  • [19] AN IMPROVED ALGORITHM OF GMM VOICE CONVERSION SYSTEM BASED ON CHANGING THE TIME-SCALE
    Zhou Ying Zhang Linghua(College of Telecommunications & Information Engineering
    Journal of Electronics(China), 2011, (Z1) : 518 - 523
  • [20] XiaoiceSing: A High-Quality and Integrated Singing Voice Synthesis System
    Lu, Peiling
    Wu, Jie
    Luan, Jian
    Tan, Xu
    Zhou, Li
    INTERSPEECH 2020, 2020, : 1306 - 1310