High-quality voice conversion system based on GMM statistical parameters and RBF neural network

Cited by: 0
Authors
CHEN Xian-tong
ZHANG Ling-hua
Affiliation
[1] College of Telecommunications and Information Engineering, Nanjing University of Posts and Telecommunications
DOI
Not available
Chinese Library Classification number
TN912.3 [Speech signal processing]
Discipline classification code
0711
Abstract
A voice conversion (VC) system was designed based on a Gaussian mixture model (GMM) and a radial basis function (RBF) neural network. As a voice conversion model, an RBF network needs a large amount of training data to perform well. For the same speech, networks trained on different segments of the data produce different conversion results. Because searching segment by segment for the best conversion result is complex, a conversion method is proposed that uses a GMM to compute statistics of the data before the RBF network is trained. The speech transformation and representation using adaptive interpolation of weighted spectrum (STRAIGHT) model is used to extract the vocal tract spectrum accurately. A GMM is then used to classify the large number of spectral parameters, and the resulting mean parameters are used to train the RBF network. Experiments show that the soft classification ability of the GMM quickly reduces and classifies the training data while preserving the training effect, which in turn lowers the complexity of data selection. Compared with conventional RBF network training methods, this method transforms the spectral parameters more effectively and improves the quality of the converted speech.
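The pipeline described in the abstract (GMM statistics first, then RBF training on the component means) can be illustrated with a minimal sketch. Everything specific here is an assumption rather than the authors' implementation: synthetic vectors stand in for time-aligned STRAIGHT spectral frames, the GMM is diagonal-covariance and fitted on joint source-target vectors, and the helper names `fit_diag_gmm` and `rbf_features`, the component count, and the width heuristic are all hypothetical.

```python
import numpy as np

def fit_diag_gmm(X, k, n_iter=50, seed=0):
    """Fit a diagonal-covariance GMM with plain EM (illustrative, not the paper's code)."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    means = X[rng.choice(n, size=k, replace=False)].copy()
    variances = np.tile(X.var(axis=0) + 1e-6, (k, 1))
    weights = np.full(k, 1.0 / k)
    for _ in range(n_iter):
        # E-step: soft assignment (responsibility) of every frame to every component.
        diff = X[:, None, :] - means[None, :, :]
        log_p = -0.5 * (np.sum(diff ** 2 / variances, axis=2)
                        + np.sum(np.log(2.0 * np.pi * variances), axis=1))
        log_p = log_p + np.log(weights)
        log_p -= log_p.max(axis=1, keepdims=True)
        resp = np.exp(log_p)
        resp /= resp.sum(axis=1, keepdims=True)
        # M-step: re-estimate weights, means, and variances from the soft counts.
        nk = resp.sum(axis=0) + 1e-12
        weights = nk / n
        means = (resp.T @ X) / nk[:, None]
        diff = X[:, None, :] - means[None, :, :]
        variances = (resp[:, :, None] * diff ** 2).sum(axis=0) / nk[:, None] + 1e-6
    return weights, means, variances

def rbf_features(X, centers, width):
    """Gaussian hidden-layer activations of an RBF network."""
    d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return np.exp(-d2 / (2.0 * width ** 2))

# Synthetic stand-ins for aligned source/target spectral frames (assumption).
rng = np.random.default_rng(0)
src = rng.normal(size=(600, 4))
tgt = 1.5 * np.tanh(src) + 0.3 + 0.05 * rng.normal(size=src.shape)

# Step 1: reduce 600 frame pairs to 8 component means via the GMM.
weights, means, _ = fit_diag_gmm(np.hstack([src, tgt]), k=8)
src_means, tgt_means = means[:, :4], means[:, 4:]

# Step 2: train the RBF output layer on the means only (least squares).
dists = np.sqrt(((src_means[:, None] - src_means[None]) ** 2).sum(axis=2))
width = dists[dists > 0].mean()  # heuristic kernel width (assumption)
W = np.linalg.lstsq(rbf_features(src_means, src_means, width),
                    tgt_means, rcond=None)[0]

# Step 3: convert every source frame with the trained network.
converted = rbf_features(src, src_means, width) @ W
print("RMSE on synthetic data:", np.sqrt(np.mean((converted - tgt) ** 2)))
```

In this sketch the RBF network sees only 8 training pairs instead of 600 frames, which mirrors the data-reduction idea in the abstract; the accuracy on real speech would depend on the feature extraction, alignment, and model sizes, none of which are specified here.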
Pages: 68-75, 93
Number of pages: 9