A ANN BASED HIGH QUALITY METHOD FOR VOICE CONVERSION

Cited by: 0
Authors
Chen, Z. [1]
Zhang, L. H. [1]
Affiliations
[1] Nanjing Univ Post & Telecommun, Coll Telecommun & Informat Engn, Nanjing, Jiangsu, Peoples R China
Keywords
voice conversion; ANN; GMM; pitch conversion; transformation
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, we describe a novel method for voice conversion (VC). An artificial neural network (ANN) model is employed to perform joint spectrum and pitch conversion between speakers. Conventional methods convert spectral parameters and pitch independently, and these separate transformations lead to unsatisfactory speech quality. The main reason may be that F0 sequences are usually converted by a simple linear function. To overcome this problem, we use joint spectral and pitch parameters for both training and conversion. A comparative study of voice conversion with ANN and Gaussian mixture model (GMM) approaches is conducted. Experimental results indicate that the proposed method dramatically improves VC performance in both subjective evaluation and objective measurement.
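The abstract does not give the exact form of the linear F0 function or of the joint parameters. As a rough illustration only, the sketch below assumes the commonly used log-domain mean and variance matching for the conventional pitch baseline, and assumes that a joint parameter vector is formed by appending log-F0 to each frame's spectral vector. The function names (`log_f0_stats`, `linear_f0_convert`, `joint_features`) are hypothetical and are not taken from the paper.

```python
import math
from statistics import mean, pstdev


def log_f0_stats(f0):
    """Return (mean, std) of log-F0 over voiced frames (F0 > 0)."""
    voiced = [math.log(f) for f in f0 if f > 0]
    return mean(voiced), pstdev(voiced)


def linear_f0_convert(f0_src, src_stats, tgt_stats):
    """Assumed conventional baseline: linear mapping of log-F0.

    Matches the source log-F0 mean and standard deviation to the
    target's. Unvoiced frames (F0 <= 0) are returned as 0.0.
    """
    mu_s, sd_s = src_stats
    mu_t, sd_t = tgt_stats
    converted = []
    for f in f0_src:
        if f <= 0:
            converted.append(0.0)
        else:
            log_f = mu_t + (sd_t / sd_s) * (math.log(f) - mu_s)
            converted.append(math.exp(log_f))
    return converted


def joint_features(spectral_frames, f0):
    """Assumed joint parameters: spectral vector plus log-F0 per frame.

    Unvoiced frames get 0.0 in the pitch slot. A model trained on such
    vectors would predict spectrum and pitch together.
    """
    return [
        list(spec) + [math.log(f) if f > 0 else 0.0]
        for spec, f in zip(spectral_frames, f0)
    ]
```

Under these assumptions, the baseline moves every F0 value with the same two statistics regardless of the spectral content of the frame, whereas a model trained on the joint vectors can in principle make the pitch output depend on the spectrum.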
Pages: 4
Related papers
50 in total
  • [1] An Improved ANN Method Based on Clustering Optimization for Voice Conversion
    Chen Xiantong
    Zhang Linghua
    2014 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), VOLS 1-2, 2014, : 464 - 469
  • [2] High Quality Voice Conversion based on ISODATA Clustering Algorithm
    Li, Yanping
    Zuo, Yutao
    Yang, Zhen
    Shao, Xi
    2017 12TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND KNOWLEDGE ENGINEERING (IEEE ISKE), 2017,
  • [3] Design and Implementation of Voice Conversion System Based on GMM and ANN
    Yang, Man
    Que, Dashun
    Li, Bei
    MULTIMEDIA AND SIGNAL PROCESSING, 2012, 346 : 624 - 631
  • [4] IMPROVING VOICE QUALITY OF HMM-BASED SPEECH SYNTHESIS USING VOICE CONVERSION METHOD
    Jiao, Yishan
    Xie, Xiang
    Na, Xingyu
    Tu, Ming
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [5] Comparing ANN and GMM in a voice conversion framework
    Laskar, R. H.
    Chakrabarty, D.
    Talukdar, F. A.
    Rao, K. Sreenivasa
    Banerjee, K.
    APPLIED SOFT COMPUTING, 2012, 12 (11) : 3332 - 3342
  • [6] Runtime and Speech Quality Survey of a Voice Conversion Method
    Jokisch, Oliver
    Birhanu, Yitagessu
    Hoffmann, Ruediger
    2013 IEEE EUROCON, 2013, : 1684 - 1688
  • [7] Modeling glottal source for high quality voice conversion
    Sun, Jun
    Dai, Beiqian
    Zhang, Jian
    Xie, Yanlu
    WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 319 - 319
  • [8] A method for voice conversion based on viterbi algorithm
    Jian, Zhi-Hua
    Yang, Zhen
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2009, 37 (07): : 1470 - 1475
  • [9] Modified method for voice conversion based on GMM
    Shen, Yi
    Jian, Zhi-Hua
    Yang, Zhen
    Nanjing Youdian Daxue Xuebao (Ziran Kexue Ban)/Journal of Nanjing University of Posts and Telecommunications (Natural Science), 2007, 27 (05): : 11 - 15
  • [10] High-quality Voice Conversion Using Spectrogram-Based WaveNet Vocoder
    Chen, Kuan
    Chen, Bo
    Lai, Jiahao
    Yu, Kai
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1993 - 1997