Exemplar-based speech waveform generation

被引:0
|
作者
Watts, Oliver [1 ]
Valentini-Botinhao, Cassia [1 ]
Espic, Felipe [1 ]
King, Simon [1 ]
机构
[1] Univ Edinburgh, Ctr Speech Technol Res, Edinburgh, Midlothian, Scotland
基金
英国工程与自然科学研究理事会;
关键词
speech synthesis; vocoder; unit selection;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a simple but effective method for generating speech waveforms by selecting small units of stored speech to match a low-dimensional target representation. The method is designed as a drop-in replacement for the vocoder in a deep neural network-based text-to-speech system. Most previous work on hybrid unit selection waveform generation relies on phonetic annotation for determining unit boundaries, or for specifying target cost, or for candidate preselection. In contrast, our waveform generator requires no phonetic information, annotation, or alignment. Unit boundaries are determined by epochs, and spectral analysis provides representations which are compared directly with target features at runtime. As in unit selection, we minimise a combination of target cost and join cost, but find that greedy left-to-right nearest-neighbour search gives similar results to dynamic programming. The method is fast and can generate the waveform incrementally. We use publicly available data and provide a permissively-licensed open source toolkit for reproducing our results.
引用
收藏
页码:2022 / 2026
页数:5
相关论文
共 50 条
  • [41] Geometrically Guided Exemplar-Based Inpainting
    Cao, Frederic
    Gousseau, Yann
    Masnou, Simon
    Perez, Patrick
    SIAM JOURNAL ON IMAGING SCIENCES, 2011, 4 (04): : 1143 - 1179
  • [42] PROTOS - AN EXEMPLAR-BASED LEARNING APPRENTICE
    BAREISS, ER
    PORTER, BW
    WIER, CC
    INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1988, 29 (05): : 549 - 561
  • [43] Exemplar-based approaches to the syllable joint
    Proell, Simon
    Freienstein, Jan-Claas
    Ernst, Oliver
    ZEITSCHRIFT FUR GERMANISTISCHE LINGUISTIK, 2016, 44 (02): : 149 - 171
  • [44] A survey of exemplar-based texture synthesis
    Raad, Lara
    Davy, Axel
    Desolneux, Agnes
    Morel, Jean-Michel
    ANNALS OF MATHEMATICAL SCIENCES AND APPLICATIONS, 2018, 3 (01) : 89 - 148
  • [45] EXEMPLAR-BASED MODEL OF SOCIAL JUDGMENT
    SMITH, ER
    ZARATE, MA
    PSYCHOLOGICAL REVIEW, 1992, 99 (01) : 3 - 21
  • [46] Recovery guarantees for exemplar-based clustering
    Nellore, Abhinav
    Ward, Rachel
    INFORMATION AND COMPUTATION, 2015, 245 : 165 - 180
  • [47] Optimising Data for Exemplar-Based Inpainting
    Karos, Lena
    Bheed, Pinak
    Peter, Pascal
    Weickert, Joachim
    ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, ACIVS 2018, 2018, 11182 : 547 - 558
  • [48] Estimating uncertainty to improve exemplar-based feature enhancement for noise robust speech recognition
    1600, Institute of Electrical and Electronics Engineers Inc., United States (22):
  • [49] EXEMPLAR-BASED LARGE VOCABULARY SPEECH RECOGNITION USING K-NEAREST NEIGHBORS
    Xu, Yanbo
    Siohan, Olivier
    Simcha, David
    Kumar, Sanjiv
    Liao, Hank
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5167 - 5171
  • [50] Estimating Uncertainty to Improve Exemplar-Based Feature Enhancement for Noise Robust Speech Recognition
    Kallasjoki, Heikki
    Gemmeke, Jort F.
    Palomaki, Kalle J.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (02) : 368 - 380