On the Construction of Unit Databanks for Text-to-Speech Systems

被引:0
|
作者
Latsch, Vagner L. [1 ]
Netto, Sergio L. [1 ]
机构
[1] UFRJ, COPPE, Elect Engn Program, BR-21941972 Rio De Janeiro, Brazil
关键词
Speech signal processing; speech synthesis; text-to-speech;
D O I
暂无
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
This work deals with one stage in the development of a text-to-speech (TTS) system, which demands a great amount of time and effort, and is strongly related to the resulting speech quality: The determination of the speech-unit databank. For that matter, we present a software tool, the so-called Editor, integrating all major steps in the database determination in a single environment. The whole process includes recording, segmentation, and labeling of speech units to be concatenated in the time domain. The Editor includes a low-cost and precise method for determining the pitch marks, utilizing an auxiliary signal obtained from a contact (throat) microphone. For the phonetic speech labeling, we revise an algorithm for acoustic segmentation, which yields interesting results when proper operation conditions are imposed. The result is a simplified procedure for creating a complete unit database, fully integrated into a single and user-friendly system.
引用
收藏
页码:340 / 343
页数:4
相关论文
共 50 条
  • [21] TEXT-TO-SPEECH SYNTHESIS
    SPROAT, RW
    OLIVE, JP
    AT&T TECHNICAL JOURNAL, 1995, 74 (02): : 35 - 44
  • [22] The Art of Text-to-Speech
    Lindquist, Benjamin
    CRITICAL INQUIRY, 2024, 50 (02) : 225 - 251
  • [23] Text-to-speech for customers
    不详
    EXPERT SYSTEMS, 1998, 15 (01) : 66 - 66
  • [24] Globally optimal training of unit boundaries in unit selection text-to-speech synthesis
    Bellegarda, Jerome R.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (03): : 957 - 965
  • [25] Software text-to-speech
    Hallahan W.I.
    International Journal of Speech Technology, 1997, 1 (2) : 121 - 134
  • [26] Text processing techniques for text-to-speech conversion systems to enhance the quality of synthesized speech
    ATR Interpreting Telecommunications, Research Lab
    NTT R&D, 10 (1011-1018):
  • [27] Objective evaluation methods for Chinese Text-To-Speech systems
    Zhang, Teng
    Chen, Zhipeng
    Wu, Ji
    Lail, Sam
    Lei, Wenhui
    Isert, Carsten
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 332 - 336
  • [28] Experiments with training corpora for statistical text-to-speech systems
    Podsiadlo, Monika
    Ungureanu, Victor
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2002 - 2006
  • [29] Building Text-to-Speech Systems for Resource Poor Languages
    Samsudin, Nur-Hana
    Lee, Mark
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 3327 - 3334
  • [30] Development of Prototype Text-to-Speech Systems for Northern Sotho
    Oosthuizen, H. J.
    Phihlela, S. T.
    Manamela, M. J. D.
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1348 - 1351