On the Construction of Unit Databanks for Text-to-Speech Systems

被引：0

作者：

Latsch, Vagner L. ^{[1
]}

Netto, Sergio L. ^{[1
]}

机构：

[1] UFRJ, COPPE, Elect Engn Program, BR-21941972 Rio De Janeiro, Brazil

来源：

PROCEEDINGS OF THE IEEE INTERNATIONAL TELECOMMUNICATIONS SYMPOSIUM, VOLS 1 AND 2 | 2006年

关键词：

Speech signal processing; speech synthesis; text-to-speech;

D O I：

暂无

中图分类号：

TN [电子技术、通信技术];

学科分类号：

0809 ;

摘要：

This work deals with one stage in the development of a text-to-speech (TTS) system, which demands a great amount of time and effort, and is strongly related to the resulting speech quality: The determination of the speech-unit databank. For that matter, we present a software tool, the so-called Editor, integrating all major steps in the database determination in a single environment. The whole process includes recording, segmentation, and labeling of speech units to be concatenated in the time domain. The Editor includes a low-cost and precise method for determining the pitch marks, utilizing an auxiliary signal obtained from a contact (throat) microphone. For the phonetic speech labeling, we revise an algorithm for acoustic segmentation, which yields interesting results when proper operation conditions are imposed. The result is a simplified procedure for creating a complete unit database, fully integrated into a single and user-friendly system.

引用

页码：340 / 343

页数：4

共 50 条

[21] TEXT-TO-SPEECH SYNTHESIS
SPROAT, RW
OLIVE, JP
AT&T TECHNICAL JOURNAL, 1995, 74 (02): : 35 - 44
[22] The Art of Text-to-Speech
Lindquist, Benjamin
CRITICAL INQUIRY, 2024, 50 (02) : 225 - 251
[23] Text-to-speech for customers
不详
EXPERT SYSTEMS, 1998, 15 (01) : 66 - 66
[24] Globally optimal training of unit boundaries in unit selection text-to-speech synthesis
Bellegarda, Jerome R.
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (03): : 957 - 965
[25] Software text-to-speech
Hallahan W.I.
International Journal of Speech Technology, 1997, 1 (2) : 121 - 134
[26] Text processing techniques for text-to-speech conversion systems to enhance the quality of synthesized speech
ATR Interpreting Telecommunications, Research Lab
NTT R&D, 10 (1011-1018):
[27] Objective evaluation methods for Chinese Text-To-Speech systems
Zhang, Teng
Chen, Zhipeng
Wu, Ji
Lail, Sam
Lei, Wenhui
Isert, Carsten
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 332 - 336
[28] Experiments with training corpora for statistical text-to-speech systems
Podsiadlo, Monika
Ungureanu, Victor
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2002 - 2006
[29] Building Text-to-Speech Systems for Resource Poor Languages
Samsudin, Nur-Hana
Lee, Mark
LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 3327 - 3334
[30] Development of Prototype Text-to-Speech Systems for Northern Sotho
Oosthuizen, H. J.
Phihlela, S. T.
Manamela, M. J. D.
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1348 - 1351

← 1 2 3 4 5 →