Govorec (Speaker) - Slovenian text-to-speech synthesizer for various applications

被引：0

作者：

Sef, T ^{[1
]}

Gams, M ^{[1
]}

机构：

[1] Jozef Stefan Inst, Dept Intelligent Syst, SI-1000 Ljubljana, Slovenia

来源：

6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL III, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING I | 2002年

关键词：

text-to-speech system; natural language processing; intelligent systems; telecommunication applications; voice portals;

D O I：

暂无

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

This paper presents a new text-to-speech (TTS) system called Speaker (Govorec) that is capable of automatic conversion of any Slovenian text into speech. The different phases of the synthesis task are performed by several sequentially operating independent modules (text analysis, prosody generation and segmental concatenation), which are pipelined together. With enhancements to the first module the weakest point of previous synthesizer has been eliminated, that is the correct lexical stress assignment of words. Higher naturalness and agitation of synthetic speech is achieved mainly with different transformations between labelled speech corpus and concrete text, which is synthesised. The system is used by members of the Slovenian Foundation for the Blind and Visually impaired and was awarded with tile first price for innovation in the field of life improvements for handicapped people. Currently, several leading Slovenian telecommunication companies are testing the system for providing information (e-mail, SMS, weather reports, traffic information) through mobile phones.

引用

页码：270 / 275

页数：6

共 50 条

[41] An RNN-based prosodic information synthesizer for Mandarin text-to-speech
Chen, SH
Hwang, SH
Wang, YR
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (03): : 226 - 239
[42] Using text-to-speech processors in embedded applications
Ibrahim, Dogan
ELECTRONICS WORLD, 2017, 123 (1975): : 14 - 16
[43] Using text-to-speech processors in embedded applications
DR Ibrahim, Dogan, 1600, Nexus Media Communications Ltd. (123):
[44] The Laureate text-to-speech system - Architecture and applications
Page, JH
Breen, AP
BT TECHNOLOGY JOURNAL, 1996, 14 (01): : 57 - 67
[45] TEXT-TO-SPEECH APPLICATIONS TO DEVELOP EDUCATIONAL MATERIALS
Sanchis, Raquel
Andres, Beatriz
Poler, Raul
12TH INTERNATIONAL TECHNOLOGY, EDUCATION AND DEVELOPMENT CONFERENCE (INTED), 2018, : 6085 - 6093
[46] Training Multi-Speaker Neural Text-to-Speech Systems using Speaker-Imbalanced Speech Corpora
Luong, Hieu-Thi
Wang, Xin
Yamagishi, Junichi
Nishizawa, Nobuyuki
INTERSPEECH 2019, 2019, : 1303 - 1307
[47] Synthesizing various speaking styles in a text-to-speech system
Abe, Masanobu
NTT R and D, 1996, 45 (10): : 1019 - 1025
[48] SYNTHE-SEES: FACE BASED TEXT-TO-SPEECH FOR VIRTUAL SPEAKER
Park, Jae Hyun
Maeng, Joon-Gyu
Bak, TaeJun
Jo, Young-Sun
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2024), 2024, : 10321 - 10325
[49] Investigation of Input Alphabets of End-to-End Lithuanian Text-to-Speech Synthesizer
Kasparaitis, Pijus
Antanavicius, Danielius
BALTIC JOURNAL OF MODERN COMPUTING, 2023, 11 (02): : 285 - 296
[50] Adjusting Pleasure-Arousal-Dominance for Continuous Emotional Text-to-speech Synthesizer
Rabiee, Azam
Kim, Tae-Ho
Lee, Soo-Young
INTERSPEECH 2019, 2019, : 3693 - 3694

← 1 2 3 4 5 →