Govorec (Speaker) - Slovenian text-to-speech synthesizer for various applications

Cited by: 0
Authors
Sef, T [1]
Gams, M [1]
Affiliation
[1] Jozef Stefan Inst, Dept Intelligent Syst, SI-1000 Ljubljana, Slovenia
Keywords
text-to-speech system; natural language processing; intelligent systems; telecommunication applications; voice portals;
DOI
Not available
Chinese Library Classification number
TP31 [Computer software];
Discipline classification code
081202 ; 0835 ;
Abstract
This paper presents a new text-to-speech (TTS) system called Speaker (Govorec) that can automatically convert any Slovenian text into speech. The phases of the synthesis task are performed by several independent modules (text analysis, prosody generation and segmental concatenation) that operate sequentially in a pipeline. Enhancements to the first module eliminate the weakest point of the previous synthesizer, namely lexical stress assignment. Higher naturalness and agitation of the synthetic speech are achieved mainly through different transformations between the labelled speech corpus and the concrete text being synthesised. The system is used by members of the Slovenian Foundation for the Blind and Visually Impaired and was awarded the first prize for innovation in the field of life improvements for handicapped people. Currently, several leading Slovenian telecommunication companies are testing the system for providing information (e-mail, SMS, weather reports, traffic information) through mobile phones.
Pages: 270-275
Number of pages: 6
Related papers
50 in total
  • [41] An RNN-based prosodic information synthesizer for Mandarin text-to-speech
    Chen, SH
    Hwang, SH
    Wang, YR
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (03): 226 - 239
  • [42] Using text-to-speech processors in embedded applications
    Ibrahim, Dogan
    ELECTRONICS WORLD, 2017, 123 (1975): 14 - 16
  • [43] Using text-to-speech processors in embedded applications
    DR Ibrahim, Dogan, 1600, Nexus Media Communications Ltd. (123):
  • [44] The Laureate text-to-speech system - Architecture and applications
    Page, JH
    Breen, AP
    BT TECHNOLOGY JOURNAL, 1996, 14 (01): 57 - 67
  • [45] TEXT-TO-SPEECH APPLICATIONS TO DEVELOP EDUCATIONAL MATERIALS
    Sanchis, Raquel
    Andres, Beatriz
    Poler, Raul
    12TH INTERNATIONAL TECHNOLOGY, EDUCATION AND DEVELOPMENT CONFERENCE (INTED), 2018: 6085 - 6093
  • [46] Training Multi-Speaker Neural Text-to-Speech Systems using Speaker-Imbalanced Speech Corpora
    Luong, Hieu-Thi
    Wang, Xin
    Yamagishi, Junichi
    Nishizawa, Nobuyuki
    INTERSPEECH 2019, 2019: 1303 - 1307
  • [47] Synthesizing various speaking styles in a text-to-speech system
    Abe, Masanobu
    NTT R and D, 1996, 45 (10): 1019 - 1025
  • [48] SYNTHE-SEES: FACE BASED TEXT-TO-SPEECH FOR VIRTUAL SPEAKER
    Park, Jae Hyun
    Maeng, Joon-Gyu
    Bak, TaeJun
    Jo, Young-Sun
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2024), 2024: 10321 - 10325
  • [49] Investigation of Input Alphabets of End-to-End Lithuanian Text-to-Speech Synthesizer
    Kasparaitis, Pijus
    Antanavicius, Danielius
    BALTIC JOURNAL OF MODERN COMPUTING, 2023, 11 (02): 285 - 296
  • [50] Adjusting Pleasure-Arousal-Dominance for Continuous Emotional Text-to-speech Synthesizer
    Rabiee, Azam
    Kim, Tae-Ho
    Lee, Soo-Young
    INTERSPEECH 2019, 2019: 3693 - 3694