Govorec (Speaker) - Slovenian text-to-speech synthesizer for various applications

Authors
Sef, T [1]
Gams, M [1]
Affiliations
[1] Jozef Stefan Inst, Dept Intelligent Syst, SI-1000 Ljubljana, Slovenia
Keywords
text-to-speech system; natural language processing; intelligent systems; telecommunication applications; voice portals
DOI
Not available
Chinese Library Classification
TP31 [Computer Software]
Discipline Classification Code
081202; 0835
Abstract
This paper presents a new text-to-speech (TTS) system called Speaker (Govorec) that can automatically convert any Slovenian text into speech. The phases of the synthesis task are performed by several independent modules operating sequentially (text analysis, prosody generation and segmental concatenation), which are pipelined together. Enhancements to the first module eliminate the weakest point of the previous synthesizer, namely the assignment of correct lexical stress to words. Greater naturalness and liveliness of the synthetic speech are achieved mainly through different transformations between the labelled speech corpus and the concrete text being synthesised. The system is used by members of the Slovenian Foundation for the Blind and Visually Impaired and was awarded the first prize for innovation in the field of life improvements for handicapped people. Currently, several leading Slovenian telecommunication companies are testing the system for providing information (e-mail, SMS, weather reports, traffic information) through mobile phones.
Pages: 270-275
Page count: 6