Govorec (Speaker) - Slovenian text-to-speech synthesizer for various applications

Cited by: 0

Authors:

Sef, T [1]

Gams, M [1]

Affiliations:

[1] Jozef Stefan Inst, Dept Intelligent Syst, SI-1000 Ljubljana, Slovenia

Source:

6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL III, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING I | 2002

Keywords:

text-to-speech system; natural language processing; intelligent systems; telecommunication applications; voice portals;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP31 [Computer software];

Discipline classification code:

081202 ; 0835 ;

Abstract:

This paper presents a new text-to-speech (TTS) system called Speaker (Govorec) that can automatically convert any Slovenian text into speech. The phases of the synthesis task are performed by several independent modules (text analysis, prosody generation and segmental concatenation) that operate sequentially as a pipeline. Enhancements to the first module have eliminated the weakest point of the previous synthesizer, namely the correct assignment of lexical stress to words. Higher naturalness and agitation of the synthetic speech are achieved mainly through different transformations between the labelled speech corpus and the concrete text being synthesised. The system is used by members of the Slovenian Foundation for the Blind and Visually Impaired and was awarded the first prize for innovation in the field of life improvements for handicapped people. Currently, several leading Slovenian telecommunication companies are testing the system for providing information (e-mail, SMS, weather reports, traffic information) through mobile phones.


Pages: 270-275

Page count: 6

