Current State of Text-to-Speech System ARTIC: A Decade of Research on the Field of Speech Technologies

被引：27

作者：

Tihelka, Daniel ^{[1
]}

Hanzlicek, Zdenek ^{[1
]}

Juzova, Marketa ^{[2
]}

Vit, Jakub ^{[2
]}

Matousek, Jindrich ^{[1
,2
]}

Gruber, Martin ^{[1
]}

机构：

[1] Univ West Bohemia, Fac Appl Sci, New Technol Informat Soc, Plzen, Czech Republic

[2] Univ West Bohemia, Fac Appl Sci, Dept Cybernet, Plzen, Czech Republic

来源：

TEXT, SPEECH, AND DIALOGUE (TSD 2018) | 2018年 / 11107卷

关键词：

Speech synthesis; Unit selection; Statistical-parametric synthesis; DNN; WaveNet; Hybrid synthesis; Personalized speech synthesis; Voice banking; VOICE CONSERVATION; ANNOTATION;

D O I：

10.1007/978-3-030-00794-2_40

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper provides a survey of the current state of ARTIC - the modern Czech concatenative corpus-based text-to-speech system. Through more than a decade of research & development in the field of speech technologies and applications, the system was enriched with new languages (and, as a consequence, language-dependent NLP methods), and its speech generation capabilities were significantly improved when new progressive speech generation modules (SPS, DNN, HSS) were (and are still being to) designed and incorporated into it. Also, ARTIC has to deal with various requirements on data used to generate speech from, ranging in size, quality and domain of the output speech, while there always was the requirement to achieve the highest quality in terms of both naturalness and intelligibility. Thus, the paper summarizes some of the most significant achievements and demanding tasks which had to be tackled by the system, illustrating the universality and flexibility of this Czech TTS system.

引用

页码：369 / 378

页数：10

共 50 条

[1] Current state of Czech text-to-speech system ARTIC
Matousek, Jindrich
Tihelka, Daniel
Romportl, Jan
TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2006, 4188 : 439 - 446
[2] Slovak text-to-speech synthesis in ARTIC system
Matousek, J
Tihelka, D
TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2004, 3206 : 155 - 162
[3] Measuring a decade of progress in Text-to-Speech
King, Simon
LOQUENS, 2014, 1 (01):
[4] Slovenian text-to-speech system
Sef, T
ISCAS 2000: IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS - PROCEEDINGS, VOL V: EMERGING TECHNOLOGIES FOR THE 21ST CENTURY, 2000, : 41 - 44
[5] A Hakka text-to-speech system
Yu, Hsiu-Min
Hwang, Hsin-Te
Lin, Dong-Yi
Chen, Sin-Horng
CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 241 - +
[6] A TEXT-TO-SPEECH CONVERSION SYSTEM
KLATT, DH
ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1982, 184 (SEP): : 11 - CINF
[7] Text-to-speech system for Danish
1600, Publ by Elsevier Science Publishers B.V., Amsterdam, Neth
[8] A Mandarin text-to-speech system
Hwang, SH
Chen, SH
Wang, YR
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1421 - 1424
[9] State of the Art Review on Thai Text-to-Speech System
Yimngam, Sukanya
Premchaisawadi, Wichian
Kreesuradej, Worapoj
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, 2008, : 194 - +
[10] Part of Speech Tagging for Romanian Text-to-Speech System
Teodorescu, Lucian Radu
Boldizsar, Razvan
Ordean, Mihai
Duma, Melania
Detesan, Laura
Ordean, Mihaela
13TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2011), 2012, : 153 - 159

← 1 2 3 4 5 →