Current State of Text-to-Speech System ARTIC: A Decade of Research on the Field of Speech Technologies

Cited by: 27
Authors
Tihelka, Daniel [1]
Hanzlicek, Zdenek [1]
Juzova, Marketa [2]
Vit, Jakub [2]
Matousek, Jindrich [1, 2]
Gruber, Martin [1]
Affiliations
[1] Univ West Bohemia, Fac Appl Sci, New Technol Informat Soc, Plzen, Czech Republic
[2] Univ West Bohemia, Fac Appl Sci, Dept Cybernet, Plzen, Czech Republic
Source
Keywords
Speech synthesis; Unit selection; Statistical-parametric synthesis; DNN; WaveNet; Hybrid synthesis; Personalized speech synthesis; Voice banking; Voice conservation; Annotation
DOI
10.1007/978-3-030-00794-2_40
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper provides a survey of the current state of ARTIC, the modern Czech concatenative corpus-based text-to-speech system. Over more than a decade of research and development in the field of speech technologies and applications, the system has been enriched with new languages (and, as a consequence, with language-dependent NLP methods), and its speech generation capabilities have been significantly improved as new, progressive speech generation modules (SPS, DNN, HSS) were designed and incorporated into it, a process that is still ongoing. ARTIC also has to cope with varied requirements on the data from which speech is generated, which differ in size, quality, and the domain of the output speech, while the requirement to achieve the highest quality in terms of both naturalness and intelligibility has always remained. The paper thus summarizes some of the most significant achievements and demanding tasks that the system had to tackle, illustrating the universality and flexibility of this Czech TTS system.
Pages: 369-378
Number of pages: 10
Related papers
50 in total
  • [1] Current state of Czech text-to-speech system ARTIC
    Matousek, Jindrich
    Tihelka, Daniel
    Romportl, Jan
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2006, 4188 : 439 - 446
  • [2] Slovak text-to-speech synthesis in ARTIC system
    Matousek, J
    Tihelka, D
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2004, 3206 : 155 - 162
  • [3] Measuring a decade of progress in Text-to-Speech
    King, Simon
    LOQUENS, 2014, 1 (01):
  • [4] Slovenian text-to-speech system
    Sef, T
    ISCAS 2000: IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS - PROCEEDINGS, VOL V: EMERGING TECHNOLOGIES FOR THE 21ST CENTURY, 2000, : 41 - 44
  • [5] A Hakka text-to-speech system
    Yu, Hsiu-Min
    Hwang, Hsin-Te
    Lin, Dong-Yi
    Chen, Sin-Horng
    CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 241 - +
  • [6] A TEXT-TO-SPEECH CONVERSION SYSTEM
    KLATT, DH
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1982, 184 (SEP): : 11 - CINF
  • [7] Text-to-speech system for Danish
    1600, Publ by Elsevier Science Publishers B.V., Amsterdam, Neth
  • [8] A Mandarin text-to-speech system
    Hwang, SH
    Chen, SH
    Wang, YR
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1421 - 1424
  • [9] State of the Art Review on Thai Text-to-Speech System
    Yimngam, Sukanya
    Premchaisawadi, Wichian
    Kreesuradej, Worapoj
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, 2008, : 194 - +
  • [10] Part of Speech Tagging for Romanian Text-to-Speech System
    Teodorescu, Lucian Radu
    Boldizsar, Razvan
    Ordean, Mihai
    Duma, Melania
    Detesan, Laura
    Ordean, Mihaela
    13TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2011), 2012, : 153 - 159