A Romanian Language Corpus for a Commercial Text-To-Speech Application

被引:0
|
作者
Ordean, Mihai Alexandru [1 ]
Saupe, Andrei [1 ]
Ordean, Mihaela [1 ]
Silaghi, Gheorghe Cosmin [2 ]
Giurgea, Corina [1 ]
机构
[1] iQuest Technol, Str Motilor 6-8, Cluj Napoca 400001, Romania
[2] Babe Bolyai Univ, Business Informat Syst Dept, Cluj Napoca, Romania
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text and speech corpora are a prerequisite for the development of an effective commercial text-to-speech system, using the concatenative technology. Given that such a system needs to synthesize both common and domain-specific discourses, the considered corpora are of main importance. This paper presents the authors' experience in creating a corpus for the Romanian language, designed to support a concatenative TTS system, able to reproduce common and domain-specific sentences with naturalness.
引用
收藏
页码:405 / 414
页数:10
相关论文
共 50 条
  • [1] Efficient Parsing of Romanian Language for Text-to-Speech Purposes
    Saupe, Andrei
    Teodorescu, Lucian Radu
    Ordean, Mihai Alexandru
    Boldizsar, Razvan
    Ordean, Mihaela
    Silaghi, Gheorghe Cosmin
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2009, 5729 : 323 - +
  • [2] Romanian language statistics and resources for text-to-speech systems
    Stan, Adriana
    Giurgiu, Mircea
    2010 9TH INTERNATIONAL SYMPOSIUM ON ELECTRONICS AND TELECOMMUNICATIONS (ISETC), 2010, : 381 - 384
  • [3] RECENT ADVANCES IN ROMANIAN LANGUAGE TEXT-TO-SPEECH SYNTHESIS
    Burileanu, Dragos
    Negrescu, Cristian
    Surmei, Mihai
    PROCEEDINGS OF THE ROMANIAN ACADEMY SERIES A-MATHEMATICS PHYSICS TECHNICAL SCIENCES INFORMATION SCIENCE, 2010, 11 (01): : 92 - 99
  • [4] Design of a Yoruba Language Speech Corpus for the Purposes of Text-to-Speech (TTS) Synthesis
    Dagba, Theophile K.
    Aoga, John O. R.
    Fanou, Codjo C.
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2016, PT I, 2016, 9621 : 161 - 169
  • [5] Part of Speech Tagging for Romanian Text-to-Speech System
    Teodorescu, Lucian Radu
    Boldizsar, Razvan
    Ordean, Mihai
    Duma, Melania
    Detesan, Laura
    Ordean, Mihaela
    13TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2011), 2012, : 153 - 159
  • [6] Text-to-speech for Slovak language
    Caky, P
    Klimo, M
    Mihálik, I
    Mladsik, R
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2004, 3206 : 291 - 298
  • [7] Burmese Speech Corpus, Finite-State Text Normalization and Pronunciation Grammars with an Application to Text-to-Speech
    Oo, Yin May
    Wattanavekin, Theeraphol
    Li, Chenfang
    De Silva, Pasindu
    Sarin, Supheakmungkol
    Pipatsrisawat, Knot
    Jansche, Martin
    Kjartansson, Oddur
    Gutkin, Alexander
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6328 - 6339
  • [8] IndicSpeech: Text-to-Speech Corpus for Indian Languages
    Srivastava, Nimisha
    Mukhopadhyay, Rudrabha
    Prajwal, K. R.
    Jawahar, C., V
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6417 - 6422
  • [9] RyanSpeech: A Corpus for Conversational Text-to-Speech Synthesis
    Zandie, Rohola
    Mahoor, Mohammad H.
    Madsen, Julia
    Emamian, Eshrat S.
    INTERSPEECH 2021, 2021, : 2751 - 2755
  • [10] LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech
    Zen, Heiga
    Dang, Viet
    Clark, Rob
    Zhang, Yu
    Weiss, Ron J.
    Jia, Ye
    Chen, Zhifeng
    Wu, Yonghui
    INTERSPEECH 2019, 2019, : 1526 - 1530