A Romanian Language Corpus for a Commercial Text-To-Speech Application

Cited by: 0

Authors:

Ordean, Mihai Alexandru [1]

Saupe, Andrei [1]

Ordean, Mihaela [1]

Silaghi, Gheorghe Cosmin [2]

Giurgea, Corina [1]

Affiliations:

[1] iQuest Technol, Str Motilor 6-8, Cluj Napoca 400001, Romania

[2] Babeș-Bolyai Univ, Business Informat Syst Dept, Cluj Napoca, Romania

Source:

TEXT, SPEECH AND DIALOGUE, TSD 2012 | 2012 / Vol. 7499

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Text and speech corpora are a prerequisite for developing an effective commercial text-to-speech (TTS) system based on concatenative synthesis. Because such a system must synthesize both common and domain-specific discourse, the choice of corpora is of prime importance. This paper presents the authors' experience in creating a Romanian-language corpus designed to support a concatenative TTS system that can reproduce common and domain-specific sentences naturally.

Pages: 405-414

Page count: 10
