Myanmar text-to-speech system with rule-based tone synthesis

Cited by: 6

Authors:

Win, Kyawt Yin [1]

Takara, Tomio [1]

Affiliation:

[1] Univ Ryukyus, Dept Informat Engn, 1 Senbaru, Nishihara, Okinawa 9030213, Japan

Source:

ACOUSTICAL SCIENCE AND TECHNOLOGY | 2011 / Vol. 32 / No. 05

Keywords:

Myanmar; Tonal languages; Text-to-speech; Tone synthesis; Normalization; Rule-based;

DOI:

10.1250/ast.32.174

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

We introduce a novel Myanmar text-to-speech (MyanmarTTS) system with rule-based tone synthesis. Myanmar is a tonal language whose characteristics differ from those of other tonal languages such as Chinese, Vietnamese, and Thai. In those languages, tones have complicated fundamental-frequency (F0) patterns, and F0 is of foremost importance. Myanmar tones, by contrast, follow a simple pattern that depends not only on F0 but also, more specifically, on duration: the short-tone and long-tone groups differ in duration. Accordingly, we define a tone rule with two parameters: F0 at the center of the syllable and the syllable's duration. The rule is implemented with a linear F0 pattern. F0 and duration vary widely across speakers and across syllables, so normalizing both is necessary to discriminate tones in tone synthesis. We propose a normalization method and confirm its effectiveness from the resulting distributions of F0 and duration. Listening tests confirmed the intelligibility of the synthesized tones, with correct rates of 95.6% for male speech and 97.8% for female speech. These results show that a linear pattern is sufficient for Myanmar tone synthesis.
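
The two-parameter tone rule described in the abstract can be illustrated with a minimal sketch. The abstract does not give the normalization formula or the slope of the linear F0 pattern, so everything specific below is an assumption: per-speaker z-scoring stands in for the paper's normalization method, and the function names, the slope_hz_per_ms parameter, and the 5 ms frame step are hypothetical.

```python
from statistics import mean, stdev


def z_normalize(values):
    """Z-score a list of measurements taken from one speaker.

    Assumption: a stand-in for the paper's normalization method,
    which the abstract does not specify.
    """
    m = mean(values)
    s = stdev(values)  # sample standard deviation; needs at least 2 values
    return [(v - m) / s for v in values]


def linear_f0_contour(f0_center_hz, duration_ms, slope_hz_per_ms=0.0, step_ms=5.0):
    """Sample a straight-line F0 track for one syllable.

    The line passes through f0_center_hz at the syllable midpoint.
    slope_hz_per_ms and step_ms are hypothetical parameters.
    """
    n_frames = int(duration_ms // step_ms) + 1
    midpoint_ms = duration_ms / 2.0
    return [
        f0_center_hz + slope_hz_per_ms * (i * step_ms - midpoint_ms)
        for i in range(n_frames)
    ]


if __name__ == "__main__":
    # Hypothetical center-F0 values (Hz) measured from one speaker.
    print(z_normalize([100.0, 120.0, 140.0]))
    # A short falling contour: 100 ms syllable, center F0 of 120 Hz.
    print(linear_f0_contour(120.0, 100.0, slope_hz_per_ms=-0.2, step_ms=50.0))
```

In this sketch, a tone is characterized only by its center F0 and duration, so a short tone and a long tone with the same center F0 differ only in the number of frames generated.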


Pages: 174-181

Page count: 8

