Myanmar text-to-speech system with rule-based tone synthesis

被引:6
|
作者
Win, Kyawt Yin [1 ]
Takara, Tomio [1 ]
机构
[1] Univ Ryukyus, Dept Informat Engn, 1 Senbaru, Nishihara, Okinawa 9030213, Japan
关键词
Myanmar; Tonal languages; Text-to-speech; Tone synthesis; Normalization; Rule-based;
D O I
10.1250/ast.32.174
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We have introduced a novel Myanmar text to speech (MyanmarTTS) system with rule-based tone synthesis. Myanmar is a tonal language that possesses unique characteristics compared with other tonal languages such as Chinese, Vietnamese and Thai. Such languages have complicated fundamental-frequency (F-0) patterns of tones, and F-0 is of foremost importance. Myanmar tones are unique in their simplistic pattern related not only to F-0 but also, more specifically to duration. Myanmar tones have different durations between short-tone and long-tone groups. In accordance, we defined a tone rule employing two parameters F-0 at the center of the syllable and the syllable's duration. The rule is implemented with a linear F-0 pattern. Large variability exists in the F-0 and duration uttered by different speakers of different syllables. Hence, for tone synthesis, normalization of the F-0 and duration is important and necessary to discriminate tones. We proposed a normalization method and the effectiveness of this method was confirmed in the distribution of the F-0 and duration. The intelligibility of the synthesized tone was confirmed through listening tests with correct rates of 95.6% for male and 97.8% for female speech. As a result, we showed that the linear pattern is sufficient for Myanmar tone synthesis.
引用
收藏
页码:174 / 181
页数:8
相关论文
共 50 条
  • [31] TEXT-TO-SPEECH SYNTHESIS: A PROTOTYPE SYSTEM FOR CROATIAN LANGUAGE
    Pobar, Miran
    Martincic-Ipsic, Sanda
    Ipsic, Ivo
    ENGINEERING REVIEW, 2008, 28 (02) : 31 - 44
  • [32] HMM Based Myanmar Text to Speech System
    Thu, Ye Kyaw
    Pa, Win Pa
    Ni, Jinfu
    Shiga, Yoshinori
    Finch, Andrew
    Hori, Chiori
    Kawai, Hisashi
    Sumita, Eiichiro
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2237 - 2241
  • [33] Evaluation of Prosody in Text-to-Speech Synthesis System of Bangla
    Basu, Tulika
    Saha, Arup
    2013 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2013 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE), 2013,
  • [34] Semantic dependency and local convolution for enhancing naturalness and tone in text-to-speech synthesis
    Jiang, Chenglong
    Gao, Ying
    Ng, Wing W. Y.
    Zhou, Jiyong
    Zhong, Jinghui
    Zhen, Hongzhong
    Hu, Xiping
    NEUROCOMPUTING, 2024, 608
  • [35] Text-to-speech synthesis system with Arabic diacritic recognition system
    Rebai, Ilyes
    BenAyed, Yassine
    COMPUTER SPEECH AND LANGUAGE, 2015, 34 (01): : 43 - 60
  • [36] Text-to-speech synthesis: A complete system for the Slovenian language
    Faculty for Electrical Engineering, University of Ljubljana, Tržaška cesta 25, Ljubljana
    SI-1001, Slovenia
    J. Compt. Inf. Technol., 1 (11-19):
  • [37] Implementation and evaluation of a text-to-speech synthesis system for Turkish
    Salor, Özgül
    Pellom, Bryan
    Demirekler, Mübeccel
    EUROSPEECH 2003 - 8th European Conference on Speech Communication and Technology, 2003, : 1573 - 1576
  • [38] A framework for a Bangla concatenative text-to-speech synthesis system
    Syed, MR
    Chakrobartty, S
    Bignall, RJ
    Innovations Through Information Technology, Vols 1 and 2, 2004, : 1318 - 1320
  • [39] Slovenian text-to-speech system
    Sef, T
    ISCAS 2000: IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS - PROCEEDINGS, VOL V: EMERGING TECHNOLOGIES FOR THE 21ST CENTURY, 2000, : 41 - 44
  • [40] A Hakka text-to-speech system
    Yu, Hsiu-Min
    Hwang, Hsin-Te
    Lin, Dong-Yi
    Chen, Sin-Horng
    CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 241 - +