Myanmar text-to-speech system with rule-based tone synthesis

被引:6
|
作者
Win, Kyawt Yin [1 ]
Takara, Tomio [1 ]
机构
[1] Univ Ryukyus, Dept Informat Engn, 1 Senbaru, Nishihara, Okinawa 9030213, Japan
关键词
Myanmar; Tonal languages; Text-to-speech; Tone synthesis; Normalization; Rule-based;
D O I
10.1250/ast.32.174
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We have introduced a novel Myanmar text to speech (MyanmarTTS) system with rule-based tone synthesis. Myanmar is a tonal language that possesses unique characteristics compared with other tonal languages such as Chinese, Vietnamese and Thai. Such languages have complicated fundamental-frequency (F-0) patterns of tones, and F-0 is of foremost importance. Myanmar tones are unique in their simplistic pattern related not only to F-0 but also, more specifically to duration. Myanmar tones have different durations between short-tone and long-tone groups. In accordance, we defined a tone rule employing two parameters F-0 at the center of the syllable and the syllable's duration. The rule is implemented with a linear F-0 pattern. Large variability exists in the F-0 and duration uttered by different speakers of different syllables. Hence, for tone synthesis, normalization of the F-0 and duration is important and necessary to discriminate tones. We proposed a normalization method and the effectiveness of this method was confirmed in the distribution of the F-0 and duration. The intelligibility of the synthesized tone was confirmed through listening tests with correct rates of 95.6% for male and 97.8% for female speech. As a result, we showed that the linear pattern is sufficient for Myanmar tone synthesis.
引用
收藏
页码:174 / 181
页数:8
相关论文
共 50 条
  • [41] A TEXT-TO-SPEECH CONVERSION SYSTEM
    KLATT, DH
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1982, 184 (SEP): : 11 - CINF
  • [42] Text-to-speech system for Danish
    1600, Publ by Elsevier Science Publishers B.V., Amsterdam, Neth
  • [43] A Mandarin text-to-speech system
    Hwang, SH
    Chen, SH
    Wang, YR
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1421 - 1424
  • [44] Multilingual text analysis for text-to-speech synthesis
    Bell Lab, Murray Hill, United States
    International Conference on Spoken Language Processing, ICSLP, Proceedings, 1996, 3 : 1365 - 1368
  • [45] Intensity Modeling for Syllable Based Text-to-Speech Synthesis
    Reddy, V. Ramu
    Rao, K. Sreenivasa
    CONTEMPORARY COMPUTING, 2012, 306 : 106 - 117
  • [46] Multilingual text analysis for text-to-speech synthesis
    Sproat, R
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1365 - 1368
  • [47] A Rule-Based Kurdish Text Transliteration System
    Ahmadi, Sina
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2019, 18 (02)
  • [48] A Modular System for Rule-based Text Categorisation
    Del Tredici, Marco
    Nissim, Malvina
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014,
  • [49] Text analysis for the Slovenian text-to-speech system
    Sef, T
    ICECS 2001: 8TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS AND SYSTEMS, VOLS I-III, CONFERENCE PROCEEDINGS, 2001, : 1355 - 1358
  • [50] Residual-based speech modification algorithms for text-to-speech synthesis
    Edgington, M
    Lowry, A
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1425 - 1428