Northern Thai Dialect Text to Speech

Cited by: 0

Authors:

Chao-angthong, Pannakorn [1]

Suchato, Atiwong [1]

Punyabukkana, Proadpran [1]

Affiliations:

[1] Chulalongkorn Univ, Fac Engn, Dept Comp Engn, Spoken Language Syst Res Grp, Bangkok, Thailand

Source:

PROCEEDINGS OF 2017 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (JCSSE) | 2017

Keywords:

Text to speech system; Grapheme to phoneme conversion; Northern Thai dialect; Speech corpus;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP31 [Computer software];

Discipline classification code:

081202 ; 0835 ;

Abstract:

Each dialect of the Thai language has a distinct identity associated with its accent. Although the dialects share a common standard-language origin, visitors to each region inevitably need to converse with native speakers of the local dialect. The need to communicate with people who understand only the Northern Thai Dialect (NTD) led us to develop the Northern Thai Dialect Text to Speech (NTD-TTS) system. The idea follows the same concept as a translation program: given text input in the Central Thai Dialect (CTD), the TTS system translates it and synthesizes speech output in NTD. The TTS builds on a software structure in which two components were modified: the Grapheme to Phoneme (G2P) conversion and the speech models. The NTD-G2P conversion was built with rule-based and dictionary-based approaches and was evaluated on 100 randomly selected sentences from ORCHID. It achieves a conversion accuracy of 83.19% at the syllable level and is used to build the NTD-corpus. A sentence selection procedure is presented for training the NTD speech model; the selected sentences cover 95.32% in the first percentile of the phoneme distribution in the NTD-corpus. After the speech models were connected to the TTS system, the whole system was evaluated by native speakers using the Mean Opinion Score (MOS) and syllable-level comprehension. The MOS evaluations indicated that the accent, naturalness, and intelligibility of the synthetic speech ranged from "acceptable" to "good". On the test set, the NTD-TTS system scored 3.73 for accent, 3.68 for naturalness, and 3.63 for intelligibility, with a comprehension rate of 97.16% among native NTD listeners.
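
As an illustration of the G2P step described in the abstract, the following is a minimal sketch of one common way to combine a dictionary-based lookup with a rule-based fallback, together with a syllable-level accuracy measure. It is not the authors' implementation: the lexicon entries, the `rule_based` fallback, and all function names are hypothetical placeholders, and the strings are dummy tokens, not real CTD or NTD transcriptions.

```python
from typing import Callable, Dict, List

# Hypothetical toy lexicon: placeholder syllables mapped to placeholder
# phoneme strings. These are NOT real CTD or NTD transcriptions.
LEXICON: Dict[str, str] = {"syl_a": "PH_A", "syl_b": "PH_B"}


def rule_based(syllable: str) -> str:
    """Placeholder fallback rule (assumption): upper-case the syllable.

    A real system would apply letter-to-sound rules here.
    """
    return syllable.upper()


def g2p(
    syllables: List[str],
    lexicon: Dict[str, str],
    fallback: Callable[[str], str],
) -> List[str]:
    """Look each syllable up in the lexicon; use the rule when it is missing."""
    return [lexicon[s] if s in lexicon else fallback(s) for s in syllables]


def syllable_accuracy(predicted: List[str], reference: List[str]) -> float:
    """Percentage of syllables whose predicted phonemes match the reference."""
    if len(predicted) != len(reference):
        raise ValueError("predicted and reference must have the same length")
    if not reference:
        raise ValueError("reference must not be empty")
    correct = sum(p == r for p, r in zip(predicted, reference))
    return 100.0 * correct / len(reference)


predicted = g2p(["syl_a", "syl_c", "syl_b", "syl_d"], LEXICON, rule_based)
reference = ["PH_A", "SYL_C", "PH_B", "OTHER"]
accuracy = syllable_accuracy(predicted, reference)
```

In this toy run, two syllables are resolved by the lexicon and two by the fallback rule; one fallback output disagrees with the reference, so three of four syllables match. The 83.19% figure in the abstract is a measurement of the same syllable-level kind, computed on the ORCHID sample.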


Pages: 6
