Do text-to-speech synthesisers pronounce correctly? A preliminary study

Cited by: 0

Authors:

Evans, D. G. [1]

Draffan, E. A.

James, A.

Blenkhorn, P.

Affiliations:

[1] Univ Manchester, Sch Informat, Manchester, Lancs, England (Evans, Draffan & Blenkhorn)

[2] iANSYST Ltd, Cambridge, England (James)

Source:

COMPUTERS HELPING PEOPLE WITH SPECIAL NEEDS, PROCEEDINGS | 2006 / Vol. 4061

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

This paper evaluates four commercial text-to-speech synthesisers used by dyslexic people to listen to and proofread text. Two evaluators listened to 704 common English words and determined whether each word was correctly pronounced. Counting a word as incorrect only where both evaluators agreed that it was mispronounced, the proportion of correct pronunciations for the four synthesisers is in the range 98.9% to 99.6% of the 704 words. The evaluators also listened to the same synthesisers speaking phrases that contained 44 pairs of homographs and determined whether each instance of a homograph was correctly spoken. The level of correctness for the four synthesisers ranged from 76.3% to 91.3%.
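The scoring rule described in the abstract can be illustrated with a minimal sketch. It assumes one reading of the agreement rule: a word counts as incorrect only when both evaluators mark it incorrect. The function name `proportion_correct` and the judgement lists are hypothetical and are not taken from the paper; the data below is made up purely to exercise the arithmetic on 704 items.

```python
def proportion_correct(judgements_a, judgements_b):
    """Return the fraction of items counted as correctly pronounced.

    Each argument is a list of booleans, one per word, where True means
    that evaluator judged the word correctly pronounced. A word counts
    as incorrect only when BOTH evaluators marked it incorrect (an
    assumed reading of the abstract, not the authors' published code).
    """
    if len(judgements_a) != len(judgements_b):
        raise ValueError("both evaluators must judge the same items")
    if not judgements_a:
        raise ValueError("no items to score")
    agreed_incorrect = sum(
        1 for a, b in zip(judgements_a, judgements_b) if not a and not b
    )
    return (len(judgements_a) - agreed_incorrect) / len(judgements_a)


# Hypothetical judgements for 704 words: the evaluators agree on 8
# mispronunciations, and evaluator A alone flags one further word.
a = [False] * 8 + [False] + [True] * 695
b = [False] * 8 + [True] + [True] * 695
print(f"{proportion_correct(a, b):.1%}")
```

Under this rule the word flagged by only one evaluator still counts as correct, so the score depends only on the number of agreed errors. The abstract does not report error counts, so the 8 used here is an illustrative value only.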


Pages: 855-862

Page count: 8
