Do text-to-speech synthesisers pronounce correctly? A preliminary study

被引:0
|
作者
Evans, D. G. [1 ]
Draffan, E. A.
James, A.
Blenkborn, P.
机构
[1] Univ Manchester, Evans Draffan & Blenkhorn Sch Informat, Manchester, Lancs, England
[2] James IanSyst Ltd, Cambridge, England
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper evaluates 4 commercial text-to-speech synthesisers used by dyslexic people to listen to and proof read text. Two evaluators listened to 704 common English words and determined whether the words were correctly pronounced or not. Where the evaluators agree on incorrect pronunciation, the proportion of correct pronunciations for the four synthesisers is in the range 98.9% to 99.6% of the 704 words. The evaluators also listened to the same synthesisers speaking phrases in which there were 44 pairs of homographs and determined whether each instance of the homograph was correctly spoken or not. The level of correctness for the four synthesisers ranged from 76.3% to 91.3%.
引用
收藏
页码:855 / 862
页数:8
相关论文
共 50 条
  • [1] Beyond intelligibility - The performance of text-to-speech synthesisers
    Johnston, RD
    BT TECHNOLOGY JOURNAL, 1996, 14 (01): : 100 - 111
  • [2] Beyond intelligibility - the performance of text-to-speech synthesisers
    Johnston, R.D.
    British Telecom technology journal, 1996, 14 (01): : 100 - 111
  • [3] ACRONYMS, DO WE SPELL THEM OR PRONOUNCE THEM AS WORDS - GUIDELINES FOR A TEXT-TO-SPEECH CONVERTER
    BOULADEMAREUIL, P
    LINGUISTIQUE, 1995, 31 (01): : 93 - 103
  • [4] Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech
    Jiang, Ziyue
    Su, Zhe
    Zhao, Zhou
    Yang, Qian
    Ren, Yi
    Liu, Jinglin
    Ye, Zhenhui
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [5] Generic Indic Text-to-speech Synthesisers with Rapid Adaptation in an End-to-end Framework
    Prakash, Anusha
    Murthy, Hema A.
    INTERSPEECH 2020, 2020, : 2962 - 2966
  • [6] Study on Cantonese text-to-speech system
    Long, Qinghua
    Jing, Huisheng
    Ren, Ping
    Situ, Xikang
    Shengxue Xuebao/Acta Acustica, 1993, 18 (02): : 143 - 147
  • [7] A Preliminary Study on Wav2Vec 2.0 Embeddings for Text-to-Speech
    Lim, Yohan
    Kim, Namhyeong
    Yun, Seung
    Kim, Hun
    Lee, Seung-Ik
    12TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE (ICTC 2021): BEYOND THE PANDEMIC ERA WITH ICT CONVERGENCE INNOVATION, 2021, : 343 - 347
  • [8] HOW TO MAKE TEXT-TO-SPEECH SYSTEM PRONOUNCE "VOLDEMORT": AN EXPERIMENTAL APPROACH OF FOREIGN WORD PHONEMIZATION IN VIETNAMESE
    Dang-Khoa Mac
    Van-Huy Nguyen
    Dinh-Nghi Nguyen
    Kim-Anh Nguyen
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6483 - 6487
  • [9] TEXT-TO-SPEECH SYNTHESIS
    SPROAT, RW
    OLIVE, JP
    AT&T TECHNICAL JOURNAL, 1995, 74 (02): : 35 - 44
  • [10] The Art of Text-to-Speech
    Lindquist, Benjamin
    CRITICAL INQUIRY, 2024, 50 (02) : 225 - 251