Improved pronunciation prediction accuracy using morphology

被引:0
|
作者
Sharma, Dravyansh [1 ]
Sahai, Saumya Yashmohini [1 ]
Chaudhari, Neha [1 ]
Bruguier, Antoine [1 ]
机构
[1] Google LLC, Mountain View, CA 94043 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pronunciation lexicons and prediction models are a key component in several speech synthesis and recognition systems. We know that morphologically related words typically follow a fixed pattern of pronunciation which can be described by language-specific paradigms. In this work we explore how deep recurrent neural networks can be used to automatically learn and exploit this pattern to improve the pronunciation prediction quality of words related by morphological inflection. We propose two novel approaches for supplying morphological information, using the word's morphological class and its lemma, which are typically annotated in standard lexicons. We report improvements across a number of European languages with varying degrees of phonological and morphological complexity, and two language families, with greater improvements for languages where the pronunciation prediction task is inherently more challenging. We also observe that combining bidirectional LSTM networks with attention mechanisms is an effective neural approach for the computational problem considered, across languages. Our approach seems particularly beneficial in the low resource setting, both by itself and in conjunction with transfer learning.
引用
收藏
页码:222 / 228
页数:7
相关论文
共 50 条
  • [1] IMPROVED AND ROBUST PREDICTION OF PRONUNCIATION DISTANCE FOR INDIVIDUAL-BASIS CLUSTERING OF WORLD ENGLISHES PRONUNCIATION
    Kasahara, S.
    Kitahara, S.
    Minematsu, N.
    Shen, H. -P.
    Makino, T.
    Saito, D.
    Hirose, K.
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [2] Improved accuracy of embryo scoring using morphokinetic compared with strict morphology
    Adolfsson, E.
    Andershed, A. Nowosad
    HUMAN REPRODUCTION, 2015, 30 : 333 - 333
  • [3] Using multilingual units for improved modeling of pronunciation variants
    Bartkova, K.
    Jouvet, D.
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 5895 - 5898
  • [4] PREDICTORS OF PRONUNCIATION ACCURACY - A REEXAMINATION
    PURCELL, ET
    SUTER, RW
    LANGUAGE LEARNING, 1980, 30 (02) : 271 - 287
  • [5] Improved Accuracy of PSO and DE using Normalization: an Application to Stock Price Prediction
    Kaur, Savinderjit
    Mangat, Veenu
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2012, 3 (09) : 197 - 205
  • [6] Improved latency and accuracy for neural branch prediction
    Jiménez, DA
    ACM TRANSACTIONS ON COMPUTER SYSTEMS, 2005, 23 (02): : 197 - 218
  • [7] Improved pronunciation modelling by inverse word frequency and pronunciation entropy
    Tsai, MY
    Chou, FC
    Lee, LS
    ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 53 - 56
  • [8] Improved prediction accuracy for disease risk mapping using Gaussian process stacked generalization
    Bhatt, Samir
    Cameron, Ewan
    Flaxman, Seth R.
    Weiss, Daniel J.
    Smith, David L.
    Gething, Peter W.
    JOURNAL OF THE ROYAL SOCIETY INTERFACE, 2017, 14 (134)
  • [9] Predicting sulfotyrosine sites using the random forest algorithm with significantly improved prediction accuracy
    Zheng Rong Yang
    BMC Bioinformatics, 10
  • [10] Predicting sulfotyrosine sites using the random forest algorithm with significantly improved prediction accuracy
    Yang, Zheng Rong
    BMC BIOINFORMATICS, 2009, 10 : 361