Phonological feature-based speech recognition system for pronunciation training in non-native language learning

Cited by: 18
Authors
Arora, Vipul [1 ]
Lahiri, Aditi [1 ]
Reetz, Henning [2 ]
Affiliations
[1] Univ Oxford, Fac Linguist Philol & Phonet, Oxford, England
[2] Goethe Univ, Frankfurt, Germany
Source
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA | 2018 / Vol. 143 / Issue 01
Funding
European Research Council;
Keywords
MISPRONUNCIATION DETECTION; ACOUSTIC INVARIANCE; STOP CONSONANTS; VISUAL FEEDBACK; ARTICULATION; DIAGNOSIS; FRAMEWORK; MODELS; PLACE;
DOI
10.1121/1.5017834
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
The authors address the question of whether phonological features can be used effectively in an automatic speech recognition (ASR) system for pronunciation training in non-native language (L2) learning. Computer-aided pronunciation training consists of two essential tasks: detecting mispronunciations and providing corrective feedback, usually either on the basis of full words or phonemes. Phonemes, however, can be further disassembled into phonological features, which in turn define groups of phonemes. A phonological feature-based ASR system allows the authors to perform a sub-phonemic analysis at feature level, providing more effective feedback to reach the acoustic goal and perceptual constancy. Furthermore, phonological features provide a structured way for analysing the types of errors a learner makes, and can readily convey which pronunciations need improvement. This paper presents the authors' implementation of such an ASR system using deep neural networks as an acoustic model, and its use for detecting mispronunciations, analysing errors, and rendering corrective feedback. Quantitative as well as qualitative evaluations are carried out for German and Italian learners of English. In addition to achieving high accuracy of mispronunciation detection, the system also provides accurate diagnosis of errors. © 2018 Acoustical Society of America.
Pages: 98 - 108
Number of pages: 11
Related papers
50 in total
  • [21] A feature-based hierarchical speech recognition system for Hindi
    K Samudravijaya
    R Ahuja
    N Bondale
    T Jose
    S Krishnan
    P Poddar
    PVS Rao
    R Raveendran
    Sadhana, 1998, 23 : 313 - 340
  • [22] Feature-based hierarchical speech recognition system for Hindi
    Samudravijaya, K.
    Ahuja, R.
    Bondale, N.
    Jose, T.
    Krishnan, S.
    Poddar, P.
    Rao, P.V.S.
    Raveendran, R.
    Sadhana - Academy Proceedings in Engineering Sciences, 1998, 23 (pt 4): 313 - 340
  • [23] A feature-based hierarchical speech recognition system for Hindi
    Samudravijaya, K
    Ahuja, R
    Bondale, N
    Jose, T
    Krishnan, S
    Poddar, P
    Rao, PVS
    Raveendran, R
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 1998, 23 (4): 313 - 340
  • [24] AMERICAN SIGN LANGUAGE FINGERSPELLING RECOGNITION WITH PHONOLOGICAL FEATURE-BASED TANDEM MODELS
    Kim, Taehwan
    Livescu, Karen
    Shakhnarovich, Gregory
    2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012), 2012, : 119 - 124
  • [25] Disentangling the Contribution of Non-native Speech in Automated Pronunciation Assessment
    Shi, Shuju
    Fu, Kaiqi
    Gu, Yiwei
    Tian, Xiaohai
    Gao, Shaojun
    Li, Wei
    Ma, Zejun
    INTERSPEECH 2023, 2023, : 954 - 958
  • [26] Perceptual Learning for Native and Non-native Speech
    Baese-Berk, Melissa
    CURRENT TOPICS IN LANGUAGE, 2018, 68 : 1 - 29
  • [27] ON THE USE OF FEATURE-SPACE MLLR ADAPTATION FOR NON-NATIVE SPEECH RECOGNITION
    Oh, Yoo Rhee
    Kim, Hong Kook
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4314 - 4317
  • [28] Feature-based approach to speech recognition
    Zhang, Li
    Edmondson, William
    2002, World Scientific and Engineering Academy and Society
  • [29] The impact of non-native English speakers' phonological and prosodic features on automatic speech recognition accuracy
    Emara, Ingy Farouk
    Shaker, Nabil Hamdy
    SPEECH COMMUNICATION, 2024, 157
  • [30] Non-native speech recognition sentences: A new materials set for non-native speech perception research
    Stringer, Louise
    Iverson, Paul
    BEHAVIOR RESEARCH METHODS, 2020, 52 (02) : 561 - 571