Phonological feature-based speech recognition system for pronunciation training in non-native language learning

被引:18
|
作者
Arora, Vipul [1 ]
Lahiri, Aditi [1 ]
Reetz, Henning [2 ]
机构
[1] Univ Oxford, Fac Linguist Philol & Phonet, Oxford, England
[2] Goethe Univ, Frankfurt, Germany
来源
基金
欧洲研究理事会;
关键词
MISPRONUNCIATION DETECTION; ACOUSTIC INVARIANCE; STOP CONSONANTS; VISUAL FEEDBACK; ARTICULATION; DIAGNOSIS; FRAMEWORK; MODELS; PLACE;
D O I
10.1121/1.5017834
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The authors address the question whether phonological features can be used effectively in an automatic speech recognition (ASR) system for pronunciation training in non-native language (L2) learning. Computer-aided pronunciation training consists of two essential tasks-detecting mispronunciations and providing corrective feedback, usually either on the basis of full words or phonemes. Phonemes, however, can be further disassembled into phonological features, which in turn define groups of phonemes. A phonological feature-based ASR system allows the authors to perform a sub-phonemic analysis at feature level, providing a more effective feedback to reach the acoustic goal and perceptual constancy. Furthermore, phonological features provide a structured way for analysing the types of errors a learner makes, and can readily convey which pronunciations need improvement. This paper presents the authors implementation of such an ASR system using deep neural networks as an acoustic model, and its use for detecting mispronunciations, analysing errors, and rendering corrective feedback. Quantitative as well as qualitative evaluations are carried out for German and Italian learners of English. In addition to achieving high accuracy of mispronunciation detection, the system also provides accurate diagnosis of errors. (C) 2018 Acoustical Society of America.
引用
收藏
页码:98 / 108
页数:11
相关论文
共 50 条
  • [41] Phonetic accommodation in non-native directed speech supports L2 word learning and pronunciation
    Piazza, Giorgio
    Kalashnikova, Marina
    Martin, Clara D.
    SCIENTIFIC REPORTS, 2023, 13 (01):
  • [42] Phonetic accommodation in non-native directed speech supports L2 word learning and pronunciation
    Giorgio Piazza
    Marina Kalashnikova
    Clara D. Martin
    Scientific Reports, 13 (1)
  • [43] How Noise and Language Proficiency Influence Speech Recognition by Individual Non-Native Listeners
    Zhang, Jin
    Xie, Lingli
    Li, Yongjun
    Chatterjee, Monita
    Ding, Nai
    PLOS ONE, 2014, 9 (11):
  • [44] Non-native Speech Learning in Older Adults
    Ingvalson, Erin M.
    Nowicki, Casandra
    Zong, Audrey
    Wong, Patrick C. M.
    FRONTIERS IN PSYCHOLOGY, 2017, 8
  • [45] Acoustic model interpolation for non-native speech recognition
    Tan, Tien-Ping
    Besacier, Laurent
    2007 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol IV, Pts 1-3, 2007, : 1009 - 1012
  • [46] Multilingual acoustic models for the recognition of non-native speech
    Fischer, V
    Janke, E
    Kunzmann, S
    Ross, T
    ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 331 - 334
  • [47] Accent neutralization for speech recognition of non-native speakers
    Radzikowski, Kacper
    Forc, Mateusz
    Wang, Le
    Yoshie, Osamu
    Nowak, Robert
    IIWAS2019: THE 21ST INTERNATIONAL CONFERENCE ON INFORMATION INTEGRATION AND WEB-BASED APPLICATIONS & SERVICES, 2019, : 136 - 141
  • [48] Investigating automatic recognition of non-native arabic speech
    Selouani, Sid-Ahmed
    Alotaibi, Yousef Ajami
    2007 INNOVATIONS IN INFORMATION TECHNOLOGIES, VOLS 1 AND 2, 2007, : 204 - +
  • [49] Development of Non-Native Speech Relying on Language Isomorphism
    Tsalikova, Madina
    NEW PERSPECTIVES IN SCIENCE EDUCATION, 8TH EDITION, 2019, : 74 - 78
  • [50] Audio style transfer for non-native speech recognition
    Radzikowski, Kacper
    PHOTONICS APPLICATIONS IN ASTRONOMY, COMMUNICATIONS, INDUSTRY, AND HIGH-ENERGY PHYSICS EXPERIMENTS 2018, 2018, 10808