Weakly-supervised word-level pronunciation error detection in non-native English speech

被引:1
|
作者
Korzekwa, Daniel [1 ,2 ]
Lorenzo-Trueba, Jaime [3 ]
Drugman, Thomas [3 ]
Calamaro, Shira [3 ]
Kostek, Bozena [2 ]
机构
[1] Amazon, Warsaw, Poland
[2] Gdansk Univ Technol, Fac ETI, Gdansk, Poland
[3] Amazon, London, England
来源
关键词
automated pronunciation assessment; speech processing; second-language learning; deep learning;
D O I
10.21437/Interspeech.2021-38
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
We propose a weakly-supervised model for word-level mispronunciation detection in non-native (L2) English speech. To train this model, phonetically transcribed L2 speech is not required and we only need to mark mispronounced words. The lack of phonetic transcriptions for L2 speech means that the model has to learn only from a weak signal of word-level mispronunciations. Because of that and due to the limited amount of mispronounced L2 speech, the model is more likely to overfit. To limit this risk, we train it in a multi-task setup. In the first task, we estimate the probabilities of word-level mispronunciation. For the second task, we use a phoneme recognizer trained on phonetically transcribed L1 speech that is easily accessible and can be automatically annotated. Compared to state-of-the-art approaches, we improve the accuracy of detecting word-level pronunciation errors in AUC metric by 30% on the GUT Isle Corpus of L2 Polish speakers, and by 21.5% on the Isle Corpus of L2 German and Italian speakers.
引用
收藏
页码:4408 / 4412
页数:5
相关论文
共 50 条
  • [1] Automatic Detection of Word-Level Reading Errors in Non-native English Speech Based on ASR Output
    Qin, Ying
    Qian, Yao
    Loukina, Anastassia
    Lange, Patrick
    Misra, Abhinav
    Evanini, Keelan
    Lee, Tan
    2021 12TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2021,
  • [2] Automatic pronunciation error detection in non-native speech: The case of vowel errors in Dutch
    van Doremalen, Joost
    Cucchiarini, Catia
    Strik, Helmer
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2013, 134 (02): : 1336 - 1347
  • [3] Automatic detection of accent and lexical pronunciation errors in spontaneous non-native English speech
    Kyriakopoulos, Konstantinos
    Knill, Kate M.
    Gales, Mark J. E.
    INTERSPEECH 2020, 2020, : 3052 - 3056
  • [4] An approach for Correcting the Word-level Mispronunciations for non-native English-speaking Indian Children
    Kasture, Neha
    Jain, Pooja
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (06) : 10799 - 10813
  • [5] Automatic Speech Recognition and Pronunciation Error Detection of Dutch Non-native Speech: cumulating speech resources in a pluricentric language
    Wei, X.
    Cucchiarini, C.
    van Hout, R.
    Strik, H.
    SPEECH COMMUNICATION, 2022, 144 : 1 - 9
  • [6] Pronunciation accuracy and intelligibility of non-native speech
    Loukina, Anastassia
    Lopez, Melissa
    Evanini, Keelan
    Suenderinann-Oeft, David
    Ivanov, Alexei V.
    Zechner, Klaus
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1917 - 1921
  • [7] Detection of Typical Pronunciation Errors in Non-native English Speech Using Convolutional Recurrent Neural Networks
    Diment, Aleksandr
    Fagerlund, Eemi
    Benfield, Adrian
    Virtanen, Tuomas
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [8] Word durations in non-native English
    Baker, Rachel E.
    Baese-Berk, Melissa
    Bonnasse-Gahot, Laurent
    Kim, Midam
    Van Engen, Kristin J.
    Bradlow, Ann R.
    JOURNAL OF PHONETICS, 2011, 39 (01) : 1 - 17
  • [9] Improving Pronunciation Modeling for Non-Native Speech Recognition
    Tan, Tien-Ping
    Besacier, Laurent
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1801 - 1804
  • [10] PRIDE AND PREJUDICE? Judging non-native pronunciation of English
    Koet, Ton
    van den Bergh, Huub
    L1 EDUCATIONAL STUDIES IN LANGUAGE AND LITERATURE, 2018, 18 : 1 - 13